llm-server

The premium Open Source alternative to SaaS

Visit Website
0.2k
Stars
MITLicense

What is llm-server?

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

Tech Stack
GoUncategorized

Why llm-server?

    Limitations

      6/12/2026
      Last Update
      11
      Forks
      0
      Issues
      MIT
      License
      Financial Leak Detected

      Stop the "SaaS Tax"

      Your team could be burning cash. Switching to llm-server instantly boosts your runway.

      Competitor Cost
      -$1,440
      / year (est. based on SaaS)
      Self-Hosted
      $0
      / year
      Team Size10 Users
      150+
      SAVE 100%

      Community Discussion

      Comments