llm-server
0.2k Stars
MIT License
What is llm-server?
llm-server is an auto-tuned launcher for GGUF models on llama.cpp and ik_llama.cpp. It runs an OpenAI-compatible server and handles multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. It is positioned as an Ollama alternative for multi-GPU rigs.
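Because the server is described as OpenAI-compatible, a client can in principle talk to it with a standard chat completions request. The Go sketch below only builds such a request and prints its method and URL. The base URL `http://localhost:8080`, the model name `local-model`, and the helper `newChatRequest` are assumptions for illustration, not values taken from llm-server; check the launcher's own output for the real address and model identifier.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// chatMessage and chatRequest mirror the basic OpenAI chat completions
// request shape (a model name plus a list of role/content messages).
type chatMessage struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type chatRequest struct {
	Model    string        `json:"model"`
	Messages []chatMessage `json:"messages"`
}

// newChatRequest is a hypothetical helper. It builds, but does not send,
// a POST to <baseURL>/v1/chat/completions with a single user message.
func newChatRequest(baseURL, model, prompt string) (*http.Request, error) {
	body, err := json.Marshal(chatRequest{
		Model:    model,
		Messages: []chatMessage{{Role: "user", Content: prompt}},
	})
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost, baseURL+"/v1/chat/completions", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	// Assumed address and model name; replace with the values your server reports.
	req, err := newChatRequest("http://localhost:8080", "local-model", "Hello")
	if err != nil {
		panic(err)
	}
	fmt.Println(req.Method, req.URL.String())
	// To actually send the request: resp, err := http.DefaultClient.Do(req)
}
```

Keeping request construction separate from sending makes the payload easy to inspect before pointing it at a running server.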
Tech Stack
Go
Uncategorized
Why llm-server?
Limitations
Last Update: 6/12/2026
Forks: 11
Issues: 0
License: MIT