OpenGateLLM documentation
Welcome to the OpenGateLLM official documentation! This comprehensive guide will walk you through everything you need to know to run, configure, and extend OpenGateLLM.
The API is still under beta version, major breaking changes may occur.
What is OpenGateLLM?β
OpenGateLLM is an open-source, production-ready API gateway, optimized for self-hosted models. It's designed to centralize, secure, and manage Generative AI access in a sovereign and cost-effective way.
OpenGateLLM addresses three critical challenges for organizations:
- Cost control - Reduce expenses of commercial APIs and GPU infrastructure by using self-hosted models and build a mutualized infrastructure with your peers.
- Data sovereignty - Keep sensitive data under your control
- Privacy & security - No chat history storage, robust access control
Core principles
- Open source and free forever - All features available without commercial licensing
- High code quality - Built with maintainability and reliability in mind
- Lightweight architecture - Focused feature set for optimal performance
- High compatibility - Seamlessly integrates with GenAI ecosystem frameworks by OpenAI-compatible API
- Production-ready - Engineered to handle high loads with advanced QoS features

OpenGateLLM is an alternative to...
| Key features | OpenGateLLM | LiteLLM | TensorZero |
|---|---|---|---|
| π OpenAI Compatibility | β | β | β |
| π Open-source | β | β | β |
| π» Self-hostable | β | β | β |
| πΈ Free (all features) | β | β | β |
| π Support commercial and self-hosted models | β | β | β |
| Account management | |||
| π² Playground UI | β | β | β |
| π€ User management (API keys, budget...) | β | β | β |
| π’ Organization management | π§ | β | β |
| βοΈ Project management | π | β | β |
| π SSO support | π§ | β | β |
| High load features | |||
| β Rate limiting | β | β | β |
| β‘ Requests prioritization | β | β | β |
| π Quality of service thresholds | β | β | β |
| π Model load balancing | β | β | β |
| π Model fallback | π | β | β |
| Monitoring & analytics | |||
| π Usage tracking | β | β | β |
| π Carbon footprint | β | β | β |
| π Prometheus integration | β | β | β |
| Privacy & security | |||
| π« No chat history storage | β | β | β |
| π Role-based access control | β | β | β |
Legend: β supported β β not supported β π§ work in progress β π in roadmap
Quickstartβ
Get started with OpenGateLLM in minutes with our quickstart guide here.
Community & supportβ
OpenGateLLM is developed by Etalab since July 2024, the French government's open data and AI task force, with the support of CentraleSupΓ©lec.
This project exists thanks to all the people who contribute. OpenGateLLM thrives on open-source contributions. Join our community! Whether you're fixing bugs, adding features, or improving documentation, we welcome your help. Check out our Contributing Guide to get started.
Thanks to all contributors β€οΈ
Roadmapβ
OpenGateLLM is still under beta version, major breaking changes may occur. Check our current roadmap here to see what we are working on.
Licenseβ
OpenGateLLM is open-source software licensed under the MIT License.