Skip to content

Milestones

List view

  • OpenGateLLM roadmap targets the official **Release 1.0.0** by April 15, 2026. The primary objective of this version is to transition the project from a beta phase to an industrially robust API gateway for self-hosted LLMs, emphasizing cost control, data sovereignty, and privacy. The core pillars of this robustness-focused release include: * **Architectural Integrity:** A major initiative is underway to refactor the codebase toward a **"Clean Architecture"** (#618). This involves decoupling business logic from infrastructure, validating all requests via **Pydantic schemas** (#642), and optimizing data access with **SQL query improvements and Redis caching** (#652). * **Advanced Traffic Orchestration:** To ensure reliability under high loads, the release introduces: * A **Priority System** (#620) to manage incoming API requests and prevent resource saturation by "noisy neighbors". * **Sticky Sessions** (#621) to maintain conversation context on specific backend nodes, thereby optimizing GPU KV-cache usage. * **QoS-based Load Balancing** (#622) that dynamically routes traffic based on real-time performance metrics like Time-To-First-Token (TTFT) and throughput. * **Infrastructure and Scaling:** The milestone addresses critical data bottlenecks by fixing **Elasticsearch (ES) scaling** issues (#643) and improving document retrieval performance through optimized sorting (#647). * **Security and Governance:** Security is bolstered by making the **initial user creation process** more secure (#714) and refactoring administrative endpoints for stricter role management (#683). * **Operational Excellence:** Comprehensive documentation (#470) is being developed using Docusaurus, covering production recommendations, carbon monitoring details, and auto-generated error code references to ensure predictable system behavior. In summary, Release 1.0.0 aims to establish OpenGateLLM as a production-ready, sovereign alternative to commercial AI gateways by fortifying its technical foundation and management capabilities.

    Due by April 15, 2026
    2/13 issues closed