Three Design Decisions in LLM Observatory and Why I Made Them
Summary
The "LLM Observatory" platform employs three distinct design decisions to balance functionality, cost, and developer experience. First, it uses a dual authentication model: JWTs for human operators requiring role-based access and session management, and service API keys for machine-to-machine integrations needing scoped, revocable credentials. Second, it prioritizes heuristic evaluation before LLM-as-judge, enabling dependency-free demonstrations, robust schema validation for evaluation versioning, and explicit cost control. Third, the platform adopts a SQLite-first, PostgreSQL-forward storage strategy, facilitating rapid local development and schema validation while providing a clear, documented path to a production-grade PostgreSQL deployment. These choices introduce some complexity and trade-offs in precision or local-production parity, but ensure each component is fit for its specific purpose.
Key takeaway
For AI Architects or MLOps Engineers designing internal LLM platforms, consider adopting a dual authentication model to cleanly separate human and service access requirements. Prioritize heuristic evaluation for initial platform demonstrability and cost control, deferring more expensive LLM-as-judge methods. Additionally, starting with SQLite for local development before transitioning to PostgreSQL for production can significantly accelerate schema validation and developer onboarding, even with documented local-production gaps.
Key insights
Dual authentication, heuristic-first evaluation, and SQLite-first storage optimize LLM platform development and operation.
Principles
- Match authentication to caller type.
- Control evaluation cost and dependencies.
- Prioritize local development velocity.
Method
Implement JWTs for human operators and API keys for services. Build heuristic evaluation first for demonstrability and cost control. Start with SQLite for local development, then migrate to PostgreSQL for production.
In practice
- Use `POST /api/v1/auth/register` for human JWTs.
- Use `POST /api/v1/api-keys` for service API keys.
- Validate schema with Alembic against SQLite first.
Topics
- LLM Observatory
- Authentication Design
- Evaluation Sequencing
- Heuristic Evaluation
- LLM-as-Judge
Best for: MLOps Engineer, AI Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Naturallanguageprocessing on Medium.