QCon Previews 20th Anniversary Conferences: Production AI, Resilience, and Staff+ Engineering
Summary
QCon is celebrating its 20th anniversary with its 2026 conferences, QCon London (March 16–19) and QCon San Francisco (November 16–20), focusing on practical, production-ready insights from senior engineers. Key themes include the transition of AI from experimental LLMs to reliable "agentic systems," addressing non-determinism, observability, and security in production AI. The conferences will also emphasize "survivability" in complex distributed systems, advocating for cellular isolation and fault containment to limit failure "blast radius" rather than preventing all failures. Additionally, tracks will cover the human element of Staff+ Engineering, focusing on durable skills like technical judgment, and evaluating Platform Engineering ROI based on measurable outcomes like developer throughput and operational costs. Both events feature numerous peer-selected speakers.
Key takeaway
For AI Engineers moving models into production, you should prioritize understanding and mitigating non-determinism, observability, and security challenges in agentic systems. Focus on rigorous testing and validation to ensure probabilistic models perform reliably in live traffic, rather than solely on initial experimental success. Consider attending QCon London or San Francisco to gain practical insights on these specific production hurdles.
Key insights
QCon's 20th anniversary conferences focus on practical production challenges in AI, system resilience, and senior engineering roles.
Principles
- Prioritize survivability over simple uptime.
- Limit failure "blast radius" through fault containment.
- Evaluate platforms by measurable outcomes, not just adoption.
Method
The QCon approach involves curating sessions by senior engineers that focus on real-world successes and failures in production, moving beyond hype to discuss rigorous testing and validation for probabilistic models.
In practice
- Integrate models, tools, and workflows for agentic AI systems.
- Implement cellular isolation for distributed system resilience.
- Measure platform ROI via developer throughput and operational costs.
Topics
- Production AI
- Agentic Systems
- Distributed System Resilience
- Staff+ Engineering
- Platform Engineering ROI
Best for: Software Engineer, AI Engineer, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by InfoQ.