Why Your AI UX Is Broken (and It's Not the Model's Fault) — Mike Christensen, Ably

· Source: AI Engineer · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, long

Summary

The default pattern for AI chat applications, direct HTTP streaming using Server-Sent Events (SSE), fundamentally limits the quality and richness of user experiences. This approach couples the response stream to a single client connection, making it fragile to disconnections, unable to provide continuity across multiple devices or tabs, and lacking bidirectional control for live interaction with agents. Ably, a real-time messaging platform, identifies three foundational capabilities for superior AI experiences: resilient delivery, continuity across surfaces, and live control. To overcome the limitations of direct HTTP streaming, many engineering teams are adopting "durable sessions," a shared, persistent, and stateful resource that decouples the agent and client layers. This pattern enables resumable streams, multi-client interaction, and concurrent activity in multi-agent architectures, simplifying complex plumbing and enhancing user experience. Ably AI Transport is introduced as an SDK built on Ably channels to facilitate this durable session pattern.

Key takeaway

For CTOs and VPs of Engineering building AI-powered products, relying solely on direct HTTP streaming for chat applications will constrain product quality and user experience. You should evaluate adopting a "durable sessions" architecture to enable resilient delivery, seamless continuity across devices, and live agent control. This shift will simplify complex engineering challenges, reduce development overhead for features like stream resumability and multi-agent coordination, and ultimately allow your teams to focus on building richer, more interactive AI experiences.

Key insights

Direct HTTP streaming limits AI chat experiences; durable sessions enable resilient, multi-device, and interactive agent communication.

Principles

Method

Implement durable sessions as a shared, persistent, and stateful medium between agent and client layers, leveraging pub/sub channels to manage event streams, enable resumability, and support multi-client/multi-agent interactions.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, MLOps Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AI Engineer.