How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765

· Source: The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, FinTech & Digital Financial Services · Depth: Advanced, extended

Summary

Capital One has successfully designed, deployed, and scaled multi-agent systems within its highly regulated environment, as detailed by Rashmi Shetty, Senior Director of Enterprise Generative AI Platform. Their flagship application, Chat Concierge, is a multi-agent chat experience for auto dealerships that streamlines car buying by handling intent disambiguation, tool invocation, and human handoffs. This system uses self-reflection and layered reasoning with live API checks to assist with tasks like scheduling test drives and financing. Capital One employs a platform-centric approach that separates design from runtime governance, embedding policies, guardrails, and cyber controls across agent threat boundaries. The company emphasizes a "risk-first" strategy, leveraging its robust model risk framework and existing cloud-native infrastructure to accelerate safe and scalable agent development and deployment, focusing on closed-loop learning from production telemetry and model specialization through fine-tuning and distillation.

Key takeaway

For AI Engineers and MLOps teams building agentic systems in regulated industries, prioritize a platform-centric, "risk-first" approach. Your strategy should integrate robust governance, cyber controls, and model risk frameworks directly into the platform to enable rapid, safe deployment. Focus on end-to-end observability and closed-loop learning from production to continuously refine agent behavior and ensure compliance, rather than solely relying on design-time evaluations.

Key insights

Capital One deploys multi-agent AI systems in a regulated environment using a platform-centric, risk-first approach.

Principles

Method

Capital One's method involves breaking complex goals into specific agent tasks, implementing policy-bound operations via a platform, and using closed-loop learning from production telemetry for continuous improvement.

In practice

Topics

Best for: AI Engineer, MLOps Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence).