Can Assurance Help Build AI Systems That We Can Trust?
Summary
The AI Standards Hub Global Summit in Glasgow, co-hosted by Partnership on AI and The Alan Turing Institute, British Standards Institution, and UK's National Physical Laboratory (NPL), convened experts to discuss building robust AI assurance infrastructure. Key themes emerged, including the necessity for assurance to extend beyond initial deployment to include continuous post-deployment monitoring, a service currently underutilized. The summit also highlighted low demand for independent assurance services, attributed to unclear regulatory expectations and limited awareness, with 46% of 76 respondents favoring legislation to increase demand. Furthermore, the discussion emphasized that frontier AI models, with their high-stakes risks like CBRN, require state-of-the-art, adaptable evaluation standards, such as process standards. Finally, the rapid adoption of agentic AI systems is outpacing formal standardization, underscoring the need for voluntary frameworks and accelerated development of assurance infrastructure.
Key takeaway
For CTOs and VPs of Engineering deploying AI systems, prioritizing continuous post-deployment monitoring and engaging independent assurers is crucial for mitigating risks and building calibrated trust. Your teams should advocate for and implement adaptable process standards, especially for frontier models, and leverage voluntary frameworks to bridge gaps where formal ISO standards lag, ensuring systems are safe and accountable before widespread public adoption.
Key insights
Comprehensive AI assurance infrastructure, including post-deployment monitoring and adaptable standards, is critical for safe and trustworthy AI.
Principles
- Assurance must extend beyond deployment.
- Standards must be adaptable for frontier AI.
- Voluntary frameworks fill standardization gaps.
Method
The summit identified policy levers to increase demand for external AI assurance, including legislation (46% support) and greater transparency via use case registers and incident reporting (41% support).
In practice
- Implement real-time failure detection for AI agents.
- Explore policy levers to boost external assurance demand.
- Develop process standards for adaptable AI evaluations.
Topics
- AI Assurance
- AI Standards
- Frontier AI Models
- Agentic AI Systems
- Post-Deployment Monitoring
Best for: CTO, VP of Engineering/Data, Executive, Policy Maker, AI Ethicist, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Partnership on AI.