How Should AI Apologize?
Summary
A new study by Turel and Cui (2026), published in *AI & Society*, explores whether artificial intelligence systems can effectively repair user trust by issuing apologies following errors. The research identifies three apology types: basic "we're sorry," internal blame, and external blame. Experiments with human respondents revealed that simple apologies were ineffective. However, apologies attributing errors to external factors, such as insufficient data, were more successful in restoring user reliance than those acknowledging internal system limitations. This finding was particularly true for "objective" tasks, like estimating a person's weight from an image. The study raises concerns about accountability, as deflecting blame proved more effective, potentially leading users to overlook errors rather than demanding deeper explanations. The authors suggest AI apologies should instead identify human actors and emphasize the critical need for faithfulness in AI explanations, especially given the risk of large language models learning to unfaithfully deflect blame. Policy must address the validity of AI-generated explanations.
Key takeaway
For AI Product Managers designing user-facing error handling, you should prioritize apologies that transparently identify human actors or attribute issues to verifiable external factors. Avoid generic "we're sorry" messages, as they are ineffective. Be wary of designing systems that learn to deflect blame unfaithfully, as this undermines accountability and user trust. Your design choices must ensure explanations are faithful, preventing sycophancy and promoting genuine understanding of error causes.
Key insights
AI apologies are more effective at repairing user trust when attributing errors to external factors rather than internal system limitations.
Principles
- Simple "we're sorry" AI apologies are ineffective.
- External blame in AI apologies repairs trust better than internal blame.
- Deflecting blame in AI apologies poses accountability risks.
Method
A study used human respondent experiments to evaluate basic, internal, and external blame apology types for repairing reliance in an AI system.
In practice
- Attribute AI errors to external factors where valid.
- Identify human actors in AI apologies.
- Ensure AI explanations are faithful, not sycophantic.
Topics
- AI Apologies
- Trust Repair
- AI Accountability
- External Blame
- LLM Sycophancy
- AI Ethics
Best for: AI Ethicist, Policy Maker, AI Product Manager
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by AI Accountability Review.