How to create “humble” AI

· Source: MIT News - Artificial intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Emerging Technologies & Innovation · Depth: Intermediate, medium

Summary

An MIT-led international research team has developed a framework for designing "humble" AI systems for medical diagnosis, aiming to prevent overconfident and potentially incorrect decisions. Published on March 24, 2026, in *BMJ Health and Care Informatics*, this approach encourages AI to reveal diagnostic uncertainty and prompt users for additional information. The framework incorporates computational modules, including an "Epistemic Virtue Score" developed by the University of Melbourne, which allows AI models to evaluate their own certainty. If confidence exceeds available evidence, the system can flag the mismatch, request specific tests, or recommend specialist consultation. This initiative is part of a broader effort by MIT Critical Data to create more inclusive AI systems, addressing biases from training data primarily sourced from the United States and the limitations of electronic health records.

Key takeaway

For AI scientists developing diagnostic tools, you should prioritize integrating humility and uncertainty awareness into your models. This framework provides a method to program AI to self-assess confidence and prompt for more data or human consultation when uncertain, thereby reducing the risk of overconfident errors and fostering more collaborative human-AI partnerships in clinical settings. Consider how your training data may introduce biases and actively seek diverse viewpoints.

Key insights

Humble AI systems in medicine enhance collaboration by revealing uncertainty and prompting further information gathering.

Principles

Method

The framework integrates computational modules, like the Epistemic Virtue Score, enabling AI to self-evaluate diagnostic certainty and tailor responses by requesting more data or recommending specialist input when confidence is low.

In practice

Topics

Best for: AI Scientist, Research Scientist, AI Engineer, AI Researcher, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by MIT News - Artificial intelligence.