A Framework for Measuring Appropriate Reliance on Set-Valued AI Advice

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

A new formal framework has been developed to measure appropriate reliance on set-valued AI advice, addressing a gap in existing research that primarily focuses on point predictions. This framework operates within the sequential judge-advisor paradigm and applies to both classification and regression tasks. For classification, it introduces dimensions for evaluating set-valued AI advice and defines two metrics: correct reliance rate on AI and correct reliance rate on self, which jointly characterize appropriate reliance. For regression, the framework introduces quantity of AI reliance and quality of AI reliance, measuring whether a decision maker utilized the AI advice and if reliance improved accuracy relative to their initial estimate. This framework demonstrates how its metrics capture important nuances in human-AI collaboration that existing measures overlook.

Key takeaway

For research scientists evaluating human-AI collaboration, particularly when AI provides set-valued advice, you should recognize that traditional point-prediction metrics are insufficient. This new framework offers a robust method to measure appropriate reliance, distinguishing between classification and regression tasks with specific metrics like correct reliance rates and reliance quality. Incorporating this framework will enable a more nuanced and accurate assessment of how humans interact with and utilize uncertain AI recommendations.

Key insights

This framework is the first to formally measure appropriate reliance on set-valued AI advice in human-AI collaboration.

Principles

In practice

Topics

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.