An Industry Benchmark for Data Fairness: Sony’s Alice Xiang

· Source: MIT Sloan Management Review · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Cybersecurity & Data Privacy · Depth: Intermediate, extended

Summary

Sony's Global Head of AI Governance, Alice Xiang, discusses the company's pioneering work in responsible AI ethics and governance, including the development of Phoebe (Fair Human-Centric Image Benchmark). Established in 2018, Sony's AI ethics guidelines have evolved into a comprehensive governance framework. Phoebe, recently published in Nature, addresses the critical gap in ethically sourced, diverse datasets for evaluating bias in human-centric computer vision models. Unlike traditional web-scraped datasets, Phoebe emphasizes appropriate consent, compensation, and global diversity, using self-reported demographic information and extensive environmental annotations. This benchmark helps practitioners diagnose and mitigate biases that can lead to inconveniences or severe consequences like financial fraud or wrongful arrests. The initiative aims to set new industry standards for responsible data collection and empower modelers to make technical improvements and implement non-technical mitigations.

Key takeaway

For AI scientists and engineers developing human-centric computer vision models, you should prioritize using ethically sourced and diverse datasets like Phoebe for bias evaluation. This approach not only helps diagnose specific failure modes related to demographics and environmental factors but also enables more effective mitigation strategies, ranging from data collection improvements and model optimization to practical real-world system adjustments, ultimately leading to more trustworthy and responsible AI deployments.

Key insights

Ethically sourced, diverse datasets are crucial for effectively measuring and mitigating bias in AI models.

Principles

Method

Phoebe uses self-reported demographics and extensive annotations (environment, physical attributes, cameras) to create a globally diverse, ethically sourced dataset for granular bias diagnosis in human-centric computer vision.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, AI Engineer, AI Ethicist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by MIT Sloan Management Review.