Bias in the machine (edited)
Summary
This episode, "Bias in the Machine," from "The Dark Side of AI" mini-series, explores the pervasive issue of bias in both historical medical practices and contemporary AI systems. It highlights how medical studies historically excluded women, leading to misdiagnoses like those for cardiovascular disease in female patients. The discussion then transitions to three types of bias in AI: stochastic drift, model parameter bias, and training sample bias, emphasizing the latter as the most dangerous. The episode critically examines ImageNet, a 14-million-photo database used for computer vision, revealing how its crowdsourced labels, originating from WordNet, embedded gendered, racialized, and ableist prejudices. The "ImageNet Roulette" project demonstrated these biases by misclassifying individuals. Furthermore, it discusses BERT, Google's NLP system, which exhibits sexist biases by associating professions with males, propagating inequality through search engines and HR algorithms. The hosts stress that AI systems offer interpretations, not objective reality, due to inherent human biases in data creation and model design.
Key takeaway
For AI engineers and data scientists building or deploying models, you must rigorously question the provenance and composition of your training datasets. Recognize that AI systems, like ImageNet and BERT, inherit and amplify human biases from their data sources, leading to discriminatory outcomes in areas like job applications or judicial decisions. Prioritize diverse teams to foster cognitive diversity, which directly translates into more robust and equitable AI products, and never take model predictions at face value.
Key insights
Bias in AI systems stems from historical human prejudices embedded in training data and model design, leading to real-world harm.
Principles
- Bias warps understanding of reality.
- Classification systems reflect the classifier's worldview.
- Images do not describe themselves.
Method
The "bikini approach" in medicine focused only on reproductive organs when studying women, assuming other differences from men were negligible, leading to widespread misdiagnosis.
In practice
- Question model predictions critically.
- Scrutinize training sample origins and creators.
- Recognize AI models interpret, not faithfully depict, reality.
Topics
- Algorithmic Bias
- Training Data Bias
- ImageNet Dataset
- BERT Model
- AI Ethics
Best for: NLP Engineer, Computer Vision Engineer, CTO, AI Engineer, Machine Learning Engineer, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Data Science at Home Podcast.