Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Natural Language Processing · Depth: Advanced, quick

Summary

A study investigated the trustworthiness and bias of AI models in detecting cognitive decline among healthy multilingual English speakers in the UK. Researchers recruited monolingual participants nationally and multilingual speakers from community centers in Sheffield and Bradford, who spoke Somali, Chinese, or South Asian languages and had distinct Yorkshire accents. While Automatic Speech Recognition (ASR) systems showed no significant bias, classification and regression models utilizing acoustic and linguistic features demonstrated bias against multilingual speakers, particularly in memory, fluency, and reading tasks. This bias was exacerbated when models were trained on the DementiaBank dataset, leading to a higher rate of misclassification of healthy multilingual individuals as cognitively impaired. The study found that current AI models are not yet reliable for diagnostic use in these diverse populations.

Key takeaway

For research scientists developing AI tools for cognitive assessment, you should prioritize bias mitigation strategies, especially when working with diverse linguistic and accent groups. Your models must be rigorously tested on multilingual populations, including those with non-native English accents, to prevent misclassification and ensure diagnostic reliability. Do not rely solely on publicly available datasets like DementiaBank without thorough bias analysis and mitigation.

Key insights

AI models for cognitive decline detection exhibit bias against multilingual English speakers, particularly those with specific accents.

Principles

Method

The study recruited monolingual and multilingual UK speakers, dividing multilinguals by language and accent, then evaluated ASR, classification, and regression models for bias using acoustic and linguistic features.

In practice

Topics

Best for: Research Scientist, AI Researcher, AI Scientist, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.