ChatGPT and other AI bots made huge errors before Scottish election, study finds

· Source: AI (artificial intelligence) | The Guardian · Field: Government & Public Sector — Public Policy & Governance, Regulatory & Compliance, Artificial Intelligence & Machine Learning · Depth: Fundamental Awareness, short

Summary

A study by the thinktank Demos, commissioned by the Electoral Commission, found that AI chatbots such as ChatGPT and Google Gemini made significant errors when providing election information before the recent Scottish election. The investigation, which posed 75 questions to five free AI tools, found misinformation in 34% of responses. Specific inaccuracies included inventing fictitious scandals, giving incorrect election dates, misstating voter ID requirements, and placing candidates in the wrong contests. An accompanying poll indicated that 20% of British voters, equivalent to 10 million people, used AI tools for election information. Replika performed worst, with errors in 56% of its responses, followed by ChatGPT at 46% and Google Gemini at 22%. Grok had the lowest error rate at 9% but provided poor external links. The Electoral Commission is urging ministers to introduce new legal controls and clearer duties for AI platforms to combat misinformation, while Demos advocates making AI companies liable under UK law and mandating accuracy safeguards.

Key takeaway

For policymakers developing AI regulation, this study underscores the urgent need for specific legal controls over AI chatbots in electoral contexts. You should prioritize legislation that establishes clear duties for AI platforms to prevent misinformation, mandates accuracy safeguards for election-related content, and ensures companies are liable under electoral law. This proactive approach is critical to protecting democratic processes from the rapid spread of false information.

Key insights

AI chatbots frequently generate election misinformation, necessitating urgent regulatory intervention to protect democratic processes.

Method

Demos simulated voter queries by putting 75 questions about three real-life constituencies to five free AI tools (including ChatGPT, Google Gemini, Replika and Grok), then assessed the accuracy of the answers and the evidence offered in support of them.

Best for: CTO, VP of Engineering/Data, Director of AI/ML, Policy Maker, AI Ethicist, Tech Journalist

Editorial summary, takeaway, and curation by AIssential. Original article published by AI (artificial intelligence) | The Guardian.