VoxSafeBench: Not Just What Is Said, but Who, How, and Where
Summary
VoxSafeBench is a new benchmark designed to evaluate speech language models (SLMs) for social alignment across safety, fairness, and privacy in multi-user environments. Unlike existing benchmarks that focus on basic audio comprehension or isolated risks, VoxSafeBench considers how speaker identity, paralinguistic cues, and environmental context can render otherwise benign requests unsafe or privacy-violating. It employs a Two-Tier design: Tier1 assesses content-centric risks with matched text and audio, while Tier2 evaluates audio-conditioned risks where the transcript is benign but the response depends on acoustic context. Across 22 bilingual tasks, the benchmark reveals that SLMs often fail to apply social norms when decisive cues are speech-grounded, showing degraded safety awareness, eroded fairness with vocal demographic differences, and faltering privacy protections. Code and data are publicly available.
Key takeaway
For research scientists developing or deploying SLMs in shared environments, you must move beyond text-centric evaluations. Your models likely have a "speech grounding gap," failing to act appropriately on critical safety, fairness, and privacy cues conveyed acoustically. Integrate benchmarks like VoxSafeBench to rigorously test how your SLMs handle speaker identity, paralinguistic cues, and environmental context to prevent unsafe or privacy-violating responses.
Key insights
SLMs struggle with social alignment when critical safety, fairness, or privacy cues are conveyed acoustically.
Principles
- Acoustic context can transform benign requests into unsafe ones.
- Text-based safeguards degrade when cues are speech-grounded.
Method
VoxSafeBench uses a Two-Tier design: Tier1 for content-centric risks (text/audio match) and Tier2 for audio-conditioned risks (benign transcript, acoustic cues).
In practice
- Evaluate SLMs for speaker- and scene-conditioned risks.
- Test privacy protections with acoustic contextual cues.
Topics
- Speech Language Models
- VoxSafeBench
- Social Alignment
- Audio-Conditioned Risks
- Speech Grounding Gap
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.