Detecting Stuttering with Artificial Intelligence: A Hybrid Method for Brazilian Portuguese

· Source: Paper Index on ACL Anthology · Field: Health & Wellbeing — Clinical Care & Medical Practice, Medical Devices & Health Technology · Depth: Advanced, quick

Summary

A new software system has been developed to automatically detect and classify stuttering-related disfluencies in Brazilian Portuguese, addressing the traditional manual, subjective, and time-consuming nature of speech-language assessment. The system utilizes a two-stage hybrid approach: initially, deterministic algorithms based on automatic speech recognition (ASR) and temporal data identify simple disfluencies like repetitions and pauses. Subsequently, a hierarchical architecture, combining a Kohonen network (Self-Organizing Map, SOM) and a Multilayer Perceptron (MLP), classifies complex disfluencies such as blocks and prolongations using acoustic features. An initial dataset, annotated by specialists, was created due to the lack of public resources for this task in Brazilian Portuguese. The system achieved an 89.5% accuracy in classifying complex disfluencies, with a Matthews Correlation Coefficient (MCC) of 0.812.

Key takeaway

For speech-language pathologists assessing stuttering in Brazilian Portuguese, this AI-driven system offers a promising tool to enhance objectivity and efficiency. Its 89.5% accuracy in complex disfluency classification suggests it can significantly support clinical decision-making, reducing manual effort and standardizing assessments. Consider integrating such automated tools to streamline diagnostic processes and establish baselines for patient progress.

Key insights

A hybrid AI system automates stuttering detection and classification in Brazilian Portuguese with high accuracy.

Principles

Method

The method involves a two-stage hybrid approach: ASR and temporal analysis for simple disfluencies, followed by a hierarchical SOM-MLP architecture using acoustic features for complex disfluency classification.

In practice

Topics

Best for: NLP Engineer, AI Scientist, Research Scientist, AI Engineer, Domain Expert

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Paper Index on ACL Anthology.