Software for Automatic Speech Recognition via Whisper models applied to Oral History interviews in the Portuguese language

· Source: Paper Index on ACL Anthology · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Advanced, quick

Summary

Ethos AT is a new desktop software designed for automatic speech transcription, utilizing OpenAI Whisper models for local processing to ensure data privacy and accessibility. Developed for non-programming experts, such as oral history researchers, it facilitates transcription of Portuguese language interviews. A comparative analysis was performed on six Whisper models: small, medium, large, large-v2, large-v3, and turbo. The study found that larger models offered higher lexical accuracy, while smaller models provided faster execution with acceptable quality. The turbo model achieved an effective balance between accuracy and speed, making Ethos AT a secure, efficient, and user-friendly solution for academic and institutional contexts.

Key takeaway

For oral history researchers or academic institutions needing secure, efficient transcription of Portuguese audio, Ethos AT offers a user-friendly desktop solution. You should consider the Whisper turbo model for its effective balance of accuracy and speed, or larger models like large-v3 if maximum lexical accuracy is paramount, despite longer processing times. This tool ensures data privacy through local processing, a critical factor for sensitive research.

Key insights

Ethos AT provides private, accessible automatic speech transcription for Portuguese using Whisper models.

Principles

Method

Six Whisper models (small, medium, large, large-v2, large-v3, turbo) were compared for transcription accuracy, error types, and processing time on Portuguese oral history interviews.

In practice

Topics

Best for: AI Engineer, NLP Engineer, Research Scientist, Domain Expert, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Paper Index on ACL Anthology.