Use of Machine Learning techniques and Large Language Models for automatic evaluation of Celpe-Bras exam texts
Summary
A study mapped and compared methods for the automatic evaluation of texts produced for the Celpe-Bras exam, Brazil's official proficiency test in Portuguese as an Additional Language. This exam requires participants to write four texts based on multimedia prompts, leading to a high volume of texts for teachers to correct and limited accessible didactic resources for students. Researchers investigated various models, including traditional machine learning algorithms and pre-trained language models like BERT, BART, and T5. The findings indicated that adaptations of the BERT model achieved the best evaluation results, though these improvements came with a considerably higher computational cost compared to other tested models.
Key takeaway
For NLP Engineers developing automated assessment tools for language proficiency exams, consider fine-tuning BERT-based models for superior accuracy in text evaluation. However, be prepared for the increased computational resources required, and evaluate if the performance gains justify the higher operational costs for your specific deployment environment and user base.
Key insights
BERT adaptations achieved the best automatic evaluation for Celpe-Bras texts, but at a higher computational cost.
Principles
- LLMs can automate language proficiency assessment.
- Model performance often correlates with computational expense.
Method
The study mapped and compared traditional machine learning algorithms and pre-trained language models (BERT, BART, T5) for automatic text evaluation in the Celpe-Bras exam context.
In practice
- Consider BERT for high-accuracy text evaluation.
- Weigh computational cost against performance gains.
Topics
- Celpe-Bras
- Automatic Text Evaluation
- Large Language Models
- BERT
- Natural Language Processing
Best for: NLP Engineer, AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Paper Index on ACL Anthology.