Use of Machine Learning techniques and Large Language Models for automatic evaluation of Celpe-Bras exam texts

· Source: Paper Index on ACL Anthology · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Intermediate, quick

Summary

A study mapped and compared methods for the automatic evaluation of texts produced for the Celpe-Bras exam, Brazil's official proficiency test in Portuguese as an Additional Language. This exam requires participants to write four texts based on multimedia prompts, leading to a high volume of texts for teachers to correct and limited accessible didactic resources for students. Researchers investigated various models, including traditional machine learning algorithms and pre-trained language models like BERT, BART, and T5. The findings indicated that adaptations of the BERT model achieved the best evaluation results, though these improvements came with a considerably higher computational cost compared to other tested models.

Key takeaway

For NLP Engineers developing automated assessment tools for language proficiency exams, consider fine-tuning BERT-based models for superior accuracy in text evaluation. However, be prepared for the increased computational resources required, and evaluate if the performance gains justify the higher operational costs for your specific deployment environment and user base.

Key insights

BERT adaptations achieved the best automatic evaluation for Celpe-Bras texts, but at a higher computational cost.

Principles

Method

The study mapped and compared traditional machine learning algorithms and pre-trained language models (BERT, BART, T5) for automatic text evaluation in the Celpe-Bras exam context.

In practice

Topics

Best for: NLP Engineer, AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Paper Index on ACL Anthology.