Analysis of Machine Translators on Sentences Generated by Portuguese Image Captioning Models

· Source: Paper Index on ACL Anthology · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

This study evaluates the performance of image captioning models trained on datasets translated into Portuguese by various automatic machine translators. The research addresses the prevalent focus on English in computer vision and natural language processing, necessitating adaptation for other languages like Portuguese. Researchers employed a translate-train approach, translating training datasets into Portuguese using different translators. The evaluation was conducted both quantitatively, assessing cost, training time, and test set metrics, and qualitatively, through comparative evaluation forms and error analysis. The findings demonstrate that valid automatic descriptions in Portuguese can be generated by image captioning models trained on translated datasets, with more robust translators yielding more meaningful descriptions.

Key takeaway

For research scientists developing image captioning models for non-English languages, you should prioritize the quality of your machine translation tool when using a translate-train approach. Investing in more robust translators will directly improve the meaningfulness and validity of the generated descriptions, potentially reducing post-processing and error correction efforts.

Key insights

Machine translation quality directly impacts image captioning model performance when adapting to new languages.

Principles

Method

Image captioning models were trained on Portuguese datasets translated by different machine translators, then evaluated quantitatively (cost, time, metrics) and qualitatively (human review, error analysis).

In practice

Topics

Best for: Research Scientist, AI Scientist, NLP Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Paper Index on ACL Anthology.