MLLP-VRAIN UPV system for the IWSLT 2026 Simultaneous Speech Translation task

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

The MLLP-VRAIN research group submitted its system for the IWSLT 2026 Simultaneous Speech Translation (SimulST) task, utilizing the recently released Parakeet and Qwen 3.5 models. This cascaded solution addresses long-form SimulST through adaptive "black-box" policies, with explorations into policy relaxations to optimize quality-latency trade-offs. The system participates in all language directions, and for En→{De, It, Zh}, it includes a new context track. This context track integrates ASR word-boosting and a Retrieval-Augmented Generation (RAG) mechanism, leveraging offline pre-translated exemplars to guide generation and enrich domain-specific context. A detailed latency analysis is also provided. The system demonstrated a substantial quality improvement of +5.82 XCOMET-XL on the MCIF En→De test set compared to the previous year, with context processing adding another +1.03 improvement.

Key takeaway

For NLP Engineers developing simultaneous speech translation systems, consider integrating adaptive "black-box" policies with models like Parakeet and Qwen 3.5. Your team should explore policy relaxations to fine-tune quality-latency trade-offs. Additionally, for domain-specific contexts, implement ASR word-boosting alongside a RAG mechanism using pre-translated exemplars to significantly enhance performance, as demonstrated by the +5.82 XCOMET-XL improvement.

Key insights

The system combines Parakeet and Qwen 3.5 with adaptive policies and RAG for improved SimulST quality and latency.

Principles

Method

A cascaded SimulST solution uses Parakeet and Qwen 3.5, applying adaptive "black-box" policies. The context track employs ASR word-boosting and RAG with pre-translated exemplars.

In practice

Topics

Best for: Research Scientist, AI Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.