
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure


Abstract

Multilingual generation with large language models (LLMs) is often of poor quality for mid- to low-resource languages, but the causes for this are not well-understood. We first demonstrate the existence of an implicit task-solving → translation pipeline for generation, whereby the model first solves the required task in a largely target-language-agnostic manner, and subsequently translates answer concepts into the intended target language. We hypothesize that the failure of the translation stage, despite task-solving success, is an important culprit for the observed low quality of final outputs, and formalize this as the translation barrier hypothesis. We quantify the extent to which either stage in the pipeline is responsible for final failure for a word translation task across 108 language pairs, and find that the translation barrier explains a dominant portion of error for a majority of language pairs, and is especially severe for low-resource target languages. Our results highlight an important bottleneck for end-to-end multilingual generation, relevant for future work seeking to improve multilinguality in LLMs.
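The stage-wise attribution described in the abstract can be illustrated with a minimal sketch. It assumes each evaluated item already carries two boolean judgments: whether the task was solved in some language-agnostic form, and whether the final target-language output was correct. The abstract does not say how the authors obtain these judgments, so the `Example` record, its field names, and the `translation_barrier_share` function are hypothetical, and the sample data is invented purely for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Example:
    """One evaluated item. All fields are hypothetical, for illustration only."""
    source_lang: str
    target_lang: str
    task_solved: bool     # assumed judgment: the answer concept was found
    output_correct: bool  # assumed judgment: the final output was correct


def translation_barrier_share(examples):
    """Per language pair, return the fraction of failed outputs in which
    the task was solved but the final output was still wrong."""
    failures = defaultdict(int)
    translation_failures = defaultdict(int)
    for ex in examples:
        if ex.output_correct:
            continue  # only failed outputs are attributed to a stage
        pair = (ex.source_lang, ex.target_lang)
        failures[pair] += 1
        if ex.task_solved:
            # Task-solving succeeded, so the failure falls on translation.
            translation_failures[pair] += 1
    return {pair: translation_failures[pair] / failures[pair] for pair in failures}


# Invented toy data, not results from the paper.
examples = [
    Example("en", "sw", task_solved=True, output_correct=False),
    Example("en", "sw", task_solved=True, output_correct=False),
    Example("en", "sw", task_solved=False, output_correct=False),
    Example("en", "sw", task_solved=True, output_correct=True),
    Example("en", "fr", task_solved=False, output_correct=False),
    Example("en", "fr", task_solved=True, output_correct=True),
]
shares = translation_barrier_share(examples)
```

In this toy data, two of the three failed English-to-Swahili items had a solved task, so the translation stage accounts for two thirds of that pair's failures. Under these assumptions, a pair whose share is above one half would be one where the translation barrier explains most of the error.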

Authors (7)

Niyati Bafna

Tianjian Li

Kenton Murray

David R. Mortensen

David Yarowsky

Hale Sirin

+1 more

Submitted

June 28, 2025

arXiv Category

cs.CL


Key Contributions

Proposes the 'Translation Barrier Hypothesis': poor multilingual generation in LLMs stems from an implicit task-solving → translation pipeline in which the translation stage fails even when task-solving succeeds. Quantifies the contribution of each stage on a word translation task across 108 language pairs, finding that the translation barrier explains a dominant portion of error for a majority of pairs and is especially severe for low-resource target languages.

Business Value

Helps developers and researchers identify key bottlenecks in multilingual AI systems, leading to more effective strategies for improving global language support and reducing development costs.

Paper Metadata

Innovation Type

Theoretical Framework and Analysis

Deployment Feasibility

High, as it's an analytical framework.

Limitations Addressed

The poorly understood reasons behind the low quality of multilingual generation from LLMs, particularly for low-resource languages.

Performance Gains

Provides a framework to understand and diagnose performance issues, guiding future improvements.

Technical Tags

multilingual generation, large language models (LLMs), translation barrier, implicit translation, low-resource languages, task-solving pipeline, language generation, cross-lingual performance, error analysis

Research Topics

Multilingual NLP, Large Language Models, Machine Translation, Natural Language Generation, Cross-Lingual Transfer

Methods & Architectures

Translation Barrier Hypothesis, task-solving → translation pipeline analysis, quantification of stage-wise error, word translation task evaluation, Large Language Models (LLMs)

Applications & Tasks

Natural Language Generation, Machine Translation, Poor quality multilingual generation for mid- to low-resource languages, Underlying causes of multilingual generation failure, Implicit translation issues in LLMs, Explaining poor multilingual generation performance, Quantifying the contribution of translation failure, Improving multilingual generation quality

Related Fields

Natural Language Processing, Machine Translation, Computational Linguistics, Artificial Intelligence

Keywords

multilingual generation, LLM, translation barrier, low-resource languages, NLP, machine translation, cross-lingual, error analysis, hypothesis, generation quality

Academic Context

#Multilingual NLP, #Large Language Models, #Machine Translation, #Natural Language Generation, #Cross-Lingual Transfer

Commercial Potential

Potential Products

Diagnostic tools for multilingual NLP performance, Frameworks for improving cross-lingual generation, Training methodologies focused on translation robustness

Target Industries

Technology, Global Communications, Software Development, AI Services

Use Case Examples

Identifying why an LLM struggles to generate text in Swahili, Developing better strategies for training multilingual models, Improving the quality of automated content translation

Competitive Edge

Offers a novel theoretical explanation ('Translation Barrier Hypothesis') for multilingual generation failures, providing a more specific and actionable diagnostic framework than general performance metrics.

Market Opportunity

Large, as improving multilingual AI is a key goal for global technology.

Revenue Models

Consulting, integration into AI development platforms, research publications.

Resource Requirements

Compute Needs

Minimal, primarily for analysis and experimentation.

Data Requirements

Requires data for evaluating multilingual generation and translation tasks across various language pairs.

Scalability

The hypothesis is applicable to any multilingual LLM.

Production Readiness

Maturity Level

Theoretical Framework

Time to Market

N/A (theoretical contribution).

Patent Potential

Low, as it's a theoretical contribution.
