New Trends for Modern Machine Translation with Large Reasoning Models
Abstract
Large Reasoning Models leveraging Chain-of-Thought reasoning have redefined Machine Translation by transforming it into a dynamic reasoning task that emphasizes contextual coherence, cultural intent, and self-reflection, demonstrating significant improvements in various translation scenarios.
Recent advances in Large Reasoning Models (LRMs), particularly those leveraging Chain-of-Thought reasoning (CoT), have opened brand new possibility for Machine Translation (MT). This position paper argues that LRMs substantially transformed traditional neural MT as well as LLMs-based MT paradigms by reframing translation as a dynamic reasoning task that requires contextual, cultural, and linguistic understanding and reasoning. We identify three foundational shifts: 1) contextual coherence, where LRMs resolve ambiguities and preserve discourse structure through explicit reasoning over cross-sentence and complex context or even lack of context; 2) cultural intentionality, enabling models to adapt outputs by inferring speaker intent, audience expectations, and socio-linguistic norms; 3) self-reflection, LRMs can perform self-reflection during the inference time to correct the potential errors in translation especially extremely noisy cases, showing better robustness compared to simply mapping X->Y translation. We explore various scenarios in translation including stylized translation, document-level translation and multimodal translation by showcasing empirical examples that demonstrate the superiority of LRMs in translation. We also identify several interesting phenomenons for LRMs for MT including auto-pivot translation as well as the critical challenges such as over-localisation in translation and inference efficiency. In conclusion, we think that LRMs redefine translation systems not merely as text converters but as multilingual cognitive agents capable of reasoning about meaning beyond the text. This paradigm shift reminds us to think of problems in translation beyond traditional translation scenarios in a much broader context with LRMs - what we can achieve on top of it.
Community
Recent advances in Large Reasoning Models (LRMs) with Chain-of-Thought (CoT) capabilities are transforming machine translation. This paper argues that LRMs reframe translation as a dynamic reasoning task requiring contextual, cultural, and linguistic understanding. Three key shifts are identified: contextual coherence, cultural intentionality, and self-reflection. We explore various translation scenarios, showcasing LRMs' superiority, and discusses phenomena like auto-pivot translation. Challenges such as over-localization and inference efficiency are also addressed. We think that LRMs redefine translation systems as multilingual cognitive agents capable of reasoning beyond text, opening new possibilities in a broader context.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Evaluating o1-Like LLMs: Unlocking Reasoning for Translation through Comprehensive Analysis (2025)
- R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning (2025)
- AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought (2025)
- Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation (2025)
- Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems (2025)
- Lost in Literalism: How Supervised Training Shapes Translationese in LLMs (2025)
- Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
 You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: 
@librarian-bot
	 recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
 
					 
						
