Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?
Summary
Large language models can improve translation for low-resource languages through structured linguistic reasoning traces, with the most significant benefits occurring during inference rather than training.
View Cached Full Text
Cached at: 06/09/26, 12:41 PM
Paper page - Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?
Source: https://huggingface.co/papers/2606.03782
Abstract
Large language models can improve translation for low-resource languages through structured linguistic reasoning traces, with the most significant benefits occurring during inference rather than training.
Large language models(LLMs) offer a promising approach tomachine translation(MT) for extremelylow-resource languagesby incorporating linguistic resources throughin-context learning. However, LLMs often struggle to apply grammatical information effectively during translation. Inspired by recent progress inchain-of-thought reasoning, we investigate whether low-resource MT can benefit from structured intermediate steps of linguistic analysis and grammatical reasoning. We propose a pipeline for automatically generating step-by-steplinguistic reasoning tracesfromUniversal Dependencies treebanks, dictionaries, and grammar-rule banks. We evaluate these traces in three settings:in-context learning(ICL),supervised fine-tuning(SFT), andreinforcement fine-tuning(RFT), on Xibe and Chintang as test cases. Our results show thatlinguistic reasoning tracesare most effective as inference-time guidance: in ICL, reliable sentence-specific traces substantially improve translation performance across most models, languages, and metrics. In contrast, using thelinguistic reasoning tracesas training data yields smaller and less consistent gains, as models learn the trace format but often generate erroneous content. These findings suggest that LLMs can leverage grammatical information for low-resource MT when given reliable linguistic analyses, while learning to generate such analyses remains a major bottleneck.
View arXiv pageView PDFAdd to collection
Get this paper in your agent:
hf papers read 2606\.03782
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2606.03782 in a model README.md to link it from this page.
Datasets citing this paper1
#### OLAResearchX/LingReason Viewer• Updated5 days ago • 4.58k • 33
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2606.03782 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners
This paper investigates multilingual latent reasoning in large reasoning models across 11 languages, revealing that while latent reasoning capabilities exist, they are unevenly distributed—stronger in resource-rich languages and weaker in low-resource ones. The study finds that despite surface-level differences, the internal reasoning mechanisms are largely aligned with an English-centered pathway.
Enhanced and Efficient Reasoning in Large Learning Models
This paper proposes a method for improving reasoning in large language models by recoding data to explicitly represent relationships, enabling efficient principled reasoning with polynomial-time learnability for relational rules, which addresses hallucinations and supports sound reasoning across multiple calls.
ReasoningFlow: Discourse Structures for Understanding LLM Reasoning Traces
Introduces ReasoningFlow, a framework to capture discourse structures of large language model reasoning traces as directed acyclic graphs, enabling fine-grained analysis of reasoning behaviors like self-reflection and backtracking. Based on manual and automatic annotation of thousands of traces, it reveals structural similarities across models and that most erroneous steps do not contribute to final answers.
Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch
This paper introduces a data-efficient fine-tuning framework for teaching reasoning models to code-switch (mix languages) effectively, demonstrating that strategic code-switching can improve reasoning capabilities for lower-resource languages. The work analyzes code-switching behaviors in large language models across diverse languages, tasks, and domains, then develops interventions to promote beneficial code-switching patterns.
Decoding the Critique Mechanism in Large Reasoning Models
This paper investigates how large reasoning models can detect and correct their own errors internally, identifying a highly interpretable critique vector that enhances error detection without additional training, improving test-time scaling performance.