seq2seq

#seq2seq

Learning Complementary Action Modeling from Automotive Maintenance Instructions

arXiv cs.CL ↗ · 2026-06-29 Cached

This paper introduces Complementary Action Modeling (CAM), a task that identifies or generates procedural counterparts of automotive maintenance instructions by modifying the action phrase while preserving context. Using a German automotive dataset, the authors examine candidate matching and controlled Seq2Seq generation to model these complementary instructions.

0 favorites 0 likes

#seq2seq

MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]

Reddit r/MachineLearning ↗ · 2026-06-27

MathFormer is a small seq2seq model that achieves ~98.6% accuracy on symbolic math tasks, suggesting that mathematical reasoning in LLMs may be large-scale structured pattern completion rather than true reasoning.

0 favorites 0 likes

#seq2seq

Reference-Free Reinforcement Learning Fine-Tuning for MT: A Seq2Seq Perspective

arXiv cs.CL ↗ · 2026-05-18 Cached

This paper applies Group Relative Policy Optimization (GRPO) to encoder-decoder Seq2Seq models for machine translation fine-tuning, using reference-free rewards (LaBSE and COMET-Kiwi) that require no parallel data, and achieves consistent improvements across 13 languages.

0 favorites 0 likes

seq2seq

Learning Complementary Action Modeling from Automotive Maintenance Instructions

MathFormer: Testing whether symbolic math is pattern matching or reasoning [D]

Reference-Free Reinforcement Learning Fine-Tuning for MT: A Seq2Seq Perspective

Submit Feedback