ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging

Hugging Face Daily Papers 05/12/26, 12:00 AM Papers

Summary

ORBIT is a method for mitigating catastrophic forgetting in large language models fine-tuned for generative retrieval. It tracks the distance between the fine-tuned and original parameters and applies weight averaging when that distance grows too large, and it outperforms common continual learning baselines.

Despite the rapid advancements in large language model (LLM) development, fine-tuning them for specific tasks often results in the catastrophic forgetting of their general, language-based reasoning abilities. This work investigates and addresses this challenge in the context of the Generative Retrieval (GenRetrieval) task. During GenRetrieval fine-tuning, we find this forgetting occurs rapidly and correlates with the distance between the fine-tuned and original model parameters. Given these observations, we propose ORBIT, a novel approach that actively tracks the distance between fine-tuned and initial model weights, and uses a weight averaging strategy to constrain model drift during GenRetrieval fine-tuning when this inter-model distance exceeds a maximum threshold. Our results show that ORBIT retains substantial text and retrieval performance by outperforming both common continual learning baselines and related regularization methods that also employ weight averaging.
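The mechanism described above (track the distance to the initial weights, and average back toward them once a threshold is exceeded) can be illustrated with a small sketch. The abstract does not specify the distance metric, the averaging coefficient, or how often the check runs, so everything concrete here is an assumption made for illustration: the Euclidean distance, the coefficient `alpha`, and the hypothetical helper names `l2_distance`, `merge_with_origin`, and `regulate_drift`. It is not the authors' implementation.

```python
import math


def l2_distance(params, origin):
    """Euclidean distance between two parameter dicts (name -> list of floats).

    Assumption: the source only says "distance"; L2 is chosen for illustration.
    """
    total = 0.0
    for name, values in params.items():
        for p, o in zip(values, origin[name]):
            total += (p - o) ** 2
    return math.sqrt(total)


def merge_with_origin(params, origin, alpha):
    """Weight averaging: alpha * origin + (1 - alpha) * current weights."""
    return {
        name: [alpha * o + (1.0 - alpha) * p for p, o in zip(values, origin[name])]
        for name, values in params.items()
    }


def regulate_drift(params, origin, max_distance, alpha=0.5):
    """Merge back toward the origin only when drift exceeds max_distance.

    Returns (possibly merged params, whether a merge happened).
    alpha=0.5 is an illustrative default, not a value from the paper.
    """
    if l2_distance(params, origin) > max_distance:
        return merge_with_origin(params, origin, alpha), True
    return params, False


# Toy example: the "fine-tuned" weights sit at distance 5.0 from the origin.
origin = {"w": [0.0, 0.0]}
drifted = {"w": [3.0, 4.0]}
merged, did_merge = regulate_drift(drifted, origin, max_distance=2.0)
print(did_merge, merged["w"], l2_distance(merged, origin))
```

In this toy setup a single averaging step with `alpha=0.5` halves the distance to the origin but does not by itself guarantee the result falls below the threshold; in a training loop the check would presumably be repeated at intervals between optimizer steps.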

Original Article

Cached at: 05/13/26, 04:13 PM

Source: https://huggingface.co/papers/2605.12419

Abstract

ORBIT addresses catastrophic forgetting in large language model fine-tuning for generative retrieval by tracking parameter distances and employing weight averaging to maintain model performance.

Despite the rapid advancements in large language model (LLM) development, fine-tuning them for specific tasks often results in the catastrophic forgetting of their general, language-based reasoning abilities. This work investigates and addresses this challenge in the context of the Generative Retrieval (GenRetrieval) task. During GenRetrieval fine-tuning, we find this forgetting occurs rapidly and correlates with the distance between the fine-tuned and original model parameters. Given these observations, we propose ORBIT, a novel approach that actively tracks the distance between fine-tuned and initial model weights, and uses a weight averaging strategy to constrain model drift during GenRetrieval fine-tuning when this inter-model distance exceeds a maximum threshold. Our results show that ORBIT retains substantial text and retrieval performance by outperforming both common continual learning baselines and related regularization methods that also employ weight averaging.

Similar Articles

The Attribution Blind Spot: Detecting When Language Models Rely on Memory Rather Than Retrieved Context

The Cost of Context: Mitigating Textual Bias in Multimodal Retrieval-Augmented Generation

Parameter Alignment Mitigates Catastrophic Forgetting in Multilingual Expert Language Models

Attribution-Guided Continual Learning for Large Language Models

Micro-Macro Retrieval: Reducing Long-Form Hallucination in Large Language Models
