lora-merging

#lora-merging

From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging

arXiv cs.AI ↗ · 5d ago Cached

This paper introduces Preference Delta Aggregation (PDA) and Geometric Alignment Merging (GAM) to aggregate multiple 'weak' preference signals from weaker model pairs via LoRA merging, improving strong LLMs on knowledge reasoning and agentic search tasks by over 6% on average.

0 favorites 0 likes

lora-merging

From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging

Submit Feedback