lora-merging

Tag

Cards List
#lora-merging

From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging

arXiv cs.AI · 5d ago Cached

This paper introduces Preference Delta Aggregation (PDA) and Geometric Alignment Merging (GAM) to aggregate multiple 'weak' preference signals from weaker model pairs via LoRA merging, improving strong LLMs on knowledge reasoning and agentic search tasks by over 6% on average.

0 favorites 0 likes
← Back to home

Submit Feedback