peft

#peft

When Top-1 Fails: Calibrating LoRA Monitors for Masked Diffusion LMs

arXiv cs.LG · 2d ago

This paper evaluates top-1 collapse rate as a stability monitor for short-horizon LoRA fine-tuning of discrete diffusion language models and finds that it has zero precision. It proposes max gradient norm as a more reliable alternative, with higher precision and F1 score on LLaDA-family models.
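
As a rough illustration of what a max-gradient-norm monitor could look like, the sketch below flags a run as unstable when its largest per-step gradient norm exceeds a threshold, then scores the flags against known outcomes. The function names, the threshold value, and the toy numbers are all assumptions made for illustration; none of them come from the paper.

```python
from typing import Sequence, Tuple


def flag_run(step_norms: Sequence[float], threshold: float) -> bool:
    """Flag a run as unstable if its max per-step gradient norm exceeds threshold."""
    return max(step_norms) > threshold


def precision_recall_f1(
    flags: Sequence[bool], labels: Sequence[bool]
) -> Tuple[float, float, float]:
    """Score monitor flags against ground-truth instability labels."""
    tp = sum(1 for f, y in zip(flags, labels) if f and y)
    fp = sum(1 for f, y in zip(flags, labels) if f and not y)
    fn = sum(1 for f, y in zip(flags, labels) if not f and y)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy gradient-norm traces (illustrative values, not measured data).
runs = [
    [0.8, 1.1, 0.9],
    [1.0, 9.5, 2.0],
    [0.7, 0.6, 12.0],
    [1.2, 1.3, 1.1],
]
labels = [False, True, True, True]  # hypothetical "run diverged" labels
THRESHOLD = 5.0                     # hypothetical cutoff

flags = [flag_run(r, THRESHOLD) for r in runs]
precision, recall, f1 = precision_recall_f1(flags, labels)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

A monitor that never fires on a run that actually diverges has no true positives, which is how a precision of zero arises in `precision_recall_f1`.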

ARIADNE: Agnostic Routing for Inference-time Adapter DyNamic sElection

arXiv cs.AI · 2026-06-18

Proposes ARIADNE, a training-free, adapter-agnostic routing framework that selects the optimal PEFT adapter at inference time by measuring input proximity to adapter-specific centroids in embedding space, recovering 97.44% of upper-bound performance on 23 tasks.
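
The centroid-based routing idea can be sketched in a few lines. This is a minimal nearest-centroid router under stated assumptions, not ARIADNE's actual implementation: the use of cosine similarity, the adapter names, and the two-dimensional toy embeddings are all hypothetical.

```python
import math
from typing import Dict, List, Sequence


def centroid(vectors: Sequence[Sequence[float]]) -> List[float]:
    """Mean of a set of equal-length embedding vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def route(query: Sequence[float], centroids: Dict[str, List[float]]) -> str:
    """Pick the adapter whose centroid is most similar to the query embedding."""
    return max(centroids, key=lambda name: cosine_similarity(query, centroids[name]))


# Hypothetical adapters, each summarized by the centroid of a few task embeddings.
centroids = {
    "math_adapter": centroid([[1.0, 0.0], [0.9, 0.1]]),
    "code_adapter": centroid([[0.0, 1.0], [0.1, 0.9]]),
}

print(route([0.8, 0.2], centroids))  # selects "math_adapter"
```

Because the router only compares embeddings, no adapter weights are touched during selection, which is consistent with the training-free, adapter-agnostic framing in the summary.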

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Hugging Face Blog · 2026-06-18

Explores whether LoRA is the best parameter-efficient fine-tuning technique and introduces the PEFT library's tools to compare methods.

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

arXiv cs.LG · 2026-06-08

This paper benchmarks sub-1B models on mathematical reasoning tasks and finds that full fine-tuning actively harms performance in models under 300M parameters, while parameter-efficient fine-tuning (PEFT) methods such as LoRA and DoRA remain stable. The authors recommend defaulting to PEFT for all aligned sub-1B models and caution against full fine-tuning for architectures smaller than 500M parameters, to prevent catastrophic forgetting.

Turning every "no thats not what i meant" in chat into actual LoRA training data

Reddit r/LocalLLaMA · 2026-05-27

A desktop app that lets users correct model responses in chat and train LoRA adapters locally, closing the feedback loop without manual notebook work.

KappaTune is now part of Hugging Face's PEFT

Reddit r/ArtificialInteligence · 2026-05-20

KappaTune, a fine-tuning method designed to mitigate catastrophic forgetting, has been integrated into Hugging Face's PEFT library.

@_vmlops: FINE-TUNING A 12B MODEL ON A SINGLE GPU IS REAL NOW most people think you need a massive gpu cluster to fine-tune large…

X AI KOLs Timeline · 2026-05-17

Hugging Face's PEFT library enables parameter-efficient fine-tuning of large models on a single GPU, reducing compute and storage costs while maintaining performance.

Echo-LoRA: Parameter-Efficient Fine-Tuning via Cross-Layer Representation Injection

arXiv cs.LG · 2026-05-12

Introduces Echo-LoRA, a parameter-efficient fine-tuning method that injects cross-layer representations from deeper source layers into shallow LoRA modules to improve performance without adding inference-time overhead.

CERSA: Cumulative Energy-Retaining Subspace Adaptation for Memory-Efficient Fine-Tuning

arXiv cs.LG · 2026-05-12

The paper introduces CERSA, a novel parameter-efficient fine-tuning method that uses singular value decomposition to retain principal components, significantly reducing memory usage while outperforming existing methods like LoRA.
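
One common way to "retain principal components" from a singular value decomposition is to keep the smallest number of components whose cumulative energy reaches a target fraction. The sketch below shows that generic rule only; measuring energy as squared singular values, the `rank_for_energy` name, and the 0.9 default are assumptions, and the summary does not confirm that CERSA selects its subspace this way.

```python
from typing import Sequence


def rank_for_energy(singular_values: Sequence[float], energy: float = 0.9) -> int:
    """Smallest k such that the top-k squared singular values hold `energy` of the total."""
    if not 0.0 < energy <= 1.0:
        raise ValueError("energy must be in (0, 1]")
    # Sort descending so the largest components are accumulated first.
    squared = [s * s for s in sorted(singular_values, reverse=True)]
    total = sum(squared)
    if total == 0.0:
        return 0
    running = 0.0
    for k, value in enumerate(squared, start=1):
        running += value
        if running / total >= energy:
            return k
    return len(squared)


# Toy spectrum: squared values are 16, 4, 1, 0.25 (total 21.25).
spectrum = [4.0, 2.0, 1.0, 0.5]
print(rank_for_energy(spectrum, energy=0.9))   # 2, since 20 / 21.25 is about 0.94
print(rank_for_energy(spectrum, energy=0.99))  # 4, since 21 / 21.25 is about 0.988
```

A lower energy target keeps fewer components, which is where the memory savings in this kind of scheme would come from.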

ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning

arXiv cs.CL · 2026-04-22

ShadowPEFT introduces a centralized parameter-efficient fine-tuning method that uses a depth-shared shadow module to refine transformer layer representations, matching or outperforming LoRA/DoRA with comparable trainable parameters.
