parameter-efficient-fine-tuning

Tag

Cards List
#parameter-efficient-fine-tuning

SSM Adapters via Hankel Reduced-order Modeling: Injection Site Determines Task Suitability in Long-Context Fine-Tuning

arXiv cs.LG · yesterday Cached

Introduces Hankel Reduced order Model (HRM) adapter, an SSM-based residual module initialized via Balanced Truncation for parameter-efficient fine-tuning, outperforming LoRA on long-context tasks.

0 favorites 0 likes
#parameter-efficient-fine-tuning

@jbhuang0604: LoRA, low-rank adaptation, is arguably the most popular parameter-efficient fine-tuning method for LLMs. But how does i…

X AI KOLs Timeline · yesterday Cached

LoRA (low-rank adaptation) is the most popular parameter-efficient fine-tuning method for LLMs. This video introduces how LoRA and its variants (LoRA+, QLoRA, VeRA, DoRA) work.

0 favorites 0 likes
#parameter-efficient-fine-tuning

@0xSero: Highly recommended educational content. LoRA is one of the coolest things to dabble in, lets anyone fine tune models re…

X AI KOLs Timeline · 4d ago Cached

This article delves into the principles of LoRA and its variants (QLoRA, VeRA, DoRA), explaining how low-rank decomposition reduces trainable parameters to enable efficient fine-tuning of large models.

0 favorites 0 likes
#parameter-efficient-fine-tuning

ARIADNE: Agnostic Routing for Inference-time Adapter DyNamic sElection

arXiv cs.AI · 2026-06-18 Cached

Proposes ARIADNE, a training-free, adapter-agnostic routing framework that selects the optimal PEFT adapter at inference time by measuring input proximity to adapter-specific centroids in embedding space, recovering 97.44% of upper-bound performance on 23 tasks.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Beyond LoRA: Is Sparsity-Induced Adaptation Better?

arXiv cs.LG · 2026-06-15 Cached

This paper proposes sparsity-induced adaptations to LoRA, including Cheap LoRA (cLA) and a chained circulant variant (c³LA), and provides theoretical generalization bounds along with empirical evaluations showing up to 10% training time reduction and 15% peak GPU memory savings while maintaining competitive performance.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning

arXiv cs.CL · 2026-06-10 Cached

This paper proposes SDBN, a framework combining adversarial training with parameter-efficient fine-tuning to improve robustness of foundation models under noise and limited data, demonstrating substantial improvements in low-resource settings.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Which LoRA? An Empirical Study on the Effectiveness of LoRA Techniques During Multilingual Instruction Tuning

arXiv cs.CL · 2026-06-10 Cached

This paper empirically compares several LoRA variants for multilingual instruction tuning and finds no significant advantage of complex variants over basic LoRA in balancing cross-lingual transfer and knowledge retention.

0 favorites 0 likes
#parameter-efficient-fine-tuning

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

arXiv cs.CL · 2026-06-05 Cached

Presents a systematic study of parameter-efficient fine-tuning using LoRA on Qwen2.5-3B for telecommunications customer support, comparing 16 LoRA configurations with both traditional metrics and energy consumption analysis. Finds divergence between quantitative and qualitative performance.

0 favorites 0 likes
#parameter-efficient-fine-tuning

ReLoRA: Knowledge-Reusing Adaptation for Fast Rollout of Evolving LLM Services

arXiv cs.LG · 2026-06-03 Cached

ReLoRA is a knowledge-reusing adaptation framework that efficiently restores service-ready LoRA adapters for evolving LLM services, reducing time-to-readiness by up to 8.9× and improving accuracy by up to 4.6% through adaptive initialization and scheduled regularization.

0 favorites 0 likes
#parameter-efficient-fine-tuning

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Hugging Face Daily Papers · 2026-06-01 Cached

This paper explores using parameter-efficient fine-tuning (PEFT) as a compact substrate for persistent personal models, studying scaling up, down, and out, and introduces MinT for managing adapters.

0 favorites 0 likes
#parameter-efficient-fine-tuning

FoRA: Fisher-orthogonal Rank Adaptation for Parameter-Efficient Fine-Tuning

arXiv cs.CL · 2026-05-29 Cached

FoRA introduces a parameter-efficient fine-tuning method that selects task-informative layers via Fisher scores and trains LoRA down-projections on the Stiefel manifold, reducing parameters while preserving accuracy.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Hybrid-LoRA: Bridging Full Fine-Tuning and Low-Rank Adaptation for Post-Training

arXiv cs.LG · 2026-05-20

Hybrid-LoRA proposes a framework that selectively applies full fine-tuning to a small subset of modules while using LoRA for the rest, achieving performance near full fine-tuning with significantly lower computational cost. Experiments show improvements of up to 5.65% over existing parameter-efficient baselines.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Freeze Deep, Train Shallow: Interpretable Layer Allocation for Continued Pre-Training

arXiv cs.CL · 2026-05-13 Cached

This paper proposes LayerTracer, an interpretable framework for layer allocation in continued pre-training, demonstrating that freezing deep layers while training shallow ones outperforms full-parameter fine-tuning. It offers a low-cost, actionable strategy for resource-constrained teams optimizing Large Language Models.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Echo-LoRA: Parameter-Efficient Fine-Tuning via Cross-Layer Representation Injection

arXiv cs.LG · 2026-05-12 Cached

The article introduces Echo-LoRA, a new parameter-efficient fine-tuning method that injects cross-layer representations from deeper source layers into shallow LoRA modules to improve performance without adding inference-time overhead.

0 favorites 0 likes
#parameter-efficient-fine-tuning

CERSA: Cumulative Energy-Retaining Subspace Adaptation for Memory-Efficient Fine-Tuning

arXiv cs.LG · 2026-05-12 Cached

The paper introduces CERSA, a novel parameter-efficient fine-tuning method that uses singular value decomposition to retain principal components, significantly reducing memory usage while outperforming existing methods like LoRA.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Gradient-Based LoRA Rank Allocation Under GRPO: An Empirical Study

arXiv cs.CL · 2026-05-11 Cached

This study empirically demonstrates that gradient-based LoRA rank allocation, effective in supervised fine-tuning, degrades performance in GRPO-based reinforcement learning due to flatter gradient landscapes and a gradient amplification effect.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Beyond Factor Aggregation: Gauge-Aware Low-Rank Server Representations for Federated LoRA

arXiv cs.LG · 2026-05-11 Cached

This paper introduces GLoRA, a gauge-aware server representation for Federated LoRA that addresses the semantic mismatch in factor aggregation by estimating a consensus update subspace. Experiments show GLoRA outperforms baselines in performance and efficiency across heterogeneous client scenarios.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning

arXiv cs.CL · 2026-05-08 Cached

This paper proposes Badit, a method that decomposes large language model parameters into orthogonal high-singular-value LoRA experts to mitigate cross-task interference during multi-task instruction tuning.

0 favorites 0 likes
#parameter-efficient-fine-tuning

Queryable LoRA: Instruction-Regularized Routing Over Shared Low-Rank Update Atoms

Hugging Face Daily Papers · 2026-05-08 Cached

Introduces Queryable LoRA, a data-adaptive method for efficient fine-tuning that uses a shared memory of low-rank update atoms with attention-based routing and instruction regularization to enable dynamic, context-sensitive parameter updates while maintaining scalability.

0 favorites 0 likes
#parameter-efficient-fine-tuning

SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning

arXiv cs.CL · 2026-04-22 Cached

SAMoRA introduces a semantic-aware router and task-adaptive scaling to improve expert specialization and dynamic weighting in MoE-LoRA fine-tuning, outperforming prior methods on multi-task benchmarks.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback