model-adaptation

#model-adaptation

MiCA is now part of Hugging Face PEFT

Reddit r/LocalLLaMA · 6d ago

MiCA (Minor Component Adaptation), a new fine-tuning method that initializes adapters in the minor singular subspace for better knowledge uptake and less forgetting, has been merged into the Hugging Face PEFT library. It is available via the PEFT main branch and integrates through the existing LoRA interface with init_lora_weights='mica'.
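The summary above says MiCA is selected through the existing LoRA interface. A minimal sketch of what that could look like follows. The rank, alpha, and target module names are assumptions chosen for illustration and do not come from the post; only `init_lora_weights='mica'` is taken from it. The helper `build_mica_config` is a hypothetical name, and it needs a PEFT build that includes MiCA (the main branch, per the post).

```python
# Illustrative LoRA settings. Everything except init_lora_weights is an
# assumption for this sketch, not a value given in the post.
MICA_LORA_KWARGS = {
    "r": 8,                                  # assumed adapter rank
    "lora_alpha": 16,                        # assumed scaling factor
    "target_modules": ["q_proj", "v_proj"],  # assumed module names
    "init_lora_weights": "mica",             # the value named in the post
}


def build_mica_config():
    """Build a LoraConfig that requests MiCA initialization.

    Hypothetical helper: requires a PEFT install that includes MiCA.
    """
    # Imported lazily so this sketch can be loaded without PEFT installed.
    from peft import LoraConfig

    return LoraConfig(**MICA_LORA_KWARGS)
```

The resulting config would then be passed to `get_peft_model(model, config)` in the same way as any other LoRA config.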

@techNmak: Everyone is fine-tuning LLMs. Almost nobody understands what is actually being updated inside the model. Here are 5 tec…

X AI KOLs Timeline · 2026-05-21

Explains five parameter-efficient fine-tuning techniques: LoRA, LoRA-FA, VeRA, Delta-LoRA, and LoRA+, detailing how each modifies model weights during adaptation.
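As a rough illustration of what "modifies model weights" means for plain LoRA, the first technique in that list, here is a dependency-free sketch. It covers only the standard LoRA update W + (alpha / r) · B A with B initialized to zero, not the four variants, and the matrix sizes and function names are assumptions made for the example.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, lora_a, lora_b, alpha, r):
    """Return W + (alpha / r) * B @ A; the frozen W is not modified."""
    delta = matmul(lora_b, lora_a)
    scale = alpha / r
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def lora_param_count(d_out, d_in, r):
    """Trainable parameters in A (r x d_in) plus B (d_out x r)."""
    return r * (d_in + d_out)


random.seed(0)
d_out, d_in, r, alpha = 8, 8, 2, 4  # toy sizes, assumed for illustration

# Frozen pretrained weight (random stand-in).
w = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
# Standard LoRA init: A is small random, B is zero, so the update starts at zero.
lora_a = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]
lora_b = [[0.0] * r for _ in range(d_out)]

w_eff = lora_effective_weight(w, lora_a, lora_b, alpha, r)
```

Because B starts at zero, `w_eff` equals `w` before any training step. For a 4096 × 4096 layer at rank 8, `lora_param_count` gives 65,536 trainable values against 16,777,216 in the full matrix.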

EMA: Efficient Model Adaptation for Learning-based Systems

arXiv cs.LG · 2026-05-15

This paper presents EMA, a model adaptation system for learning-based systems that reduces training and labeling costs while improving system performance in evolving environments.

T5Gemma: A new collection of encoder-decoder Gemma models

Google DeepMind Blog · 2025-10-25

Google introduces T5Gemma, a new collection of encoder-decoder models adapted from the Gemma 2 decoder-only architecture, offering improved quality-efficiency trade-offs for tasks like summarization and translation.

GPT-3.5 Turbo fine-tuning and API updates

OpenAI Blog · 2023-08-22

OpenAI has released fine-tuning capabilities for GPT-3.5 Turbo, allowing developers to customize models for specific use cases with improved performance, steerability, and output formatting. The update enables fine-tuned GPT-3.5 Turbo to match GPT-4 performance on certain tasks while reducing prompt sizes by up to 90%.
