neftune

#neftune

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

arXiv cs.AI ↗ · 2026-06-10 Cached

This paper investigates instruction finetuning of DeepSeek-R1-8B using LoRA and NEFTune for financial named-entity recognition, achieving a micro-F1 of 0.912 and outperforming several baseline models.

0 favorites 0 likes

#neftune

Understanding and Improving Noisy Embedding Techniques in Instruction Finetuning

arXiv cs.LG ↗ · 2026-05-25 Cached

This paper analyzes noisy embedding techniques for instruction fine-tuning, explains why uniform noise outperforms Gaussian, and introduces SymNoise, a symmetric noise method that significantly improves LLaMA-2-7B performance on AlpacaEval over NEFTune.

0 favorites 0 likes

neftune

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

Understanding and Improving Noisy Embedding Techniques in Instruction Finetuning

Submit Feedback