fine-tuning

Tag

Cards List
#fine-tuning

PACT: Preserving Anchored Cores in Task-vectors for Model Merging

arXiv cs.LG · 5d ago Cached

The paper identifies 'Load-Bearing Wall' dimensions in pre-trained models that retain task-specific knowledge not fully captured by task vectors in model merging, and proposes PACT (PreserveAnchoredCores) to preserve these cores, achieving state-of-the-art performance across benchmarks.

0 favorites 0 likes
#fine-tuning

ProfiLLM: Utility-Aligned Agentic User Profiling for Industrial Ride-Hailing Dispatch

arXiv cs.AI · 5d ago Cached

ProfiLLM introduces an agentic LLM pipeline that generates utility-aligned user profiles from platform-scale behavioral logs for industrial ride-hailing dispatch, achieving significant improvements in outcome prediction and GMV in production at DiDi.

0 favorites 0 likes
#fine-tuning

Toward Parking Spot Occupancy Recognition: A Self-Supervised Approach

Hugging Face Daily Papers · 6d ago Cached

This paper presents a self-supervised transfer learning approach for parking spot occupancy recognition that achieves high accuracy (up to 97.8%) with minimal labeled data using a two-stage training strategy with SimCLR and ResNet-50.

0 favorites 0 likes
#fine-tuning

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Hugging Face Blog · 6d ago Cached

Explores whether LoRA is the best parameter-efficient fine-tuning technique and introduces the PEFT library's tools to compare methods.

0 favorites 0 likes
#fine-tuning

@LangChain: Fine-tuning open models can exceed or match frontier models. Base @Alibaba_Qwen out of the box w/ good prompting: Stron…

X AI KOLs Following · 6d ago Cached

Fine-tuning open models like Alibaba's Qwen with LoRA can match or exceed frontier model performance on error classification tasks.

0 favorites 0 likes
#fine-tuning

ostris/ideogram_4_turbotime_lora

Hugging Face Models Trending · 6d ago Cached

A LoRA that adapts Ideogram 4 to generate high-quality images in as few as 2 steps without CFG, using a novel continuous turbo training method.

0 favorites 0 likes
#fine-tuning

@cjzafir: A 3B parameter SLM: VibeThinker (fine-tuned on Qwen 2.5) matches Claude Opus 4.5 performance. Same performance as: > De…

X AI KOLs Timeline · 6d ago Cached

VibeThinker, a 3B parameter model fine-tuned on Qwen 2.5, achieves performance comparable to Claude Opus 4.5 and much larger models like DeepSeek v3 through innovative post-training that includes multi-path thinking and staged training on math, coding, and science.

0 favorites 0 likes
#fine-tuning

@h100envy: Daniel Han wrote Unsloth, the reason half of open-source can fine-tune a model on one GPU instead of a cluster. He didn…

X AI KOLs Timeline · 6d ago Cached

Daniel Han built Unsloth, a tool that rewrites GPU kernels to make fine-tuning 2-3 times faster on a single GPU, enabling many open-source users to train models without a cluster.

0 favorites 0 likes
#fine-tuning

@witcheer: this is the first Qwen3.6-27B coding tune I've measured that improves real bug-fixing (!!!). - quality (MMLU/ARC/HellaS…

X AI KOLs Timeline · 6d ago Cached

A community fine-tune of Qwen3.6-27B improves real bug-fixing on SWE-bench while maintaining quality, unlike synthetic distillations that regress.

0 favorites 0 likes
#fine-tuning

@ai_explorer25: Confused by all the AI hype? These people make it easy to understand and they teach you how to actually build your own …

X AI KOLs Timeline · 6d ago Cached

A curated list of X/Twitter accounts that explain AI concepts and teach how to build tools, agents, and frameworks, covering retrieval, testing, fine-tuning, and more.

0 favorites 0 likes
#fine-tuning

@MiaAI_lab: MTP is up, test it out https://huggingface.co/Mia-AiLab/Qwable-3.6-27b-MTP…

X AI KOLs Timeline · 6d ago Cached

Mia-AiLab releases Qwable-3.6-27b-MTP, a full fine-tuned checkpoint of Qwen3.6-27B using a cleaned Fable 5 reasoning and instruction dataset, focused on code, structured reasoning, and local inference with MTP layers.

0 favorites 0 likes
#fine-tuning

Fine-tuning LLMs for Passive Depression Severity Estimation from AI Mental Health Dialogue

arXiv cs.CL · 6d ago Cached

This paper presents a method for fine-tuning LLMs to predict PHQ-9 depression severity scores directly from transcripts of conversations with an AI mental health application, achieving strong correlation with clinical thresholds using a augmented dataset of 6,283 users.

0 favorites 0 likes
#fine-tuning

Improving low-resource ASR using bilingual fine-tuning with language identification: a cross-linguistic evaluation

arXiv cs.CL · 6d ago Cached

This study evaluates bilingual fine-tuning with language identification tokens for improving ASR in low-resource languages across nine diverse language pairs, finding that high LID accuracy is beneficial and that providing the LID token at inference can boost performance when LID accuracy is low.

0 favorites 0 likes
#fine-tuning

SuCo: Sufficiency-guided Continuous Adaptive Reasoning

arXiv cs.CL · 6d ago Cached

Introduces SuCo, a two-stage training framework for Large Reasoning Models that uses the concept of Minimal Sufficient CoT to reduce reasoning tokens while improving accuracy across math, code, and science benchmarks.

0 favorites 0 likes
#fine-tuning

Are you speaking my languages? On spoken language adherence in multimodal LLMs

arXiv cs.CL · 6d ago Cached

This paper addresses the problem of spoken language adherence in multimodal LLMs for ASR, proposing a soft prompting approach and novel metric to quantify language violations. It evaluates three mitigation strategies—zero-shot prompting, supervised fine-tuning, and chain-of-thought reasoning—across multiple languages to improve transcription fidelity.

0 favorites 0 likes
#fine-tuning

When the Next Step Is Not One Step: Distribution-Aware Execution Modeling for Concurrent Go Programs

arXiv cs.LG · 6d ago Cached

This paper proposes a distribution-aware training approach for modeling next-event predictions in concurrent Go programs, treating scheduler nondeterminism as a signal. Fine-tuning a 7B model on fewer than a thousand traces achieves 36.2% accuracy on production bugs, outperforming Gemini 3.5 Flash zero-shot.

0 favorites 0 likes
#fine-tuning

RepSelect: Robust LLM Unlearning via Representation Selectivity

arXiv cs.CL · 6d ago Cached

RepSelect introduces a method for robust LLM unlearning that isolates forget-set-specific representations by collapsing top principal components of weight gradients, achieving 4-50× better robustness against relearning attacks compared to existing baselines across multiple model families.

0 favorites 0 likes
#fine-tuning

@barrowjoseph: I think the MS FastContext paper is a good glimpse of the types of agentic systems companies are going to be building i…

X AI KOLs Timeline · 2026-06-17 Cached

A Microsoft and SJTU research paper introduces FastContext, a dedicated exploration subagent for coding agents that separates repository navigation from task solving, reducing orchestrator token usage by up to 60% and improving resolution rates by 5.5% on SWE-bench benchmarks.

0 favorites 0 likes
#fine-tuning

Native Active Perception as Reasoning for Omni-Modal Understanding

Hugging Face Daily Papers · 2026-06-17 Cached

Introduces OmniAgent, an omni-modal agent that uses an iterative Observation-Thought-Action cycle with active perception to achieve superior long video understanding, outperforming larger models like Qwen2.5-VL-72B on benchmarks.

0 favorites 0 likes
#fine-tuning

@pcuenq: GLM 5.2 has just been released Here it's already running with MLX on two Mac Studios (M3 Ultra). This is comparable to …

X AI KOLs Timeline · 2026-06-16 Cached

GLM 5.2, an open-weight AI model comparable to top closed models, has been released and is now running on MLX on two Mac Studios (M3 Ultra).

0 favorites 0 likes
← Previous
Next →
← Back to home

Submit Feedback