qlora

Tag

Cards List
#qlora

@0xSero: Highly recommended educational content. LoRA is one of the coolest things to dabble in, lets anyone fine tune models re…

X AI KOLs Timeline · 2d ago Cached

This article delves into the principles of LoRA and its variants (QLoRA, VeRA, DoRA), explaining how low-rank decomposition reduces trainable parameters to enable efficient fine-tuning of large models.

0 favorites 0 likes
#qlora

@SergioPaniego: https://x.com/SergioPaniego/status/2066498136273531363

X AI KOLs Timeline · 2026-06-15 Cached

This post demonstrates how to fine-tune a model for free using a single prompt, leveraging the new Google Colab CLI along with Hugging Face's TRL and trackio tools, all orchestrated by an AI agent.

0 favorites 0 likes
#qlora

Small LLMs for Biomedical Claim Verification: Cost-Effective Fine-Tuning, Structural Dataset Shortcuts, and Cross-Domain Generalization

arXiv cs.CL · 2026-06-12 Cached

Fine-tuning small LLMs (3B-7B) with QLoRA on biomedical claim verification achieves higher F1 than GPT-4o and GPT-5 at 44.5x lower cost, and reveals a structural artifact in SciFact. The study demonstrates robust cross-domain transfer when training on structurally sound data.

0 favorites 0 likes
#qlora

bytkim/Qwen3.6-27B-MTP-pi-tune-GGUF

Hugging Face Models Trending · 2026-06-02 Cached

bytkim releases a 4-bit QLoRA SFT Multi-Token Prediction fine-tune of Qwen3.6-27B, packaged as GGUF for local agentic coding. The no-thinking tune is designed for low-latency direct output in agent loops.

0 favorites 0 likes
#qlora

LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification

arXiv cs.CL · 2026-06-02 Cached

This paper presents an iterative imbalance-aware fine-tuning approach using Qwen3-8B with QLoRA for psychological defense mechanism classification, achieving a macro F1 of 0.3917 and ranking 4th out of 21 teams in the PsyDefDetect 2026 shared task.

0 favorites 0 likes
#qlora

qwen 3.6 27B AR-> Diffusion - local training on 5090

Reddit r/LocalLLaMA · 2026-05-26

The author details attempts to locally train a Qwen 3.6 27B autoregressive-to-diffusion model on an Nvidia 5090 GPU using qlora and modifications from open-dllm and d3LLM, facing VRAM constraints and hardware issues while exploring one-shot diffusion techniques.

0 favorites 0 likes
#qlora

@DanKornas: Fine-tuning local LLMs shouldn’t require renting a cloud GPU. Silicon Studio is an open-source desktop app for local LL…

X AI KOLs Following · 2026-05-21 Cached

Silicon Studio is an open-source desktop app that enables local LLM fine-tuning and inference on Apple Silicon Macs using MLX, with features for data preparation, model management, and visual configuration.

0 favorites 0 likes
#qlora

HPC-LLM: Practical Domain Adaptation and Retrieval-Augmented Generation for HPC Support

arXiv cs.LG · 2026-05-19 Cached

This paper presents HPC-LLM, a retrieval-augmented and domain-adapted assistant for HPC workflows, fine-tuning Llama 3.1 8B with QLoRA on HPC documentation. It demonstrates performance comparable to larger general-purpose models with significantly lower resource requirements.

0 favorites 0 likes
#qlora

I trained TIME: short context-triggered thinking on Qwen model instead of overthinking

Reddit r/LocalLLaMA · 2026-05-18

A personal project led to an ACL 2026 paper introducing TIME, a method training Qwen3 models to engage in short, context-triggered thinking rather than excessive reasoning. The work uses QLoRA and a four-phase curriculum, with all data and code released open-source.

0 favorites 0 likes
#qlora

@_vmlops: FINE-TUNING A 12B MODEL ON A SINGLE GPU IS REAL NOW most people think you need a massive gpu cluster to fine-tune large…

X AI KOLs Timeline · 2026-05-17 Cached

Hugging Face's PEFT library enables parameter-efficient fine-tuning of large models on a single GPU, reducing compute and storage costs while maintaining performance.

0 favorites 0 likes
#qlora

Dropping learning rate fixed my Qlora fine-tune more than anything else i tried

Reddit r/LocalLLaMA · 2026-05-14

A user found that reducing the learning rate from 2e-4 to 1e-4 significantly improved QLoRA fine-tuning of Llama 3.1 8B on a small dataset (8k samples), preventing overfitting and leading to better evaluation results.

0 favorites 0 likes
#qlora

Development and Preliminary Evaluation of a Domain-Specific Large Language Model for Tuberculosis Care in South Africa

arXiv cs.CL · 2026-04-23 Cached

Researchers fine-tuned BioMistral-7B with QLoRA and GraphRAG to create a TB-care LLM for South Africa, showing improved contextual alignment over the base model.

0 favorites 0 likes
#qlora

KyleHessling1/Qwopus-GLM-18B-Merged-GGUF

Hugging Face Models Trending · 2026-04-17 Cached

An experimental 18B-parameter model created by stacking two Qwen-3.5-9B finetunes and healing the layer boundary with 1000-step QLoRA; the resulting GGUF beats Qwen 3.6-35B MoE on a 44-test suite while fitting in 9.2 GB VRAM.

0 favorites 0 likes
← Back to home

Submit Feedback