training-free

#training-free

SharQ: Bridging Activation Sparsity and FP4 Quantization for LLM Inference

arXiv cs.LG ↗ · 6h ago Cached

SharQ introduces a training-free method combining activation sparsity and FP4 quantization for LLM inference, using sparse-dense decomposition and a unified FP4 weight payload. It achieves significant latency reduction and accuracy recovery over FP4-only baselines.

0 favorites 0 likes

#training-free

Dynamic-dLLM: Dynamic Cache-Budget and Adaptive Parallel Decoding for Training-Free Acceleration of Diffusion LLM

arXiv cs.CL ↗ · 6h ago Cached

This paper proposes Dynamic-dLLM, a training-free framework that accelerates diffusion large language models by dynamically allocating cache-update budgets and calibrating decoding thresholds, achieving over 3x speedup on models like LLaDA and Dream while maintaining performance.

0 favorites 0 likes

#training-free

TRACER: Training-Free Closed-Loop Structured Inference for Traffic Accident Reconstruction

arXiv cs.LG ↗ · yesterday Cached

TRACER is a training-free framework for traffic accident reconstruction that formulates the problem as closed-loop structured inference, iteratively refining event-anchored motion hypotheses under geometric and kinematic constraints, achieving improved fidelity and consistency over data-driven and physics-based baselines.

0 favorites 0 likes

#training-free

Ground Then Rank: Revisiting Knowledge-Based VQA with Training-Free Entity Identification

arXiv cs.CL ↗ · 2d ago Cached

This paper proposes a training-free 'identify-before-answer' (IBA) framework for Knowledge-Based Visual Question Answering (KB-VQA) that decouples entity identification from evidence ranking, outperforming fine-tuned multi-modal retrieval-augmented generation baselines while reducing complexity.

0 favorites 0 likes

#training-free

Safe Few-Step Generation via Velocity Editing

Hugging Face Daily Papers ↗ · 4d ago Cached

VESFlow is a training-free safety method for flow matching-based text-to-image generation that edits velocity fields to ensure safe output while maintaining prompt integrity.

0 favorites 0 likes

#training-free

Most multi-hop RAG goes stale the moment your data changes, what about a training-free approach that skips the graph rebuild?

Reddit r/artificial ↗ · 4d ago

Presents a training-free method for multi-hop retrieval-augmented generation that avoids costly graph rebuilds when underlying data changes, tackling the staleness issue in dynamic environments.

0 favorites 0 likes

#training-free

Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding

Hugging Face Daily Papers ↗ · 6d ago Cached

This paper introduces Confident Decoding, a training-free decoding strategy that dynamically selects the most reliable intermediate layer in LLMs using entropy-guided search, mitigating the alignment tax and improving reasoning performance on benchmarks like GPQA-Diamond and Omni-MATH with negligible overhead.

0 favorites 0 likes

#training-free

Lost in a Single Vector: Improving Long-Document Retrieval with Chunk Evidence Aggregation

arXiv cs.CL ↗ · 2026-06-18 Cached

This paper identifies document-side early compression as a failure mode in long-document dense retrieval and introduces the Evidence Dilution Index (EDI) to measure it. The authors propose DICE, a training-free method that splits documents into chunks, encodes them independently, and aggregates them into a single vector, significantly improving retrieval on long documents.

0 favorites 0 likes

#training-free

JanusMesh: Fast and Zero-Shot 3D Visual Illusion Generation via Cross-Space Denoising

Hugging Face Daily Papers ↗ · 2026-06-18 Cached

JanusMesh is a fast, training-free framework that generates text-driven 3D visual illusions—a single mesh revealing different semantics from different viewing angles—by decoupling generation into cross-space dual-branch denoising and view-conditioned texture synthesis, achieving high realism in just 3-5 minutes.

0 favorites 0 likes

#training-free

Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search

arXiv cs.AI ↗ · 2026-06-17 Cached

This paper identifies an anchor collapse phenomenon in agentic search where parallel trajectories converge due to similar initial queries, and proposes DivInit, a training-free method that samples diverse initial queries to improve multi-hop question answering performance.

0 favorites 0 likes

#training-free

Bag of Dims: Training-Free Mechanistic Interpretability via Dimension-Level Sign Patterns

Hugging Face Daily Papers ↗ · 2026-06-17 Cached

Proposes the Bag of Dims framework showing that the standard basis of transformer hidden states provides a training-free, architecture-general feature representation where dimensions encode semantic content via sign patterns; validated across language, vision, and audio models, achieving high accuracy with no learned rotations.

0 favorites 0 likes

#training-free

@sheriyuo: This paper proposes ASAG, Attention-State Adaptive Generation, a training-free, plug-and-play stopping framework for re…

X AI KOLs Timeline ↗ · 2026-06-16 Cached

ASAG uses attention entropy to detect when reasoning is unproductive, stopping early to improve accuracy and reduce token generation. Experiments on Qwen3-8B show a 4.4% accuracy gain and over 40% fewer generated tokens.

0 favorites 0 likes

#training-free

DiRecT: Safe Diffusion-Based Planning via Receding-Horizon Denoising

arXiv cs.LG ↗ · 2026-06-16 Cached

DiRecT introduces a training-free algorithm for safe diffusion-based planning that enforces constraints only on final clean trajectories using receding-horizon denoising, improving safety and performance over existing methods.

0 favorites 0 likes

#training-free

High-Dimensional Random Projection for Activation Steering in Language Models

arXiv cs.LG ↗ · 2026-06-16 Cached

HiDRA is a training-free method that uses high-dimensional random projection for activation steering in LLMs, capturing discriminative signals beyond linear methods and consistently outperforming existing baselines across diverse model families and benchmarks.

0 favorites 0 likes

#training-free

Numbers Already Carry Their Own Embeddings

arXiv cs.LG ↗ · 2026-06-15 Cached

Introduces Adelic operation-preserved embeddings (AOE), a training-free representation that encodes numbers by combining real value with p-adic expansions, preserving additive and multiplicative structure. Achieves perfect accuracy on the Weaving Pattern benchmark.

0 favorites 0 likes

#training-free

A Stationarity-and-Coupling Criterion for Training-Free Time-Lagged Spectral Embeddings of Multivariate Time Series

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper proposes a falsifiable applicability criterion for a training-free, fixed-length descriptor for multivariate time series based on time-lagged spectral embeddings, showing when it can be expected to work and validating it on multiple benchmarks.

0 favorites 0 likes

#training-free

@HuggingPapers: SpatialClaw NVIDIA drops a training-free spatial reasoning agent that uses code as its action interface. A VLM writes P…

X AI KOLs Following ↗ · 2026-06-12 Cached

NVIDIA introduces SpatialClaw, a training-free spatial reasoning agent that uses a VLM to write Python code in a persistent kernel, compose perception tools, and revise plans, achieving +11.2 points over prior agents on 20 benchmarks.

0 favorites 0 likes

#training-free

SkillCAT: Contrastive Assessment and Topology-Aware Skill Self-Evolution for LLM Agents

arXiv cs.CL ↗ · 2026-06-12 Cached

SkillCAT is a training-free framework for LLM agent skill self-evolution that addresses limitations of single-trace bias, unverified merging, and full corpus loading via three stages: Contrastive Causal Extraction, Assessment-Augmented Evolution, and Topology-Aware Task Execution, achieving up to 40.40% improvement on benchmarks.

0 favorites 0 likes

#training-free

RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference

arXiv cs.LG ↗ · 2026-06-10 Cached

Introduces RKSC, a training-free inference framework for multi-branch LLM reasoning that reduces KV cache redundancy via similarity-based sharing and early exit, achieving up to 3x speedup with minimal error.

0 favorites 0 likes

#training-free

Sample Where You Struggle: Sharpening Base Model Reasoning via Entropy-Guided Power Sampling

arXiv cs.LG ↗ · 2026-06-10 Cached

This paper introduces Entropy-Guided Power Sampling (EGPS), a training-free and verifier-free sampler that improves the efficiency of power sampling for enhancing base language model reasoning. EGPS achieves up to 12.6x speedup over standard Metropolis-Hastings sampling while reaching best or tied-best accuracy on benchmarks like MATH500, HumanEval, and GPQA.

0 favorites 0 likes

training-free

Submit Feedback