training-free

#training-free

Ground Then Rank: Revisiting Knowledge-Based VQA with Training-Free Entity Identification

arXiv cs.CL ↗ · 12h ago Cached

This paper proposes a training-free 'identify-before-answer' (IBA) framework for Knowledge-Based Visual Question Answering (KB-VQA) that decouples entity identification from evidence ranking, outperforming fine-tuned multi-modal retrieval-augmented generation baselines while reducing complexity.

0 favorites 0 likes

#training-free

Safe Few-Step Generation via Velocity Editing

Hugging Face Daily Papers ↗ · 2d ago Cached

VESFlow is a training-free safety method for flow matching-based text-to-image generation that edits velocity fields to ensure safe output while maintaining prompt integrity.

0 favorites 0 likes

#training-free

Most multi-hop RAG goes stale the moment your data changes, what about a training-free approach that skips the graph rebuild?

Reddit r/artificial ↗ · 2d ago

Presents a training-free method for multi-hop retrieval-augmented generation that avoids costly graph rebuilds when underlying data changes, tackling the staleness issue in dynamic environments.

0 favorites 0 likes

#training-free

Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding

Hugging Face Daily Papers ↗ · 4d ago Cached

This paper introduces Confident Decoding, a training-free decoding strategy that dynamically selects the most reliable intermediate layer in LLMs using entropy-guided search, mitigating the alignment tax and improving reasoning performance on benchmarks like GPQA-Diamond and Omni-MATH with negligible overhead.

0 favorites 0 likes

#training-free

Lost in a Single Vector: Improving Long-Document Retrieval with Chunk Evidence Aggregation

arXiv cs.CL ↗ · 6d ago Cached

This paper identifies document-side early compression as a failure mode in long-document dense retrieval and introduces the Evidence Dilution Index (EDI) to measure it. The authors propose DICE, a training-free method that splits documents into chunks, encodes them independently, and aggregates them into a single vector, significantly improving retrieval on long documents.

0 favorites 0 likes

#training-free

JanusMesh: Fast and Zero-Shot 3D Visual Illusion Generation via Cross-Space Denoising

Hugging Face Daily Papers ↗ · 6d ago Cached

JanusMesh is a fast, training-free framework that generates text-driven 3D visual illusions—a single mesh revealing different semantics from different viewing angles—by decoupling generation into cross-space dual-branch denoising and view-conditioned texture synthesis, achieving high realism in just 3-5 minutes.

0 favorites 0 likes

#training-free

Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search

arXiv cs.AI ↗ · 2026-06-17 Cached

This paper identifies an anchor collapse phenomenon in agentic search where parallel trajectories converge due to similar initial queries, and proposes DivInit, a training-free method that samples diverse initial queries to improve multi-hop question answering performance.

0 favorites 0 likes

#training-free

Bag of Dims: Training-Free Mechanistic Interpretability via Dimension-Level Sign Patterns

Hugging Face Daily Papers ↗ · 2026-06-17 Cached

Proposes the Bag of Dims framework showing that the standard basis of transformer hidden states provides a training-free, architecture-general feature representation where dimensions encode semantic content via sign patterns; validated across language, vision, and audio models, achieving high accuracy with no learned rotations.

0 favorites 0 likes

#training-free

@sheriyuo: This paper proposes ASAG, Attention-State Adaptive Generation, a training-free, plug-and-play stopping framework for re…

X AI KOLs Timeline ↗ · 2026-06-16 Cached

ASAG uses attention entropy to detect when reasoning is unproductive, stopping early to improve accuracy and reduce token generation. Experiments on Qwen3-8B show a 4.4% accuracy gain and over 40% fewer generated tokens.

0 favorites 0 likes

#training-free

DiRecT: Safe Diffusion-Based Planning via Receding-Horizon Denoising

arXiv cs.LG ↗ · 2026-06-16 Cached

DiRecT introduces a training-free algorithm for safe diffusion-based planning that enforces constraints only on final clean trajectories using receding-horizon denoising, improving safety and performance over existing methods.

0 favorites 0 likes

#training-free

High-Dimensional Random Projection for Activation Steering in Language Models

arXiv cs.LG ↗ · 2026-06-16 Cached

HiDRA is a training-free method that uses high-dimensional random projection for activation steering in LLMs, capturing discriminative signals beyond linear methods and consistently outperforming existing baselines across diverse model families and benchmarks.

0 favorites 0 likes

#training-free

Numbers Already Carry Their Own Embeddings

arXiv cs.LG ↗ · 2026-06-15 Cached

Introduces Adelic operation-preserved embeddings (AOE), a training-free representation that encodes numbers by combining real value with p-adic expansions, preserving additive and multiplicative structure. Achieves perfect accuracy on the Weaving Pattern benchmark.

0 favorites 0 likes

#training-free

A Stationarity-and-Coupling Criterion for Training-Free Time-Lagged Spectral Embeddings of Multivariate Time Series

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper proposes a falsifiable applicability criterion for a training-free, fixed-length descriptor for multivariate time series based on time-lagged spectral embeddings, showing when it can be expected to work and validating it on multiple benchmarks.

0 favorites 0 likes

#training-free

@HuggingPapers: SpatialClaw NVIDIA drops a training-free spatial reasoning agent that uses code as its action interface. A VLM writes P…

X AI KOLs Following ↗ · 2026-06-12 Cached

NVIDIA introduces SpatialClaw, a training-free spatial reasoning agent that uses a VLM to write Python code in a persistent kernel, compose perception tools, and revise plans, achieving +11.2 points over prior agents on 20 benchmarks.

0 favorites 0 likes

#training-free

SkillCAT: Contrastive Assessment and Topology-Aware Skill Self-Evolution for LLM Agents

arXiv cs.CL ↗ · 2026-06-12 Cached

SkillCAT is a training-free framework for LLM agent skill self-evolution that addresses limitations of single-trace bias, unverified merging, and full corpus loading via three stages: Contrastive Causal Extraction, Assessment-Augmented Evolution, and Topology-Aware Task Execution, achieving up to 40.40% improvement on benchmarks.

0 favorites 0 likes

#training-free

RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference

arXiv cs.LG ↗ · 2026-06-10 Cached

Introduces RKSC, a training-free inference framework for multi-branch LLM reasoning that reduces KV cache redundancy via similarity-based sharing and early exit, achieving up to 3x speedup with minimal error.

0 favorites 0 likes

#training-free

Sample Where You Struggle: Sharpening Base Model Reasoning via Entropy-Guided Power Sampling

arXiv cs.LG ↗ · 2026-06-10 Cached

This paper introduces Entropy-Guided Power Sampling (EGPS), a training-free and verifier-free sampler that improves the efficiency of power sampling for enhancing base language model reasoning. EGPS achieves up to 12.6x speedup over standard Metropolis-Hastings sampling while reaching best or tied-best accuracy on benchmarks like MATH500, HumanEval, and GPQA.

0 favorites 0 likes

#training-free

Mitigating Manifold Departure: Uncertainty-Aware Subspace Rectification for Trustworthy MLLM Decoding

arXiv cs.LG ↗ · 2026-06-10 Cached

This paper introduces MGAP, a training-free decoding method that reduces hallucinations in Multimodal Large Language Models by adaptively suppressing only the harmful parts of language priors while preserving the model's semantic manifold. The method outperforms prior baselines on POPE and CHAIR benchmarks.

0 favorites 0 likes

#training-free

Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

arXiv cs.CL ↗ · 2026-06-10 Cached

Dep-LLM is a training-free framework that uses frozen large language models to diagnose depression from clinical interviews by decomposing dialogue into five clinically aligned themes with evidence-grounded reasoning and confidence modulation, outperforming zero-shot and some supervised methods on DAIC-WOZ and E-DAIC datasets.

0 favorites 0 likes

#training-free

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models

arXiv cs.CL ↗ · 2026-06-10 Cached

This paper proposes Prefilling-dLLM, a training-free framework that partitions the prefix into chunks and caches KV representations, achieving state-of-the-art quality and up to 28x speedup for long-context inference in diffusion language models.

0 favorites 0 likes

training-free

Submit Feedback