test-time-adaptation

Tag

Cards List
#test-time-adaptation

Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Hugging Face Daily Papers · 4d ago Cached

QGF is an RL algorithm that improves policies at test time by using a value gradient to guide a pre-trained flow policy, avoiding training-time instability while maintaining competitive performance.

0 favorites 0 likes
#test-time-adaptation

From Demonstrations to Rewards: Test-Time Prompt Optimization for VLM Reward Models

arXiv cs.LG · 2026-06-02 Cached

Proposes Demo2Reward, a test-time prompt optimization technique for VLM reward models using a few expert demonstrations, significantly reducing false positives and improving policy learning in robotics without additional model training.

0 favorites 0 likes
#test-time-adaptation

MMD-Balls as Credal Sets: A PAC-Bayesian Framework for Epistemic Uncertainty in Test-Time Adaptation

arXiv cs.LG · 2026-05-22 Cached

This paper develops a PAC-Bayesian framework for test-time adaptation that uses MMD-balls as credal sets, providing formal generalization bounds and separating epistemic from aleatoric uncertainty under distribution shift.

0 favorites 0 likes
#test-time-adaptation

Hierarchical Variational Policies for Reward-Guided Diffusion

arXiv cs.LG · 2026-05-22 Cached

Proposes a hierarchical variational policy framework for reward-guided diffusion, enabling high-quality sampling with reduced inference cost. Achieves strong quality-speed tradeoff on tasks like super-resolution.

0 favorites 0 likes
#test-time-adaptation

SOLAR: A Self-Optimizing Open-Ended Autonomous Agent for Lifelong Learning and Continual Adaptation

arXiv cs.AI · 2026-05-22 Cached

SOLAR proposes a self-optimizing autonomous agent that leverages parameter-level meta-learning and multi-level reinforcement learning to enable lifelong adaptation of LLMs to non-stationary data streams, outperforming baselines on reasoning tasks.

0 favorites 0 likes
#test-time-adaptation

Federated Nested Learning: Collaborative Training of Self-Referential Memories for Test-Time Adaptation

arXiv cs.LG · 2026-05-19 Cached

Proposes Federated Nested Learning (FedNL), a framework that reformulates federated learning as a three-level nested optimization system, enabling collaborative training of self-referential memories for test-time adaptation to handle Non-IID data and long-tail distributions.

0 favorites 0 likes
#test-time-adaptation

Reliability-Gated Source Anchoring for Continual Test-Time Adaptation

arXiv cs.LG · 2026-05-15 Cached

This paper proposes RMemSafe, a reliability-gated extension for continual test-time adaptation that attenuates source anchoring when the frozen source's predictive entropy becomes high, preventing blind anchoring under source collapse. The method achieves state-of-the-art error reduction on the CCC benchmark.

0 favorites 0 likes
#test-time-adaptation

TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems

Hugging Face Daily Papers · 2026-05-10 Cached

This paper introduces TacoMAS, a framework for test-time co-evolution of agent capabilities and communication topology in LLM-based multi-agent systems. It demonstrates that jointly adapting fast capability loops and slow topology loops improves performance and stability over existing baselines.

0 favorites 0 likes
#test-time-adaptation

FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation

Hugging Face Daily Papers · 2026-05-08 Cached

FAAST proposes a forward-only method that compiles labeled examples into fast weights analytically, enabling efficient test-time supervised adaptation without backpropagation, achieving over 90% speedup and 95% memory savings while maintaining performance.

0 favorites 0 likes
#test-time-adaptation

Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation

arXiv cs.CL · 2026-04-20 Cached

This paper proposes CAP-TTA, a test-time adaptation framework that uses preconditioned LoRA updates triggered by bias-risk scores to mitigate toxicity and bias in large language models during narrative generation, achieving faster optimization and better fluency than standard baselines.

0 favorites 0 likes
#test-time-adaptation

TTL: Test-time Textual Learning for OOD Detection with Pretrained Vision-Language Models

arXiv cs.CL · 2026-04-20 Cached

TTL introduces a test-time textual learning framework for OOD detection using pretrained vision-language models like CLIP, which dynamically learns OOD semantics from unlabeled test streams without external OOD labels. The method uses pseudo-labeled samples and an OOD knowledge purification strategy to improve detection robustness across diverse and evolving OOD distributions.

0 favorites 0 likes
← Back to home

Submit Feedback