test-time-training

#test-time-training

Linearizing Vision Transformer with Test-Time Training

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

This paper proposes a method to convert pretrained Softmax attention models into linear-complexity Test-Time Training (TTT) architectures, achieving comparable text-to-image quality to fine-tuned Softmax models while significantly accelerating inference. The approach is validated by linearizing Stable Diffusion 3.5, resulting in SD3.5-T^5 with 1.32x speedup at 1K resolution.

0 favorites 0 likes

#test-time-training

Test-Time Training Undermines Safety Guardrails

arXiv cs.LG ↗ · 2026-05-25 Cached

This paper identifies three threat models for test-time training (TTT) that adversaries can exploit to bypass safety filters in LLMs, achieving high attack success rates. The findings reveal that TTT introduces new vulnerabilities that undermine existing safety guardrails.

0 favorites 0 likes

#test-time-training

TEMPO: Scaling Test-time Training for Large Reasoning Models

Hugging Face Daily Papers ↗ · 2026-04-21 Cached

TEMPO introduces a test-time training framework that alternates policy refinement with critic recalibration to prevent diversity collapse and sustain performance gains in large reasoning models, boosting AIME 2024 scores for Qwen3-14B from 42.3% to 65.8%.

0 favorites 0 likes

test-time-training

Linearizing Vision Transformer with Test-Time Training

Test-Time Training Undermines Safety Guardrails

TEMPO: Scaling Test-time Training for Large Reasoning Models

Submit Feedback