test-time-computation

#test-time-computation

@DorothyDDU: LoopCoder-v2 is out Loop Transformers reuse the same block for recurrent hidden-state refinement — letting models “thin…

X AI KOLs Timeline ↗ · 2d ago Cached

This paper introduces LoopCoder-v2, a family of 7B parameter parallel loop transformers for code generation, and studies the optimal number of loops, finding that two loops yield significant gains while more loops cause degradation.

0 favorites 0 likes

#test-time-computation

Test-Time Personalization: A Diagnostic Framework and Probabilistic Fix for Scaling Failures

arXiv cs.LG ↗ · 2026-05-13 Cached

This paper introduces Test-Time Personalization (TTP), a framework that improves LLM personalization by scaling inference-time computation through candidate sampling and reward-based selection. It diagnoses failure modes in standard reward models and proposes a probabilistic personalized reward model to mitigate them.

0 favorites 0 likes

#test-time-computation

PaT: Planning-after-Trial for Efficient Test-Time Code Generation

arXiv cs.CL ↗ · 2026-05-11 Cached

This paper introduces PaT (Planning-after-Trial), an adaptive test-time computation strategy for code generation that reduces inference costs by approximately 69% while maintaining performance comparable to larger models.

0 favorites 0 likes

#test-time-computation

Reliable Chain-of-Thought via Prefix Consistency

Hugging Face Daily Papers ↗ · 2026-05-08 Cached

This paper introduces 'prefix consistency,' a method that weights candidate responses in Chain-of-Thought reasoning based on answer reproduction rates during trace regeneration. It achieves high accuracy with significantly fewer tokens than standard majority voting across various reasoning models and benchmarks.

0 favorites 0 likes

test-time-computation

@DorothyDDU: LoopCoder-v2 is out Loop Transformers reuse the same block for recurrent hidden-state refinement — letting models “thin…

Test-Time Personalization: A Diagnostic Framework and Probabilistic Fix for Scaling Failures

PaT: Planning-after-Trial for Efficient Test-Time Code Generation

Reliable Chain-of-Thought via Prefix Consistency

Submit Feedback