inference-time-alignment

#inference-time-alignment

To Intervene or Not: Guiding Inference-time Alignment with Probabilistic Model Blending

arXiv cs.LG · 4d ago

This paper introduces BlendIn, an inference-time alignment framework that uses probabilistic model blending to assess how reliable the guidance is and to weight each model's contribution in proportion. By avoiding harmful interventions, it reports performance improvements of up to 50%.

#inference-time-alignment

Conflict-Aware Additive Guidance for Flow Models under Compositional Rewards

arXiv cs.AI · 2026-05-22

The paper identifies off-manifold drift in guided flow models under compositional rewards and proposes Conflict-Aware Additive Guidance (CAR), a lightweight method that dynamically resolves gradient conflicts to improve generation fidelity without retraining.

#inference-time-alignment

Harnesses for Inference-Time Alignment over Execution Trajectories

arXiv cs.LG · 2026-05-22

This paper studies harness design for LLM agents, separating it into task decomposition and guided execution. It shows that more elaborate harnesses are not uniformly better, identifies failure modes, and proposes partial harnesses as an effective alternative.
