This paper introduces FLAS, a flow-based activation steering method that learns a concept-conditioned velocity field to steer language model activations at inference time. On the AxBench benchmark, FLAS is the first learned method to consistently outperform in-context prompting on held-out concepts without per-concept tuning.
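The mechanism described above (integrating a learned, concept-conditioned velocity field over activations at inference time) can be illustrated with a toy sketch. Everything here is an assumption for illustration: the linear velocity field, the Euler integration, and all names (`velocity`, `steer`, `W`, `U`) are stand-ins, not the paper's actual FLAS implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

D = 8  # toy activation dimensionality

# Hypothetical stand-in for a *learned* velocity field: in the real
# method these parameters would be trained; here they are random.
W = 0.1 * rng.standard_normal((D, D))
U = 0.1 * rng.standard_normal((D, D))

def velocity(h, c):
    """Concept-conditioned velocity field (toy linear form)."""
    return W @ h + U @ c

def steer(h, c, steps=10, dt=0.1):
    """Euler-integrate the flow to move the activation h along the
    concept-conditioned velocity field (illustrative only)."""
    for _ in range(steps):
        h = h + dt * velocity(h, c)
    return h

h = rng.standard_normal(D)   # a hidden activation from some layer
c = rng.standard_normal(D)   # an embedding of the target concept
h_steered = steer(h, c)      # steered activation, same shape as input
```

In a real setup, `h` would be an intermediate hidden state of the language model and the steered activation would replace it in the forward pass; this sketch only shows the flow-integration shape of the idea.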
The PSRD framework halves multimodal hallucination in LVLMs via phase-wise self-reward decoding and a distilled lightweight reward model, without requiring extra supervision.