frame-selection

#frame-selection

PEEK: Picking Essential frames via Efficient Knowledge distillation

Hugging Face Daily Papers · 2026-05-29

Introduces PEEK, an efficient dynamic frame sampling method that distills caption-conditioned frame relevance rankings from a teacher model into a lightweight temporal model, outperforming state-of-the-art methods in video captioning while maintaining computational efficiency.

Swift Sampling: Selecting Temporal Surprises via Taylor Series

Hugging Face Daily Papers · 2026-05-21

Swift Sampling is a training-free algorithm that uses Taylor expansion to identify high-information moments in long-form videos by detecting deviations from predicted feature trajectories, improving accuracy on video QA tasks with minimal computational overhead.
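The idea of scoring frames by their deviation from a predicted feature trajectory can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the second-order expansion, the backward finite differences, and the Euclidean error norm are all assumptions made for illustration. Each frame is assumed to be represented by a precomputed feature vector.

```python
import math


def taylor_surprise_scores(features):
    """Score each frame by its distance from a second-order Taylor
    extrapolation of the three preceding frames (illustrative assumption,
    not the published algorithm)."""
    scores = [0.0] * len(features)
    for t in range(3, len(features)):
        prev1, prev2, prev3 = features[t - 1], features[t - 2], features[t - 3]
        squared_error = 0.0
        for actual, a, b, c in zip(features[t], prev1, prev2, prev3):
            velocity = a - b                   # backward first difference
            acceleration = a - 2.0 * b + c     # backward second difference
            predicted = a + velocity + 0.5 * acceleration
            squared_error += (actual - predicted) ** 2
        scores[t] = math.sqrt(squared_error)
    return scores


def select_frames(features, k):
    """Return the indices of the k highest-scoring frames in temporal order."""
    scores = taylor_surprise_scores(features)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Toy example: a smooth linear trajectory with an abrupt shift at frame 6.
features = [[float(t), 2.0 * t] for t in range(10)]
for t in range(6, 10):
    features[t][0] += 100.0

scores = taylor_surprise_scores(features)
selected = select_frames(features, 3)
```

In this naive form, the frames immediately after an abrupt change also score highly, because the extrapolation window still contains the jump. A practical sampler would likely need some form of temporal suppression or windowing, which this sketch leaves out.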

FrameSkip: Learning from Fewer but More Informative Frames in VLA Training

Hugging Face Daily Papers · 2026-05-13

FrameSkip is a data-layer frame selection method that improves Vision-Language-Action (VLA) policy training by prioritizing high-importance frames based on action variation and visual-coherence metrics, achieving a macro-average success rate of 76.15% across three benchmarks while using only 20% of unique frames.
