video-captioning


PEEK: Picking Essential frames via Efficient Knowledge distillation

Hugging Face Daily Papers · 2026-05-29

PEEK is an efficient dynamic frame sampling method for video captioning. It distills caption-conditioned frame relevance rankings from a teacher model into a lightweight temporal model, and is reported to outperform state-of-the-art methods while remaining computationally efficient.


NemoStation/Marlin-2B

Hugging Face Models Trending · 2026-05-13

NemoStation/Marlin-2B is a fine-tuned model based on Qwen3.5-2B for video-text-to-text tasks, supporting video captioning and temporal grounding.
