video-summarization

Tag

Cards List
#video-summarization

A Systematic Evaluation of Positional Bias in Multi-Video Summarization with MLLMs

arXiv cs.CL · yesterday Cached

Researchers from Jilin University systematically evaluate positional bias in multi-video summarization using MLLMs, constructing a benchmark from ActivityNet and News videos and assessing nine models with metrics including Coverage, Directional Positional Bias, and Middle-Edge Gap. Results show positional effects are domain- and model-dependent, and increasing visual or generation budget does not uniformly resolve the imbalance.

0 favorites 0 likes
#video-summarization

Video2LoRA: Parametric Video Internalization for Vision-Language Models

Hugging Face Daily Papers · 2d ago Cached

This paper introduces Video2LoRA, a method that predicts Low-Rank Adaptation (LoRA) weights directly from video representations, enabling efficient video processing in frozen vision-language models. It reduces visual token load by up to 1500x and query TTFT by 6-80x while maintaining performance on video summarization and captioning benchmarks.

0 favorites 0 likes
← Back to home

Submit Feedback