educational-videos

Leveraging Vision-Language Models to Detect Attention in Educational Videos

arXiv cs.AI · 2026-05-22

This paper explores using a Vision-Language Model (VLM) to detect attention loss in educational videos by combining gaze data with video content, but finds that VLM approaches do not outperform traditional machine learning baselines.
