Perceptron Inc. released its flagship video analysis model, Mk1, claiming 80-90% lower cost than competitors along with strong performance on spatial and video reasoning benchmarks.
This paper introduces three parameter-efficient methods for multi-view proficiency estimation on the Ego-Exo4D dataset, shifting from discriminative classification to generative feedback. The proposed models achieve state-of-the-art accuracy with significantly fewer parameters and training epochs than video-transformer baselines.