Tag
A developer at VideoDB highlights the problem of precise spatial output from vision models when used by agents, sharing that small grounding errors can lead to wrong actions, and announces an open-sourced evaluation harness for checking spatial accuracy on custom footage.