physical-properties

Tag

Cards List
#physical-properties

AFFORDANCE20Q: Evaluating Affordance Reasoning from Physical Properties

arXiv cs.AI · 2026-06-15 Cached

Affordance20Q is a benchmark that evaluates LLMs' ability to reason about object affordances from physical properties without revealing object identity, using a 20-Questions format. Experiments show a ~20 point gap between LLMs and humans, and a proposed pipeline KARI improves open-source LLMs by up to 15.2 points.

0 favorites 0 likes
← Back to home

Submit Feedback