semantic-reasoning

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

arXiv cs.CL · 2026-04-21

Researchers present SemanticQA, a benchmark that evaluates language models on semantic phrase processing, covering idioms, noun compounds, and verbal constructions. The results reveal significant performance variation across model architectures and scales.
