capability-estimation

Tag

Cards List
#capability-estimation

Capturing LLM Capabilities via Evidence-Calibrated Query Clustering

arXiv cs.AI · 2026-05-19 Cached

This paper introduces ECC, an algorithm that calibrates semantic embeddings with limited model comparisons to cluster queries by latent capability requirements, improving LLM capability ranking quality by over 17 percentage points over baselines.

0 favorites 0 likes
← Back to home

Submit Feedback