scientific-assistant


SCICONVBENCH: Benchmarking LLMs on Multi-Turn Clarification for Task Formulation in Computational Science

Hugging Face Daily Papers · 2026-05-18

SCICONVBENCH is a benchmark that evaluates how well LLMs handle multi-turn clarification of ill-posed scientific queries across computational science domains. It finds that even frontier models struggle with disambiguation and frequently make silent assumptions.
