structural-causal-models

ReplaySCM: A Benchmark for Executable Causal Mechanism Induction from Interventions

arXiv cs.LG · 3d ago

This article introduces ReplaySCM, a benchmark for evaluating how well language models induce executable causal mechanisms from interventional evidence. It focuses on semantic replay behavior rather than syntactic matches.
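To make the distinction between semantic replay and syntactic matching concrete, here is a minimal sketch in Python. It is not ReplaySCM's actual harness or scoring rule: the three-variable model, the `replay` and `replay_equivalent` helpers, and the intervention list are all hypothetical, chosen only to illustrate the idea that two mechanism sets can be written differently yet behave identically under interventions, while a third can agree observationally and still fail once a variable is intervened on.

```python
from typing import Callable, Dict, List, Optional

Values = Dict[str, int]
Mechanisms = Dict[str, Callable[[Values], int]]

# Hypothetical topological order for a toy three-variable SCM.
ORDER: List[str] = ["x", "y", "z"]


def replay(mechanisms: Mechanisms, order: List[str],
           intervention: Optional[Values] = None) -> Values:
    """Evaluate each variable in order; intervened variables are clamped (do-operator)."""
    intervention = intervention or {}
    values: Values = {}
    for var in order:
        if var in intervention:
            values[var] = intervention[var]       # do(var = value): ignore the mechanism
        else:
            values[var] = mechanisms[var](values)  # compute from earlier variables
    return values


def replay_equivalent(a: Mechanisms, b: Mechanisms, order: List[str],
                      interventions: List[Values]) -> bool:
    """True if both mechanism sets produce identical values under every intervention."""
    return all(replay(a, order, iv) == replay(b, order, iv) for iv in interventions)


# Reference mechanisms (illustrative only).
reference: Mechanisms = {
    "x": lambda v: 1,
    "y": lambda v: 2 * v["x"],
    "z": lambda v: v["x"] + v["y"],
}

# Written differently, but behaves the same under every intervention.
candidate: Mechanisms = {
    "x": lambda v: 1,
    "y": lambda v: v["x"] + v["x"],
    "z": lambda v: v["y"] + v["x"],
}

# Agrees with the reference when y is left alone, but ignores y when computing z.
wrong: Mechanisms = {
    "x": lambda v: 1,
    "y": lambda v: 2 * v["x"],
    "z": lambda v: 3 * v["x"],
}

# An empty dict means "no intervention" (the observational setting).
INTERVENTIONS: List[Values] = [{}, {"x": 2}, {"y": 5}, {"x": 0, "y": 3}]

if __name__ == "__main__":
    print(replay_equivalent(reference, candidate, ORDER, INTERVENTIONS))
    print(replay_equivalent(reference, wrong, ORDER, INTERVENTIONS))
```

In this toy setup, a string comparison of the source would reject `candidate` even though it replays identically, and only the interventions on `y` separate `wrong` from the reference. How ReplaySCM itself represents mechanisms and selects interventions is not stated in this summary.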
