causal-reasoning

Tag

Cards List
#causal-reasoning

PropLLM: Propagation-Aware Scene Reconstruction for Network Fault Diagnosis

arXiv cs.AI · 2d ago Cached

PropLLM integrates hop-by-hop scene reconstruction with LLMs for network fault diagnosis. It uses a dual-layer knowledge graph and a temporal causal propagation attention mechanism to trace back along propagation paths, improving accuracy and reducing hallucinations.

0 favorites 0 likes
#causal-reasoning

Evaluating Bivariate Causal Statements Based on Mutual Compatibility

arXiv cs.AI · 2d ago Cached

This paper introduces compatibility and incompatibility scores for evaluating collections of bivariate causal statements without relying on faithfulness, and demonstrates their applicability by analyzing causal claims from large language models.

0 favorites 0 likes
#causal-reasoning

BEAMS: Benchmarking and Evaluating AI for Modeling and Simulation

arXiv cs.AI · 6d ago Cached

The BEAMS Initiative presents a benchmark suite for evaluating AI tools in modeling and simulation, focusing on human-centered and responsible AI practices. Tests reveal variability across LLM-based engines, with better performance in qualitative tasks than causal reasoning.

0 favorites 0 likes
#causal-reasoning

SVI-Bench: A Dynamic Microworld for Strategic Video Intelligence

Hugging Face Daily Papers · 2026-05-29 Cached

Introduces SVI-Bench, a large-scale benchmark for strategic video intelligence using team sports, designed to evaluate models on dynamic scene understanding, causal reasoning, strategic simulation, and agentic synthesis. The benchmark reveals a capability cliff where models perform well on perceptual tasks but sharply degrade on higher-level strategic reasoning.

0 favorites 0 likes
#causal-reasoning

Why We Need World Models for AGI: Where LLMs Fail and How World Models May Outperform

arXiv cs.AI · 2026-05-26 Cached

This paper argues that large language models struggle with causal reasoning and long-horizon planning due to a mismatch between sequence prediction and reasoning over latent environment dynamics, and introduces the Latent Dynamics Inference perspective along with the Flux environment to study these limitations.

0 favorites 0 likes
#causal-reasoning

Why scaling alone will not give us rational AI

Reddit r/ArtificialInteligence · 2026-05-18

This article argues that fundamental architectural limitations, not scaling deficits, prevent current LLMs from achieving true rationality—the ability to recognize and switch frames—citing empirical failures like the reversal curse and frame-transfer issues, and suggests that scaling alone may not bridge this gap.

0 favorites 0 likes
#causal-reasoning

Deterministic Event-Graph Substrates as World Models for Counterfactual Reasoning

arXiv cs.AI · 2026-05-18 Cached

该论文提出并评估了一类称为事件图基质的因果推理世界模型,通过确定性重放在类型化RDF事件日志上进行反事实查询,在多个基准上优于基线模型,同时保证了可检查性和可重放一致性。

0 favorites 0 likes
#causal-reasoning

@berryxia: Folks, Google's latest paper has completely flipped the fundamental logic of time series forecasting. All previous models were stuck on historical data: whatever the curve does, predict that. Nexus says: prediction needs not just history, but "event context". The real reasons behind the numbers—policies, sudden events, macro trends, local shocks...

X AI KOLs Timeline · 2026-05-18 Cached

Google's new paper Nexus proposes transforming time series forecasting from statistical extrapolation to multi-agent reasoning, improving prediction accuracy via event context, achieving an 86.6% reduction in MAPE on the Zillow dataset.

0 favorites 0 likes
#causal-reasoning

ReplaySCM: A Benchmark for Executable Causal Mechanism Induction from Interventions

arXiv cs.LG · 2026-05-12 Cached

This article introduces ReplaySCM, a benchmark designed to evaluate language models' ability to induce executable causal mechanisms from interventional evidence, focusing on semantic replay behavior rather than syntactic matches.

0 favorites 0 likes
#causal-reasoning

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

arXiv cs.LG · 2026-05-08 Cached

This paper identifies a critical 'model collapse' issue in standard fine-tuning for causal reasoning and proposes a semantic loss function with graph-based logical constraints to prevent it.

0 favorites 0 likes
← Back to home

Submit Feedback