logical-reasoning

Tag

Cards List
#logical-reasoning

When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection

arXiv cs.LG · 2026-05-27 Cached

The paper introduces Chimera Training, a method for logical anomaly detection that uses counterfactual construction at the feature level to train neural rule evaluators without requiring real anomalous images, improving rule-level anomaly detection performance on benchmarks like CLEVRER, OpenImages, and VidOR.

0 favorites 0 likes
#logical-reasoning

ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale

arXiv cs.LG · 2026-05-26 Cached

ChaosBench-Logic v2 is a large-scale benchmark of 40,886 questions over 165 dynamical systems that evaluates LLMs' logical reasoning abilities, revealing near-random performance on regime transition reasoning and systematic failure modes even in frontier models.

0 favorites 0 likes
#logical-reasoning

High Quality Embeddings for Horn Logic Reasoning

arXiv cs.AI · 2026-05-22 Cached

This paper introduces novel methods for generating high-quality embeddings for Horn logic reasoning using triplet loss, including techniques for balanced training example generation and hard example emphasis, which improve the efficiency of downstream logical reasoning.

0 favorites 0 likes
#logical-reasoning

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

arXiv cs.CL · 2026-05-20 Cached

LLMEval-Logic is a new Chinese benchmark for evaluating logical reasoning in LLMs, featuring solver-verified answers and adversarial hardening. The benchmark reveals significant gaps in current models, with the best reaching only 37.5% accuracy on hard items.

0 favorites 0 likes
#logical-reasoning

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Hugging Face Daily Papers · 2026-05-07 Cached

This paper introduces ScaleLogic, a framework demonstrating that RL training compute scales as a power law with reasoning depth in LLMs. It highlights that logical expressiveness is key to improving downstream transfer and training efficiency.

0 favorites 0 likes
← Back to home

Submit Feedback