ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models
Summary
ReasoningLens is an open-source framework that provides hierarchical visualization and diagnostic auditing for complex reasoning chains in large reasoning models, enabling structured analysis and error detection.
View Cached Full Text
Cached at: 06/30/26, 03:33 AM
Paper page - ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models
Source: https://huggingface.co/papers/2606.23404
Abstract
ReasoningLens is an open-source framework that provides hierarchical visualization and diagnostic auditing for complex reasoning chains in large reasoning models, enabling structured analysis and error detection through interactive hierarchies and automated auditing.
The emergence of Large Reasoning Models has introduced exceptionally longChain-of-Thought traces, creating a transparency burden where critical logic is often buried under massive procedural text. To address this, we present ReasoningLens, an open-source framework designed for thehierarchical visualizationanddiagnostic auditingof complex reasoning chains. ReasoningLens addressesinformation necropsyby: (1) structuring traces into interactive hierarchies that separate high-level strategy from low-level execution; (2) leveraging anagentic auditorfor automated error detection andtool-augmented verification; and (3) synthesizingsystemic reasoning profilesto reveal model-specific blind spots. By transforming unstructured walls of text into actionable insights, ReasoningLens provides a modular foundation for interpreting, debugging, and optimizing the next generation of reasoning-centric AI.
View arXiv pageView PDFGitHub24Add to collection
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2606.23404 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2606.23404 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2606.23404 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
Enhanced and Efficient Reasoning in Large Learning Models
This paper proposes a method for improving reasoning in large language models by recoding data to explicitly represent relationships, enabling efficient principled reasoning with polynomial-time learnability for relational rules, which addresses hallucinations and supports sound reasoning across multiple calls.
Decoding the Critique Mechanism in Large Reasoning Models
This paper investigates how large reasoning models can detect and correct their own errors internally, identifying a highly interpretable critique vector that enhances error detection without additional training, improving test-time scaling performance.
ReasoningFlow: Discourse Structures for Understanding LLM Reasoning Traces
Introduces ReasoningFlow, a framework to capture discourse structures of large language model reasoning traces as directed acyclic graphs, enabling fine-grained analysis of reasoning behaviors like self-reflection and backtracking. Based on manual and automatic annotation of thousands of traces, it reveals structural similarities across models and that most erroneous steps do not contribute to final answers.
Where Instruction Hierarchy Breaks: Diagnosing and Repairing Failures in Reasoning Language Models
This paper introduces a white-box diagnostic framework that localizes instruction hierarchy failures in reasoning language models into identification, conflict resolution, and response realization stages. It evaluates several models and proposes two training-free self-monitoring mechanisms that reduce non-compliance by 81–99%.
Reasoning emerges from constrained inference manifolds in large language models
This paper investigates reasoning in LLMs as an intrinsic dynamical process, finding that inference-time representations self-organize into low-dimensional manifolds. It proposes a label-free diagnostic based on internal dynamics to assess reasoning quality, suggesting that effective reasoning is governed by geometric and informational constraints.