ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models

Hugging Face Daily Papers Papers

Summary

ReasoningLens is an open-source framework that provides hierarchical visualization and diagnostic auditing for complex reasoning chains in large reasoning models, enabling structured analysis and error detection.

The emergence of Large Reasoning Models has introduced exceptionally long Chain-of-Thought traces, creating a transparency burden where critical logic is often buried under massive procedural text. To address this, we present ReasoningLens, an open-source framework designed for the hierarchical visualization and diagnostic auditing of complex reasoning chains. ReasoningLens addresses information necropsy by: (1) structuring traces into interactive hierarchies that separate high-level strategy from low-level execution; (2) leveraging an agentic auditor for automated error detection and tool-augmented verification; and (3) synthesizing systemic reasoning profiles to reveal model-specific blind spots. By transforming unstructured walls of text into actionable insights, ReasoningLens provides a modular foundation for interpreting, debugging, and optimizing the next generation of reasoning-centric AI.
Original Article
View Cached Full Text

Cached at: 06/30/26, 03:33 AM

Paper page - ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models

Source: https://huggingface.co/papers/2606.23404

Abstract

ReasoningLens is an open-source framework that provides hierarchical visualization and diagnostic auditing for complex reasoning chains in large reasoning models, enabling structured analysis and error detection through interactive hierarchies and automated auditing.

The emergence of Large Reasoning Models has introduced exceptionally longChain-of-Thought traces, creating a transparency burden where critical logic is often buried under massive procedural text. To address this, we present ReasoningLens, an open-source framework designed for thehierarchical visualizationanddiagnostic auditingof complex reasoning chains. ReasoningLens addressesinformation necropsyby: (1) structuring traces into interactive hierarchies that separate high-level strategy from low-level execution; (2) leveraging anagentic auditorfor automated error detection andtool-augmented verification; and (3) synthesizingsystemic reasoning profilesto reveal model-specific blind spots. By transforming unstructured walls of text into actionable insights, ReasoningLens provides a modular foundation for interpreting, debugging, and optimizing the next generation of reasoning-centric AI.

View arXiv pageView PDFGitHub24Add to collection

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2606.23404 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2606.23404 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2606.23404 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

Similar Articles

Enhanced and Efficient Reasoning in Large Learning Models

arXiv cs.AI

This paper proposes a method for improving reasoning in large language models by recoding data to explicitly represent relationships, enabling efficient principled reasoning with polynomial-time learnability for relational rules, which addresses hallucinations and supports sound reasoning across multiple calls.

Decoding the Critique Mechanism in Large Reasoning Models

Hugging Face Daily Papers

This paper investigates how large reasoning models can detect and correct their own errors internally, identifying a highly interpretable critique vector that enhances error detection without additional training, improving test-time scaling performance.

ReasoningFlow: Discourse Structures for Understanding LLM Reasoning Traces

arXiv cs.CL

Introduces ReasoningFlow, a framework to capture discourse structures of large language model reasoning traces as directed acyclic graphs, enabling fine-grained analysis of reasoning behaviors like self-reflection and backtracking. Based on manual and automatic annotation of thousands of traces, it reveals structural similarities across models and that most erroneous steps do not contribute to final answers.

Reasoning emerges from constrained inference manifolds in large language models

arXiv cs.LG

This paper investigates reasoning in LLMs as an intrinsic dynamical process, finding that inference-time representations self-organize into low-dimensional manifolds. It proposes a label-free diagnostic based on internal dynamics to assess reasoning quality, suggesting that effective reasoning is governed by geometric and informational constraints.