RAG-Anything: All-in-One RAG Framework

Papers with Code Trending Papers

Summary

RAG-Anything is a new open-source framework that enhances multimodal knowledge retrieval by integrating cross-modal relationships and semantic matching, outperforming existing methods on complex benchmarks.

Retrieval-Augmented Generation (RAG) has emerged as a fundamental paradigm for expanding Large Language Models beyond their static training limitations. However, a critical misalignment exists between current RAG capabilities and real-world information environments. Modern knowledge repositories are inherently multimodal, containing rich combinations of textual content, visual elements, structured tables, and mathematical expressions. Yet existing RAG frameworks are limited to textual content, creating fundamental gaps when processing multimodal documents. We present RAG-Anything, a unified framework that enables comprehensive knowledge retrieval across all modalities. Our approach reconceptualizes multimodal content as interconnected knowledge entities rather than isolated data types. The framework introduces dual-graph construction to capture both cross-modal relationships and textual semantics within a unified representation. We develop cross-modal hybrid retrieval that combines structural knowledge navigation with semantic matching. This enables effective reasoning over heterogeneous content where relevant evidence spans multiple modalities. RAG-Anything demonstrates superior performance on challenging multimodal benchmarks, achieving significant improvements over state-of-the-art methods. Performance gains become particularly pronounced on long documents where traditional approaches fail. Our framework establishes a new paradigm for multimodal knowledge access, eliminating the architectural fragmentation that constrains current systems. Our framework is open-sourced at: https://github.com/HKUDS/RAG-Anything.
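
The dual-graph and hybrid-retrieval ideas can be made concrete with a short sketch. The Python below is purely illustrative and is not the RAG-Anything API: the DualGraph class, the entity names, and the bag-of-words embedding are assumptions invented for this example, standing in for the paper's learned representations and richer relation types. It shows the two stages the abstract describes: semantic matching to find seed entities, then structural navigation so cross-modal neighbors (a cited table or figure) are returned alongside the matching text.

# Illustrative sketch only -- NOT the RAG-Anything API.
from collections import defaultdict
from dataclasses import dataclass
import math

@dataclass
class Entity:
    eid: str
    modality: str  # "text", "table", "figure", or "equation"
    content: str

class DualGraph:
    """Two overlaid graphs on one entity set: cross-modal relation edges
    (e.g. paragraph-cites-table) and textual semantic edges."""
    def __init__(self):
        self.entities: dict[str, Entity] = {}
        self.cross_modal: dict[str, set[str]] = defaultdict(set)
        self.semantic: dict[str, set[str]] = defaultdict(set)

    def add(self, e: Entity) -> None:
        self.entities[e.eid] = e

    def link(self, a: str, b: str, kind: str) -> None:
        g = self.cross_modal if kind == "cross_modal" else self.semantic
        g[a].add(b)
        g[b].add(a)

    def neighbors(self, eid: str) -> set[str]:
        # Unified view over both graphs -- the "unified representation".
        return self.cross_modal[eid] | self.semantic[eid]

def embed(text: str) -> dict:
    # Toy term-frequency "embedding"; a real system would use a neural
    # text/vision encoder per modality.
    v = defaultdict(float)
    for tok in text.lower().split():
        v[tok] += 1.0
    return v

def cosine(u: dict, v: dict) -> float:
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def hybrid_retrieve(g: DualGraph, query: str, k: int = 1) -> list:
    # Step 1, semantic matching: rank entities by embedding similarity.
    qv = embed(query)
    scores = {eid: cosine(qv, embed(e.content)) for eid, e in g.entities.items()}
    seeds = sorted(scores, key=scores.get, reverse=True)[:k]
    # Step 2, structural navigation: expand seeds along graph edges so
    # evidence in other modalities (tables, figures) rides along.
    hits = list(seeds)
    for s in seeds:
        hits.extend(n for n in g.neighbors(s) if n not in hits)
    return [g.entities[h] for h in hits]

# Toy corpus: a paragraph plus the table and figure it discusses.
g = DualGraph()
g.add(Entity("p1", "text", "accuracy improves on long documents"))
g.add(Entity("t1", "table", "doc length vs accuracy results"))
g.add(Entity("f1", "figure", "caption: accuracy versus document length"))
g.link("p1", "t1", "cross_modal")
g.link("p1", "f1", "cross_modal")

for e in hybrid_retrieve(g, "accuracy on long documents"):
    print(e.eid, e.modality)  # p1, then its linked table and figure

Even with only a text query, the table and figure are retrieved because they are graph neighbors of the best-matching paragraph; this is the behavior the paper attributes to combining structural knowledge navigation with semantic matching.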

Paper page - RAG-Anything: All-in-One RAG Framework

Source: https://huggingface.co/papers/2510.12323 (published Oct 14, 2025)

Links: arXiv (arxiv.org/abs/2510.12323) · PDF · GitHub repository (19.9k stars)

Get this paper in your agent:

hf papers read 2510.12323

Don’t have the latest CLI? Install it with: curl -LsSf https://hf.co/cli/install.sh | bash

Models, datasets, and Spaces citing this paper: none yet.

Collections including this paper: 37

Similar Articles

HKUDS/RAG-Anything

GitHub Trending (daily)

HKUDS released RAG-Anything, an open-source all-in-one multimodal retrieval-augmented generation framework based on LightRAG.

AgenticRAG: Agentic Retrieval for Enterprise Knowledge Bases

arXiv cs.AI

This paper introduces AgenticRAG, a framework from Microsoft that enhances enterprise knowledge base retrieval by equipping LLMs with tools for iterative search, document navigation, and analysis. It demonstrates significant improvements in recall and factuality over standard RAG pipelines on multiple benchmarks.

LightRAG: Simple and Fast Retrieval-Augmented Generation

Papers with Code Trending

The article introduces LightRAG, an open-source framework that enhances Retrieval-Augmented Generation by integrating graph structures for improved contextual awareness and efficient information retrieval.

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

arXiv cs.CL

Disco-RAG proposes a discourse-aware retrieval-augmented generation framework that integrates discourse signals through intra-chunk discourse trees and inter-chunk rhetorical graphs to improve knowledge synthesis in LLMs. The method achieves state-of-the-art results on QA and summarization benchmarks without fine-tuning.