multimodal-rag

#multimodal-rag

From Scenes to Elements: Multi-Granularity Evidence Retrieval for Verifiable Multimodal RAG

arXiv cs.CL ↗ · 3d ago Cached

This paper introduces GranuVistaVQA, a multimodal benchmark with element-level annotations, and GranuRAG, a framework that treats visual elements as first-class retrieval units for verifiable multimodal RAG, achieving up to 29.2% improvement over baselines.

0 favorites 0 likes

#multimodal-rag

Retrieval-Augmented Tutoring for Algorithm Tracing and Problem-Solving in AI Education

arXiv cs.AI ↗ · 4d ago Cached

This paper presents KITE, a Retrieval-Augmented Generation (RAG)-based intelligent tutoring system for algorithmic reasoning and problem-solving in AI education. The system uses intent-aware Socratic response strategies and multimodal RAG to provide course-grounded, pedagogically appropriate feedback, and is evaluated through metrics, expert review, and simulated student interactions.

0 favorites 0 likes

multimodal-rag

From Scenes to Elements: Multi-Granularity Evidence Retrieval for Verifiable Multimodal RAG

Retrieval-Augmented Tutoring for Algorithm Tracing and Problem-Solving in AI Education

Submit Feedback