visual-representations

#visual-representations

LLM Agents Can See Code Repositories

Hugging Face Daily Papers · 5d ago

This paper presents the first systematic empirical study of using visual repository representations to enhance LLM-based coding agents, showing that integrating visual graphs as a supplementary modality reduces token consumption by up to 26% while maintaining or improving issue-resolution accuracy.


Causal Probing for Internal Visual Representations in Multimodal Large Language Models

arXiv cs.AI · 2026-05-08

This paper proposes a causal framework for probing internal visual representations in Multimodal Large Language Models, revealing differences in how entities and abstract concepts are encoded. The study highlights that increasing model depth is crucial for encoding abstract concepts and uncovers a disconnect between perception and reasoning in current MLLMs.
