multi-modal-llm

Tag

Cards List
#multi-modal-llm

RS-Claw: Progressive Active Tool Exploration via Hierarchical Skill Trees for Remote Sensing Agents

arXiv cs.AI · 2026-05-14 Cached

RS-Claw proposes an active tool exploration paradigm for remote sensing agents using hierarchical skill trees, enabling on-demand sequential decision-making and achieving up to 86% input token compression while outperforming passive selection baselines on Earth-Bench.

0 favorites 0 likes
#multi-modal-llm

FoodCHA: Multi-Modal LLM Agent for Fine-Grained Food Analysis

arXiv cs.AI · 2026-05-08 Cached

This paper introduces FoodCHA, a multi-modal LLM agent framework designed for fine-grained food analysis, addressing challenges in hierarchical consistency and attribute discrimination for dietary monitoring.

0 favorites 0 likes
← Back to home

Submit Feedback