modular-framework

#modular-framework

CaVe-VLM-CoT: An Interpretable Vision-Language Model Framework

arXiv cs.AI ↗ · 2026-06-18 Cached

CaVe-VLM-CoT is a modular reflection-based agentic-RAG framework for vision-language models that enforces evidence-grounded reasoning through a five-stage pipeline, achieving 87.1% accuracy on ScienceQA and proposing a suite of 23 metrics for evaluation.

0 favorites 0 likes

#modular-framework

AgentSpec: Understanding Embodied Agent Scaffolds Through Controlled Composition

arXiv cs.CL ↗ · 2026-06-15 Cached

Introduces AgentSpec, a modular specification framework for systematically composing and analyzing embodied LLM agent scaffolds, revealing that performance depends on scaffold compatibility and interaction effects rather than isolated module strength.

0 favorites 0 likes

#modular-framework

Palette: A Modular, Controllable, and Efficient Framework for On-demand Authorized Safety Alignment Relaxation in LLMs

arXiv cs.AI ↗ · 2026-05-26 Cached

Palette proposes a modular framework for selectively relaxing safety refusal behaviors in LLMs for authorized professional domains, using multi-objective search and lightweight adaptation to avoid costly retraining.

0 favorites 0 likes

#modular-framework

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

Hugging Face Daily Papers ↗ · 2026-05-07 Cached

GeoStack introduces a geometric framework to compose independently trained domain experts in Vision-Language Models without catastrophic forgetting, achieving constant-time inference and a 10x reduction in geometric error.

0 favorites 0 likes

modular-framework

CaVe-VLM-CoT: An Interpretable Vision-Language Model Framework

AgentSpec: Understanding Embodied Agent Scaffolds Through Controlled Composition

Palette: A Modular, Controllable, and Efficient Framework for On-demand Authorized Safety Alignment Relaxation in LLMs

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

Submit Feedback