sequential-decision-making

Tag

Cards List
#sequential-decision-making

Agentick: A Unified Benchmark for General Sequential Decision-Making Agents

arXiv cs.AI · 2026-05-11 Cached

This paper introduces Agentick, a unified benchmark for evaluating general sequential decision-making agents across RL, LLM, and VLM paradigms. It provides 37 procedurally generated tasks and reveals that no single approach currently dominates, highlighting significant room for improvement in agent autonomy.

0 favorites 0 likes
#sequential-decision-making

PRISM: Perception Reasoning Interleaved for Sequential Decision Making

arXiv cs.AI · 2026-05-08 Cached

This paper introduces PRISM, a framework that integrates Vision-Language Models and Large Language Models through a dynamic question-answering pipeline to improve sequential decision-making in embodied AI tasks.

0 favorites 0 likes
← Back to home

Submit Feedback