robot-planning

#robot-planning

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

Hugging Face Daily Papers ↗ · 2026-06-16 Cached

GeneralVLA-2 introduces GeoFuse-MV3D for improved 3D reconstruction and a governed KnowledgeBank for better memory management in robotic manipulation tasks, achieving performance gains on several benchmarks.

0 favorites 0 likes

#robot-planning

EgoPhys: Learning Generalizable Physics Models of Deformable Objects from Egocentric Video

Hugging Face Daily Papers ↗ · 2026-06-15 Cached

EgoPhys introduces a framework to construct deformable physical digital twins from egocentric RGB video using generalizable priors and a compact codebook, enabling zero-shot generalization to unseen objects without per-spring optimization. The system is demonstrated on a real robot, showing that egocentric human play video can serve as internal world representation for deformable-object planning.

0 favorites 0 likes

#robot-planning

What Objects Enable, Not What They Are: Functional Latent Spaces for Affordance Reasoning

arXiv cs.LG ↗ · 2026-06-05 Cached

This paper introduces A4D, a framework that maps visual observations into a shared latent space structured around affordances (e.g., 'movable') for robot planning. It achieves 94% inference accuracy on existing affordances, outperforming state-of-the-art by 15%, and enables 100x faster inference with superior generalization to unseen object functionalities.

0 favorites 0 likes

#robot-planning

Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding

Hugging Face Daily Papers ↗ · 2026-05-12 Cached

This paper introduces LC-MAPF, a pre-trained model with a learnable communication module for multi-agent pathfinding that improves coordination and outperforms existing learning-based solvers while maintaining scalability.

0 favorites 0 likes

#robot-planning

A better method for planning complex visual tasks

MIT News — Artificial Intelligence ↗ · 2026-03-11 Cached

MIT researchers developed VLMFP, a two-stage generative AI approach combining vision-language models with formal planning software to achieve 70% success rate on complex visual planning tasks like robot navigation, nearly 2.3x better than existing baselines. The method automatically translates visual scenarios into planning files that classical solvers can process, enabling effective long-horizon planning in novel environments.

0 favorites 0 likes

robot-planning

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

EgoPhys: Learning Generalizable Physics Models of Deformable Objects from Egocentric Video

What Objects Enable, Not What They Are: Functional Latent Spaces for Affordance Reasoning

Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding

A better method for planning complex visual tasks

Submit Feedback