rgb-d

#rgb-d

Human Universal Grasping

Hugging Face Daily Papers · 6d ago

A flow-matching model generates diverse human grasps from RGB-D images, enabling zero-shot robotic grasping. Trained on a large egocentric dataset, the model significantly outperforms state-of-the-art baselines on a new benchmark.

Revisiting Articulated Parts Perception in Robot Manipulation

Hugging Face Daily Papers · 2026-06-06

This paper introduces Geometric Primary Structure (GPS), a new representation for articulated parts perception in robot manipulation, enabling efficient VR-based annotation and achieving a 73% success rate without fine-tuning.

AFUN: Towards an Affordance Foundation Model for Functionality Understanding

Hugging Face Daily Papers · 2026-06-01

AFUN proposes an affordance foundation model that predicts functional masks and 3D motion curves from RGB-D observations and language descriptions, enabling generalizable robot manipulation across diverse environments. The model outperforms baselines on multiple benchmarks and can be deployed for real-world tasks without fine-tuning.

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

Hugging Face Daily Papers · 2026-05-15

This paper proposes COVER, a training-free method for converting 3D assets into sparse panoramic RGB-D-pose data with complete scene coverage and low redundancy, and introduces the CM-EVS dataset containing 36,373 curated frames from indoor and outdoor scenes.
