3d-vision

#3d-vision

A Cookbook of 3D Vision: Data, Learning Paradigms, and Application

Hugging Face Daily Papers ↗ · 2026-06-02 Cached

This paper presents a comprehensive taxonomy of 3D vision research, covering geometric representations, datasets, learning paradigms, and applications in reconstruction, generation, and video modeling.

0 favorites 0 likes

#3d-vision

Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

This paper introduces a post-training framework that leverages 3D priors from SAM3D to improve semantic correspondence in 2D foundation features, addressing issues like left-right confusion and repeated parts. The method uses instance-specific 3D reconstruction without pose annotations or spherical geometry shortcuts.

0 favorites 0 likes

#3d-vision

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Hugging Face Daily Papers ↗ · 2026-05-26 Cached

SpatialBench is a comprehensive benchmark for evaluating spatial foundation models across diverse domains and tasks, revealing limitations in current models and introducing DA-Next-5M and DA-Next to advance spatial representation learning.

0 favorites 0 likes

#3d-vision

@rwayne: Regarding how to pursue a PhD, a Zhejiang University expert posted the solution directly on GitHub. It covers the entire research lifecycle, including getting started, topic selection, conducting experiments, advisor meetings, project management, writing, rebuttals, and presentation slides. The `getting_started` file for the 3D Vision direction...

X AI KOLs Timeline ↗ · 2026-05-10

A Zhejiang University researcher shared a comprehensive PhD guide on GitHub, covering the entire research lifecycle from topic selection to rebuttals, specifically tailored for the 3D Vision direction.

0 favorites 0 likes

#3d-vision

facebook/VGGT-Omega

Hugging Face Models Trending ↗ · 2026-03-17 Cached

Meta AI and Oxford VGG released VGGT-Omega, a foundation model for 3D vision, with project page and GitHub repository.

0 favorites 0 likes

3d-vision

A Cookbook of 3D Vision: Data, Learning Paradigms, and Application

Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

facebook/VGGT-Omega

Submit Feedback