Tag
Fable 5 has been released, capable of generating games, cities, or recreating classics with just one sentence, demonstrating the powerful ability of text-to-3D content.
This paper identifies and addresses 'latent sink traps' in text-to-3D generative models where they become insensitive to text prompts, proposing a framework that decouples geometric representation from linguistic sensitivity to enable robust text-based 3D shape editing of out-of-distribution shapes.
HY-World 2.0 is a multi-modal world model framework that generates high-fidelity 3D Gaussian Splatting scenes from text, images, and videos through specialized modules for panorama generation, trajectory planning, and scene composition, achieving state-of-the-art performance among open-source approaches.
OpenAI introduces Point-E, a system for generating 3D point clouds from text prompts in 1-2 minutes on a single GPU by combining text-to-image and image-to-3D diffusion models. The method achieves significant speedup over prior methods while releasing pre-trained models and code.