@HowToAI_: Microsoft has released a 4B parameter model that turns any image into a 3D asset in 3 seconds. It uses a new geometry f…
Summary
Microsoft released a 4B parameter model that converts any image into a 3D asset in 3 seconds, using the O-Voxel geometry format and outputting GLB files with full PBR textures, compatible with Blender, Unity, and Unreal.
View Cached Full Text
Cached at: 05/19/26, 04:39 AM
Microsoft has released a 4B parameter model that turns any image into a 3D asset in 3 seconds.
It uses a new geometry format called O-Voxel that converts to a textured mesh in under 100ms on CUDA.
Outputs GLB files with full PBR textures, ready for Blender, Unity, and Unreal. https://t.co/KdKwQ1FZth
Similar Articles
@HuggingPapers: Microsoft just released Lens on Hugging Face A 3.8B parameter text-to-image model delivering efficient training and hig…
Microsoft released Lens, a 3.8B parameter text-to-image model on Hugging Face, capable of efficient training and high-resolution generation up to 1440×1440.
TencentARC/Pixal3D
Pixal3D is a high-fidelity single-image-to-3D model by TencentARC and Microsoft, which explicitly lifts pixel features into 3D via back-projection for near-reconstruction-level geometry and PBR textures. The model is accepted to SIGGRAPH 2026, with inference code and demo available.
@Azure: Three open-source image models, one platform. Microsoft Foundry and Hugging Face bring developers the largest catalog f…
Microsoft Foundry integrates three open-source image models (SDXL, FLUX.1-schnell, and Z-Image-Turbo) via Hugging Face, offering developers a unified platform for AI image generation.
microsoft/Lens
Microsoft releases Lens, a 3.8B-parameter foundational text-to-image model designed for efficient training and fast high-resolution generation, achieving competitive quality with reduced compute.
New AI 3D Model Generates Modular UE5 Environments From a Single Image
A new AI model can generate modular Unreal Engine 5 environments from a single input image, enabling rapid 3D scene creation.