Comfy-Org/ERNIE-Image
Summary
Comfy-Org has repackaged Baidu's ERNIE-Image and ERNIE-Image-Turbo models for ComfyUI integration, providing ready-to-use model files organized for the ComfyUI node-based image generation framework.
View Cached Full Text
Cached at: 04/20/26, 02:47 PM
Comfy-Org/ERNIE-Image Β· Hugging Face
Source: https://huggingface.co/Comfy-Org/ERNIE-Image
Repackaged model files for ComfyUI.
Original model repository:
Place the files in the following folders:
π ComfyUI/
βββ π models/
β βββ π diffusion_models/
β β βββ ernie-image.safetensors
β β βββ ernie-image-turbo.safetensors
β βββ π text_encoders/
β β βββ ernie-image-prompt-enhancer.safetensors
β β βββ ministral-3-3b.safetensors
β βββ π vae/
β βββ flux2-vae.safetensors
Model tree forComfy-Org/ERNIE-Imagehttps://huggingface.co/docs/hub/model-cards#specifying-a-base-model
Space usingComfy-Org/ERNIE-Image1
Similar Articles
baidu/ERNIE-Image-Turbo
Baidu releases ERNIE-Image-Turbo, a distilled text-to-image generation model that achieves fast generation in 8 inference steps while maintaining strong text rendering, instruction following, and structured image generation capabilities.
baidu/ERNIE-Image
Baidu releases ERNIE-Image, an open-weight text-to-image generation model with 8B parameters built on Diffusion Transformer architecture, achieving state-of-the-art performance among open-weight models with strong capabilities in text rendering, instruction following, and structured image generation.
unsloth/ERNIE-Image-Turbo-GGUF
unsloth releases a GGUF quantized version of Baidu's ERNIE-Image-Turbo model using Unsloth Dynamic 2.0 methodology, enabling efficient text-to-image generation in 8 inference steps on consumer GPUs with 24GB VRAM.
New BEST local AI image generator is here!
Ernie Image, a new open-source diffusion model, surpasses Zage in text rendering and prompt fidelity and can be run locally via ComfyUI with ~20 GB VRAM.
@heyshrutimishra: Baidu recently open-sourced ERNIE-Image, an 8B parameter model with weights available for commercial use. This is big. β¦
Baidu open-sourced ERNIE-Image, an 8B parameter text-to-image model with commercial-use weights, making it one of the few fully open and fine-tunable alternatives to closed models like Midjourney.