image-synthesis

#image-synthesis

GEAR: Guided End-to-End AutoRegression for Image Synthesis

Hugging Face Daily Papers ↗ · 5d ago Cached

GEAR proposes a method to jointly train a vector-quantized tokenizer and autoregressive generator end-to-end via representation alignment, achieving up to 10x faster convergence on ImageNet gFID compared to strong baselines.

0 favorites 0 likes

#image-synthesis

Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis

Hugging Face Daily Papers ↗ · 6d ago Cached

This paper proposes Nemotron-Labs-Diffusion-Image, a masked discrete diffusion model for high-resolution text-to-image synthesis, introducing a token-editing mechanism and grouped cross-entropy objective to improve token refinement and training efficiency.

0 favorites 0 likes

#image-synthesis

Colored Noise Diffusion Sampling

Hugging Face Daily Papers ↗ · 2026-05-28 Cached

Introduces Colored Noise Sampling (CNS), a training-free stochastic solver for diffusion models that dynamically allocates energy based on frequency-dependent schedules, improving image quality metrics like FID significantly on ImageNet-256.

0 favorites 0 likes

#image-synthesis

Efficient Image Synthesis with Sphere Latent Encoder

Hugging Face Daily Papers ↗ · 2026-05-15 Cached

This paper proposes Sphere Latent Encoder, an efficient few-step image generation framework that performs denoising entirely in a spherical latent space, achieving high-quality 256×256 images with significantly reduced computational cost and improved FID scores on ImageNet-1K.

0 favorites 0 likes

image-synthesis

GEAR: Guided End-to-End AutoRegression for Image Synthesis

Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis

Colored Noise Diffusion Sampling

Efficient Image Synthesis with Sphere Latent Encoder

Submit Feedback