HiDream-ai has open-sourced HiDream-O1-Image (8B), a unified image generative foundation model built on a Pixel-level Unified Transformer (UiT) that natively handles text-to-image, image editing, and subject-driven personalization at up to 2048×2048 resolution without external VAEs or disjoint text encoders. It debuted at #8 in the Artificial Analysis Text to Image Arena and is positioned as a leading open-weights text-to-image model.
This paper introduces Continuous-Time Distribution Matching (CDM), a method for few-step diffusion distillation that moves the matching objective from discrete timesteps to continuous time, improving visual fidelity and preserving fine details.
This paper introduces D-OPSD, a novel training paradigm for step-distilled diffusion models that enables on-policy self-distillation during supervised fine-tuning. It allows models to learn new concepts or styles without compromising their efficient few-step inference capabilities.
The paper introduces JoyAI-Image, a unified multimodal foundation model that integrates a spatially enhanced MLLM with MMDiT to achieve state-of-the-art performance in visual understanding, text-to-image generation, and instruction-guided editing.
This paper introduces FD-loss, a method to optimize Fréchet Distance as a training objective for visual generation by decoupling population and batch sizes. It demonstrates that this approach improves generator quality and suggests FID may not always accurately reflect visual quality.
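The metric behind FD-loss is the Fréchet distance between the feature distributions of real and generated images; under a Gaussian assumption it has a closed form. A minimal sketch of that formula for the diagonal-covariance case (this is illustrative only, not the paper's FD-loss implementation, and `frechet_gaussian` is a hypothetical helper name):

```python
import numpy as np

def frechet_gaussian(mu1, var1, mu2, var2):
    """Squared Frechet distance between two Gaussians with diagonal
    covariances: ||mu1 - mu2||^2 + sum_i (sqrt(var1_i) - sqrt(var2_i))^2.

    In the full FID formula the variance term is
    Tr(S1 + S2 - 2 (S1 S2)^{1/2}); with diagonal covariances the
    matrix square root reduces to elementwise square roots.
    """
    mu1, var1 = np.asarray(mu1, float), np.asarray(var1, float)
    mu2, var2 = np.asarray(mu2, float), np.asarray(var2, float)
    mean_term = np.sum((mu1 - mu2) ** 2)
    cov_term = np.sum((np.sqrt(var1) - np.sqrt(var2)) ** 2)
    return mean_term + cov_term

# Identical covariances, means one unit apart -> distance is purely the mean gap.
d = frechet_gaussian([0.0, 0.0], [1.0, 1.0], [1.0, 0.0], [1.0, 1.0])
```

The paper's contribution is making a statistic of this kind usable as a differentiable training objective by decoupling the population size (used to estimate the statistics) from the minibatch size (used for gradients); the closed form above is just the quantity being optimized.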
Tuna-2 is a unified multimodal model that achieves state-of-the-art performance by processing visual understanding and generation directly from pixel embeddings, eliminating the need for pretrained vision encoders.
Within 24 hours of OpenAI's launch of ChatGPT Images 2.0, users unleashed a flood of creative, viral image demos.
GPT Image 2 impresses users with its ability to blend abstract concepts from GTA 6 and Cyberpunk 2077 into a cohesive screenshot.
A user reports Gemini 3.1 Pro unexpectedly streaming an image as line-by-line Base64 instead of returning a normal file.
GPT-Image-2 can generate highly detailed interiors of WW2 submarines rendered in the distinctive low-poly GoldSrc style of Half-Life 1.
Article discusses evaluating GPT-Image-2's capabilities through a 'President Test' scenario.
Researchers introduce GSI-Bench, the first benchmark to quantify generative spatial intelligence in multimodal models by evaluating 3D spatial constraint compliance during image generation. Fine-tuning on their synthetic dataset boosts both spatial editing fidelity and downstream spatial understanding, showing generative training can strengthen spatial reasoning.
GPT-Image-2 shows a major leap in image generation quality, enabling Agent-S to auto-create polished slide decks and apps.
A developer created an immersive "time machine" tool using OpenAI’s new image model that generates explorable panoramic scenes from text prompts.
ChatGPT Images 2.0 now supports configurable aspect ratios and resolution, as demonstrated by user @dibyayB.
OpenAI researchers explain the advances that make ChatGPT Images 2.0 a state-of-the-art image generation model, highlighting its thinking and intelligence capabilities.
OpenAI released ChatGPT Images 2.0, claiming a GPT-3-to-GPT-5 leap; Simon Willison benchmarks it with a "Where's Waldo"-style raccoon-and-ham-radio prompt against gpt-image-1, Google Nano Banana 2 and Pro, showing mixed hide-and-seek success.
OpenAI released an upgraded image model that keeps character appearance perfectly consistent across frames and renders crisp, stable text.
YouTube talk by @sedielem offering a concise state-of-the-art overview of scaling generative image and video models, covering modeling, architecture, distillation and control.
Users are discovering strong meme-generation capabilities in GPT Image 2, particularly for game-specific humor.