image-generation

#image-generation

Information-Theoretic Classifier-Free Guidance with Adaptive Schedule Optimization

arXiv cs.LG ↗ · yesterday Cached

Proposes an information-theoretic framework for optimizing classifier-free guidance schedules in diffusion models, achieving improved trade-offs between condition consistency and sample diversity on ImageNet and COCO benchmarks.

0 favorites 0 likes

#image-generation

DiffusionBench: Towards Holistic Evaluation of Generative Diffusion Transformers

Hacker News Top ↗ · yesterday Cached

Introduces DiffusionBench, a unified benchmark for holistic evaluation of generative diffusion transformers, supporting multiple generation tasks and providing standardized training and evaluation.

0 favorites 0 likes

#image-generation

Krea 2 Technical Report (59 minute read)

TLDR AI ↗ · yesterday Cached

Krea 2 is a series of foundation models for creative image generation, built with a large-scale data infrastructure and multi-stage training pipeline. It introduces a prompt expander and style-reference system to improve steerability and enable creative exploration.

0 favorites 0 likes

#image-generation

Krea 2 released on Hugging Face

Reddit r/LocalLLaMA ↗ · yesterday Cached

Krea 2 is a 12-billion parameter text-to-image diffusion model released open-weight on Hugging Face, with Raw (base) and Turbo (post-trained) checkpoints available.

0 favorites 0 likes

#image-generation

Best cheap model for content writing, realistic image generation & vibe coding?

Reddit r/AI_Agents ↗ · yesterday

Asks for recommendations on affordable AI models for content writing, image generation, and vibe coding.

0 favorites 0 likes

#image-generation

Boogu Base, Turbo, Edit - open-source unified image generation and editing model series

Reddit r/LocalLLaMA ↗ · 2d ago

Boogu has released a series of open-source unified image generation and editing models, including Base, Turbo, and Edit variants.

0 favorites 0 likes

#image-generation

DiffusionBench: On Holistic Evaluation of Diffusion Transformers

Hugging Face Daily Papers ↗ · 2d ago Cached

Researchers introduce NanoGen, a unified framework for training and evaluating diffusion transformers, and propose DiffusionBench, a holistic benchmark combining ImageNet class-conditional and text-to-image generation to better assess progress in generative modeling.

0 favorites 0 likes

#image-generation

Semantic Browsing: Controllable Diversity for Image Generation

Hugging Face Daily Papers ↗ · 3d ago Cached

Semantic Browsing introduces a method for controlled diversity in text-to-image generation by using a Vision Language Model with an agentic workflow to generate structured, interpretable variations based on semantic decisions.

0 favorites 0 likes

#image-generation

Local text to image model comparaison: The ultimate test.

Reddit r/LocalLLaMA ↗ · 3d ago

User presents a comprehensive comparison of local text-to-image models using 192 prompts, evaluating capabilities like text rendering, faces, anatomy, and spatial composition, with results and prompts publicly available at imagebench.ai.

0 favorites 0 likes

#image-generation

I pretrained and post trained a 500M parameter LLM and 330M parameter Image generator from scratch

Reddit r/LocalLLaMA ↗ · 3d ago

The author details the process of pretraining and post-training a 500M parameter language model and a 330M parameter image generator entirely from scratch.

0 favorites 0 likes

#image-generation

Thumbmagic

Product Hunt ↗ · 4d ago

Thumbmagic is an AI thumbnail generator trained on top-performing thumbnails.

0 favorites 0 likes

#image-generation

@cellinlab: Holy cow! Brilliant! Why didn't I think of this usage — using an embedded browser as the infinite canvas for Codex Image 2 image generation!

X AI KOLs Timeline ↗ · 5d ago Cached

Discovered a creative usage: using an embedded browser to achieve infinite canvas image generation with Codex Image 2.

0 favorites 0 likes

#image-generation

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free)

Reddit r/LocalLLaMA ↗ · 2026-06-18

A user demonstrates giving a local LLM agent MCP tools for local image and video generation, enabling fully offline and free generation on demand.

0 favorites 0 likes

#image-generation

@FinanceYF5: 3 years of AI progress ModelScope (left) Grok Imagine 1.5 (right)

X AI KOLs Following ↗ · 2026-06-18 Cached

Shows three years of AI progress: ModelScope on the left, Grok Imagine 1.5 on the right.

0 favorites 0 likes

#image-generation

Midjourney, The Image Generation Company, Just Built the Sequel to the MRI

Reddit r/singularity ↗ · 2026-06-18

Midjourney, known for AI image generation, has developed a new technology that is described as the sequel to the MRI, likely advancing medical imaging capabilities.

0 favorites 0 likes

#image-generation

FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining

Hugging Face Daily Papers ↗ · 2026-06-18 Cached

FreeStyle proposes a scalable dual-reference generation framework using community LoRA mining to construct large-scale style-content triplets, with disentanglement mechanisms to prevent content leakage, and introduces a comprehensive benchmark for evaluation.

0 favorites 0 likes

#image-generation

The FID Lottery: Quantifying Hidden Randomness in Generative-Model Evaluation

Hugging Face Daily Papers ↗ · 2026-06-18 Cached

This paper analyzes the variance of FID scores across different training and sampling seeds, revealing significant reproducibility issues in image generation evaluation. It proposes a new evaluation protocol with error bars and per-cell optimal guidance tuning.

0 favorites 0 likes

#image-generation

ostris/ideogram_4_turbotime_lora

Hugging Face Models Trending ↗ · 2026-06-17 Cached

A LoRA that adapts Ideogram 4 to generate high-quality images in as few as 2 steps without CFG, using a novel continuous turbo training method.

0 favorites 0 likes

#image-generation

Comfy-Org/Boogu-Image

Hugging Face Models Trending ↗ · 2026-06-17 Cached

Comfy-Org has repackaged Boogu-Image model files for ComfyUI, including base, edit, and turbo variants with different quantization formats, plus a LoRA and text encoder.

0 favorites 0 likes

#image-generation

New image model from Google

Reddit r/singularity ↗ · 2026-06-17

Google released a new image generation model.

0 favorites 0 likes

image-generation

Submit Feedback