Diffusion Model that can turn any Image into a Playable Hallucination! BUT LOCALLY, NOT ON DATACENTER
Summary
A diffusion model that can transform any image into an interactive, playable hallucination, running locally on user hardware.
Similar Articles
New BEST local AI image generator is here!
Ernie Image, a new open-source diffusion model, surpasses Zage in text rendering and prompt fidelity and can be run locally via ComfyUI with ~20 GB VRAM.
Deep Neural Network that turns any Image into a Playable Game ! All on consumer GPUs and Not Datacenters
The author presents a small transformer-based neural network trained from scratch to turn any image into a playable game, running in real-time on consumer GPUs like an RTX 5090. The model uses autoregressive decoding with KV caching but currently has issues with motion and context.
Why DDIM Hallucinates More than DDPM: A Theoretical Analysis of Reverse Dynamics
This paper provides a theoretical analysis explaining why deterministic DDIM samplers hallucinate more than stochastic DDPM samplers in diffusion models, attributing it to getting stuck in mode-interpolation regions during reverse dynamics.
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
PiD introduces a pixel diffusion decoder that reformulates latent decoding as conditional pixel diffusion, enabling fast and high-quality image synthesis at high resolutions with reduced computational requirements. It decodes latents into 4x or 8x upscaled images in under a second on consumer hardware.
Score-Control for Hallucination Reduction in Diffusion Models
This paper introduces Variance-Guided Score Modulation (VSM) to reduce hallucinations in diffusion models by controlling score function smoothness, achieving up to ~25% reduction while maintaining image quality.