On the quantitative analysis of decoder-based generative models
Summary
This paper proposes using Annealed Importance Sampling to evaluate log-likelihoods for decoder-based generative models (VAEs, GANs, etc.), addressing the challenge of intractable likelihood estimation. The authors validate their method and provide evaluation code to analyze model performance, overfitting, and mode coverage.
View Cached Full Text
Cached at: 04/20/26, 02:45 PM
Similar Articles
Lost and Found in Translation: Variational Diagnostics for Neural Codebook Channels
This paper introduces the neural codebook channel diagnostic for VAEs, which measures encoder-decoder disagreement and provides a certificate bounded by the variational gap, enabling detection of mismatched decoding in deep generative models.
New sampler + verifier *drastically* improves tiny 0.5b model coding performance
The paper introduces VGB, a process-guided sampling algorithm with probabilistic backtracking, which significantly improves coding performance on tiny 0.5B models by being robust to verifier errors.
The FID Lottery: Quantifying Hidden Randomness in Generative-Model Evaluation
This paper analyzes the variance of FID scores across different training and sampling seeds, revealing significant reproducibility issues in image generation evaluation. It proposes a new evaluation protocol with error bars and per-cell optimal guidance tuning.
Active Learning for Conditional Generative Compressed Sensing
This paper proposes a framework for conditional generative compressed sensing, proving stable recovery bounds for prompt-conditioned models and demonstrating how prompt matching influences sampling distributions in experiments with Stable Diffusion.
Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions
This paper presents a probabilistic post-processing framework using conditional variational autoencoders (cVAEs) to bias-adjust seasonal forecasts of Arctic sea ice, improving calibration, sharpness, and spectral power over standard methods.