vq-vae

[NEW MODEL] SupraLabs started the Any2Any model family!

Reddit r/LocalLLaMA · 6d ago

SupraLabs released Supra-A2A-Nano-Exp, a small any-to-any autoregressive model that unifies text and image tokenization into a single Transformer, serving as an educational prototype rather than a production-ready system.

SimPersona: Learning Discrete Buyer Personas from Raw Clickstreams for Grounded E-Commerce Agents

arXiv cs.AI · 2026-05-15

SimPersona learns discrete buyer personas from raw clickstreams using a VQ-VAE and maps them to persona tokens for LLM-based web agents, achieving high conversion-rate alignment across many live storefronts.

Continuous First, Discrete Later: VQ-VAEs Without Dimensional Collapse

arXiv cs.LG · 2026-05-11

This paper examines dimensional collapse in VQ-VAEs, where learned representations often occupy only a low-dimensional subspace of the latent space. It proposes an 'AE Warm-Up' strategy that first trains the model as an unquantized autoencoder, which improves reconstruction quality and increases effective latent dimensionality.
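
The idea of training without quantization first can be illustrated with a minimal sketch. This is not the paper's implementation: the summary does not say how long the warm-up runs or how the codebook is initialized afterwards, so `bottleneck`, `nearest_code`, and `warmup_steps` are hypothetical names and values chosen for illustration only.

```python
def nearest_code(z, codebook):
    """Return the codebook vector closest to z in squared Euclidean distance."""
    return min(codebook, key=lambda c: sum((x - y) ** 2 for x, y in zip(z, c)))


def bottleneck(z, codebook, step, warmup_steps):
    """Pass the encoder output through unchanged during warm-up, quantize afterwards.

    `warmup_steps` is an assumed hyperparameter, not a value from the paper.
    """
    if step < warmup_steps:
        # Warm-up phase: the model behaves as a plain (unquantized) autoencoder.
        return list(z)
    # After warm-up: snap the latent to its nearest codebook entry.
    return list(nearest_code(z, codebook))


codebook = [[0.0, 0.0], [1.0, 1.0]]
z = [0.9, 1.2]
during_warmup = bottleneck(z, codebook, step=0, warmup_steps=100)
after_warmup = bottleneck(z, codebook, step=100, warmup_steps=100)
```

In this toy setup the latent is returned untouched while `step` is below `warmup_steps` and is replaced by a codebook entry once the warm-up ends.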

Understanding VQ-VAE (DALL-E Explained Pt. 1)

ML at Berkeley · 2021-02-09

An educational blog post explaining the Vector Quantized Variational Autoencoder (VQ-VAE) architecture, a key component of OpenAI's DALL-E image generation model.
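
For readers new to the topic, the quantization step at the core of a VQ-VAE can be sketched in a few lines: each encoder output vector is replaced by its nearest entry in a learned codebook, and the entry's index becomes the discrete token. The snippet below is a toy illustration in plain Python, not code from the blog post; the codebook values and the `quantize` helper are assumptions made for the example, and training details such as the straight-through gradient and commitment loss are omitted.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def quantize(latents, codebook):
    """Map each latent vector to the index and value of its nearest codebook entry."""
    indices = []
    quantized = []
    for z in latents:
        idx = min(range(len(codebook)), key=lambda k: squared_distance(z, codebook[k]))
        indices.append(idx)
        quantized.append(codebook[idx])
    return indices, quantized


# Toy codebook with three 2-D entries (illustrative values only).
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 2.0]]
latents = [[0.1, -0.2], [0.9, 1.2], [-0.8, 1.7]]

indices, quantized = quantize(latents, codebook)
print(indices)  # [0, 1, 2]
```

The list of indices is what a downstream prior model would be trained on, while the quantized vectors are what the decoder receives.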

Jukebox

OpenAI Blog · 2020-04-30

OpenAI's Jukebox is a generative model that produces music as raw audio, including vocals and instruments, using a VQ-VAE for compression and hierarchical Sparse Transformer priors to handle long-range musical structure. It represents a significant step beyond symbolic music generation by operating directly in the raw audio domain.
