Tag
Google introduces Veo 3.1, an upgraded video generation model with richer audio, improved narrative control, and enhanced realism. Flow also receives significant updates: new editing capabilities, including Insert and Remove, and audio support across all existing tools.
OpenAI's Jukebox is a generative model that produces music as raw audio, including vocals and instruments, using a VQ-VAE for compression and hierarchical Sparse Transformer priors to handle long-range musical structure. It represents a significant step beyond symbolic music generation by operating directly in the raw audio domain.
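To make the compression step more concrete, here is a minimal sketch of the nearest-codebook lookup at the core of a VQ-VAE. It is an illustration under stated assumptions, not Jukebox's implementation: the function names, the toy two-dimensional frames, and the fixed three-entry codebook are all hypothetical, whereas in a real VQ-VAE the frames would come from a learned encoder and the codebook would be learned during training.

```python
from typing import List, Sequence


def quantize(frames: Sequence[Sequence[float]],
             codebook: Sequence[Sequence[float]]) -> List[int]:
    """Map each frame to the index of its nearest codebook vector.

    Distance is squared Euclidean. The resulting integer codes are the
    kind of discrete sequence a prior model could then be trained on.
    """
    codes = []
    for frame in frames:
        distances = [
            sum((x - c) ** 2 for x, c in zip(frame, entry))
            for entry in codebook
        ]
        codes.append(distances.index(min(distances)))
    return codes


def dequantize(codes: Sequence[int],
               codebook: Sequence[Sequence[float]]) -> List[List[float]]:
    """Replace each code with its codebook vector (a lossy reconstruction)."""
    return [list(codebook[i]) for i in codes]


# Toy data (hypothetical): three 2-D "latent frames" and a 3-entry codebook.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0]]
frames = [[0.1, -0.1], [0.9, 1.2], [-0.8, -1.1]]

codes = quantize(frames, codebook)
reconstruction = dequantize(codes, codebook)
print(codes)
print(reconstruction)
```

The point of the sketch is that continuous frames collapse into a short sequence of integers, which is what makes long-range modeling with a sequence prior tractable. The reconstruction is lossy because each frame is replaced by its nearest codebook entry.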