Tag
Google DeepMind announces Gemma 4 12B, a novel encoder-free multimodal AI model that integrates vision and audio directly into the LLM backbone, delivering advanced reasoning and agentic capabilities on laptops with 16GB of RAM, released under Apache 2.0 license.
Teenage Engineering announces APC-2, a professional audio disc recording system for cutting vinyl records in real time, built in collaboration with SUPERSENSE.
A recap of an extraordinary week in open AI, featuring over 25 open-weight model releases across LLMs, image generation, audio/speech, vision, and video/3D, with notable contributions from NVIDIA, Google, and others.
Texas Instruments has released new 5532 chips that differ from the classic versions used for decades, potentially impacting audio applications.
Wired reviews four best Alexa speakers and smart displays for 2026, highlighting the Echo Show 11 as the top smart display and the Echo Show 8 (3rd gen) as the best affordable option, with mentions of ads and speaker quality trade-offs.
Xiaomi announces Sound Play, a compact portable speaker with 18W output, colorful lighting, 14-hour battery, and IP68 durability.
Xiaomi announces the Buds 6, featuring a comfortable semi-in-ear fit, richer sound, clearer calls, and smarter everyday convenience.
ChildVox presents a comprehensive benchmark for analyzing children's acoustic communication across developmental stages, integrating over 20 sub-tasks from 17 child-centered audio and speech datasets.
The Cearvol Wave Lite earbuds offer moderate hearing assistance but fall short in audio quality, especially for conversation and movie-watching, though they are reasonably priced for the hearing aid market.
Audiomass is a free, open-source multitrack audio editor that runs entirely in the web browser.
A blog post detailing the debugging of a recurring XF86AudioPlay key event in Emacs, traced to a headphone device driver using libinput and evtest.
Marshall announces the Milton A.N.C., a new pair of on-ear wireless headphones with active noise cancellation, available for $229.99. It offers up to 80 hours of playtime without ANC, Bluetooth 6.0, spatial audio, and a replaceable battery.
loopmaster is an IDE for livecoding music, enabling real-time algorithmic music composition.
A comment praising a product or demo for its high-quality appearance and sound.
Leaked images and details reveal Sony's upcoming 10th anniversary ColleXion headphones, featuring premium design, updated audio drivers, and a $649 price tag, expected to launch May 19th.
AudioMosaic introduces a contrastive learning-based audio encoder that uses structured time-frequency masking on spectrogram patches for efficient large-batch training, achieving state-of-the-art performance on audio benchmarks and improving audio-language models.
The author describes testing an agent workflow that converts prompts into audio courses for publishing to Spotify, with potential uses like meeting briefings, team updates, and study notes.
OpenAI announces the availability of their podcast on major streaming platforms including Spotify, Apple Podcasts, and YouTube.
Socrati is a new product launching on Product Hunt that generates personal knowledge podcasts from various sources.
OmniGUI introduces a step-level benchmark for GUI agents that integrates static images, synchronous audio, and video clips to simulate real smartphone interactions. Evaluation shows current models struggle with temporal and auditory inputs, highlighting the need for omni-modal capabilities.