Tag
OpenAI announces the availability of their podcast on major streaming platforms including Spotify, Apple Podcasts, and YouTube.
Socrati is a new product launching on Product Hunt that generates personal knowledge podcasts from various sources.
OmniGUI introduces a step-level benchmark for GUI agents that integrates static images, synchronous audio, and video clips to simulate real smartphone interactions. Evaluation shows current models struggle with temporal and auditory inputs, highlighting the need for omni-modal capabilities.
OpenAI publishes the GPT-4o System Card detailing comprehensive safety evaluations and risk mitigations across cybersecurity, biological threats, persuasion, and model autonomy. The multimodal model scores low-to-medium on preparedness framework categories with novel safeguards for audio capabilities.
yt-dlp is a feature-rich command-line audio/video downloader supporting thousands of sites, forked from youtube-dl.