voice-assistant

Tag

Cards List
#voice-assistant

Amazon’s new Alexa+ powered feature can generate podcast episodes

TechCrunch AI · 2026-05-18 Cached

Amazon announces a new Alexa+ feature called Alexa Podcasts that generates podcast episodes on any topic using AI, with options to customize length and tone, narrated by AI host voices.

0 favorites 0 likes
#voice-assistant

I spent 45 minutes driving to work every day unable to talk to my AI agent, so I built an iOS app where I can handsfree talk to my AI agent - with full TTS and STT built-in

Reddit r/openclaw · 2026-05-17

A developer built ClawVibe, an iOS app for hands-free voice interaction with AI agents, featuring on-device speech recognition and TTS for low latency.

0 favorites 0 likes
#voice-assistant

AI felt trapped in a textbox, so I spent the last 14 months trying to give it a body

Reddit r/singularity · 2026-05-16 Cached

The developer spent 14 months creating an AI physical prototype device named Keito, based on the ESP32 chip. It supports features such as voice conversation, real-time lip-sync animation, capacitive touch interaction, music playback, and weather query, aiming to liberate AI from the text box.

0 favorites 0 likes
#voice-assistant

@FinanceYF5: Meta AI is transforming from a 'chat box' into an always-on perception layer. Alexandr Wang mentioned that the Muse Spark update includes voice conversations, real-time camera AI, and a gradual transition into glasses. The point is not just another voice assistant, but AI beginning to see, hear, and understand the world in front of you.

X AI KOLs Following · 2026-05-15 Cached

Meta AI is evolving from a chat box into an always-on perception layer, adding voice conversations, real-time camera AI capabilities, and gradually moving into glasses form, enabling AI to see, hear, and understand the world in front of the user.

0 favorites 0 likes
#voice-assistant

MIST: Multimodal Interactive Speech-based Tool-calling Conversational Assistants for Smart Homes

arXiv cs.CL · 2026-05-11 Cached

The paper introduces MIST, a synthetic dataset and framework for training multimodal voice assistants to control IoT devices in smart homes. It highlights significant performance gaps between open and closed-weight models in handling complex, speech-based tool-calling tasks.

0 favorites 0 likes
#voice-assistant

Built a practical voice-first AI tool for ADHD/executive dysfunction — one-tap brain dump → structured reminders & tasks (not a full autonomous agent)

Reddit r/AI_Agents · 2026-05-10

The author introduces SAVI, an iOS app designed for ADHD users that converts voice brain dumps into structured tasks and reminders using on-device AI like Whisper and GPT-4o.

0 favorites 0 likes
#voice-assistant

Built a JARVIS-style assistant with wake word, vision mode, local voice cloning, and LLM-generated system commands

Reddit r/ArtificialInteligence · 2026-05-08

A developer built a JARVIS-style personal assistant called CYBER with wake word activation, local voice cloning via XTTS v2, vision mode, and LLM-generated system commands, all running locally without cloud dependencies.

0 favorites 0 likes
#voice-assistant

Cardamom

Product Hunt · 2026-05-07

Cardamom is an AI-powered phone ordering system designed for takeout-heavy restaurants.

0 favorites 0 likes
#voice-assistant

Parloa builds service agents customers want to talk to

OpenAI Blog · 2026-05-07 Cached

Parloa has evolved its platform to an AI Agent Management Platform (AMP) using GPT-5.4, enabling enterprises to design, simulate, and deploy voice and text service agents without coding.

0 favorites 0 likes
#voice-assistant

EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions

arXiv cs.CL · 2026-04-21 Cached

EchoChain is a new benchmark for evaluating AI models' ability to revise in-progress responses when users interrupt mid-generation. The benchmark identifies three failure patterns (contextual inertia, interruption amnesia, objective displacement) and finds that across evaluated real-time voice models, no system exceeds 50% pass rate.

0 favorites 0 likes
#voice-assistant

ARKAD Wallet

Product Hunt · 2026-04-13

ARKAD Wallet is a product that allows users to talk to their finances to improve personal finance management.

0 favorites 0 likes
#voice-assistant

Intelligent Eyewear | I/O 2026 Keynote

YouTube AI Channels · 2026-05-23 Cached

At I/O 2026, Google unveiled the Android XR smart glasses ecosystem. The first audio glasses, powered by Gemini, will launch in fall 2026, offering hands-free voice assistance, navigation, cross-app operations, and real-time translation, in partnership with Samsung, Gentle Monster, and Warby Parker.

0 favorites 0 likes
← Previous
← Back to home

Submit Feedback