real-time

#real-time

@itsafiz: It really isn't an exaggeration! LiteParse clocks in at an average of 3ms per page for a reason: it skips the heavy AI …

X AI KOLs Following ↗ · yesterday Cached

LiteParse is a fast document parsing tool that runs locally, achieving ~3ms per page by skipping heavy AI and cloud overhead. It uses deterministic layout heuristics and selective OCR to output structured Markdown, making it ideal for real-time RAG pipelines and coding agents.

0 favorites 0 likes

#real-time

@xiaoying_eth: This guy just open-sourced a real-time global intelligence dashboard, completely free. >It tracks conflicts, military activities, infrastructure, protests, and market signals in real time. >Runs in the browser >MIT License. https://github.com/koala73/worldmonitor…

X AI KOLs Timeline ↗ · yesterday Cached

An open-source real-time global intelligence dashboard that tracks conflicts, military activities, infrastructure, protests, and market signals, runs in the browser, and is licensed under MIT.

0 favorites 0 likes

#real-time

@HowToPrompt__: This tool lets you face-swap on a live webcam in real-time using just one photo. zoom calls, omegle, livestreams.. you …

X AI KOLs Timeline ↗ · yesterday Cached

A free open-source tool enables real-time face-swapping on live webcam using a single photo, with 93k GitHub stars.

0 favorites 0 likes

#real-time

@IlirAliu_: Forget lidar. One single camera. Runs in real time & is open source: A streaming 3D model that reconstructs scenes live…

X AI KOLs Timeline ↗ · 2d ago Cached

LingBot-Map is an open-source, real-time streaming 3D reconstruction model that uses a single camera, running at ~20 FPS via a feed-forward geometric context transformer, outperforming both streaming and offline methods.

0 favorites 0 likes

#real-time

@LangChain: In a real conversation, deciding when to speak takes about as much brainpower as deciding what to say. Voice agents hav…

X AI KOLs Following ↗ · 2d ago Cached

Sierra Platform's approach to voice agents parallelizes thinking, listening, and talking to mimic human conversation, as discussed on the Max Agency podcast.

0 favorites 0 likes

#real-time

NeuraDock Visual Cognitive Load Agent Tutorial: A Quality-Gated Open-Source EEG Workflow for Alpha Dynamics and Real-Time Applications

arXiv cs.AI ↗ · 2d ago Cached

This tutorial paper presents NeuraDock Agent, an open-source EEG workflow for visual cognitive load analysis with alpha dynamics, including preprocessing, quality control, real-time API, and LLM interpretation.

0 favorites 0 likes

#real-time

@liquidai: Introducing LFM2.5-230M: our smallest model yet, built to run fast anywhere (CPUs, NPUs, and GPUs) to enable agentic ta…

X AI KOLs Timeline ↗ · 3d ago Cached

Liquid AI releases LFM2.5-230M, a small 230M parameter model optimized for fast inference on CPUs, NPUs, and GPUs, targeting agentic tasks on devices like phones and robots.

0 favorites 0 likes

#real-time

Can I texture 3D objects with oil paint?

Lobsters Hottest ↗ · 3d ago Cached

A traditional oil painting artist developed the open-source tool Bob Jack Painter, which uses a real-time camera to map oil paint textures from a physical canvas onto 3D models, enabling the workflow of texturing digital 3D objects with real oil paint.

0 favorites 0 likes

#real-time

Best STT API for voice agents? I’d test latency before accuracy

Reddit r/AI_Agents ↗ · 3d ago

The author argues that for live voice agents, STT latency and real-time behavior are more critical than raw transcription accuracy, and proposes a different evaluation scorecard.

0 favorites 0 likes

#real-time

@jxnlco: Computah! Activate Firewall! with gpt-realtime-2 you can in context prompt your wake words, reasoning, and build some s…

X AI KOLs Following ↗ · 4d ago Cached

jxnlco demonstrates gpt-realtime-2's ability to process in-context wake words and reasoning by building a Simon Says game that beats him.

0 favorites 0 likes

#real-time

Signspell

Product Hunt ↗ · 4d ago

Signspell is a Python package for real-time American Sign Language alphabet recognition, installable via pip.

0 favorites 0 likes

#real-time

South Korean AI app went viral for AI characters that can talk, react, and respond to camera context

Reddit r/artificial ↗ · 4d ago

A South Korean AI app goes viral for enabling lifelike video conversations with AI characters that use voice, lip sync, facial expressions, and camera context, signaling a shift from text-based interfaces to real-time video-native interactions.

0 favorites 0 likes

#real-time

THE EX-GOOGLE CHARACTER AI ERA IS EVOLVING

Reddit r/artificial ↗ · 4d ago

Mel AI is evolving AI characters from text-based interactions to real-time video chat, with lip sync, facial expressions, and camera context awareness, following the success of Character AI.

0 favorites 0 likes

#real-time

@rohanpaul_ai: You can import 5 stocks, and the tool can scrape information from major websites to generate real-time AI summaries. It…

X AI KOLs Following ↗ · 5d ago Cached

KroWork is a newly launched tool that converts AI chat conversations into reusable desktop applications, allowing non-technical users to create workflows via natural language that run locally without consuming tokens on restart. It enables tasks like real-time stock monitoring for free.

0 favorites 0 likes

#real-time

Is Whisper still the best default for speech-to-text if the app needs to be real time?

Reddit r/AI_Agents ↗ · 5d ago

Explores whether OpenAI's Whisper remains the top choice for real-time speech-to-text applications, considering alternatives and performance trade-offs.

0 favorites 0 likes

#real-time

@DataChaz: @NVIDIA just quietly dropped an incredibly impressive speech recognition model that completely changes the math for loc…

X AI KOLs Timeline ↗ · 5d ago Cached

NVIDIA quietly released Nemotron-3.5-ASR, a lightweight 0.6B parameter open-source speech recognition model designed for real-time streaming with support for 40+ languages, low latency, and cache-aware architecture.

0 favorites 0 likes

#real-time

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Hugging Face Daily Papers ↗ · 6d ago Cached

Wan-Streamer is a unified end-to-end multimodal model for real-time audio-visual interaction using causal attention and integrated processing of visual, audio, and text modalities, achieving sub-second latency.

0 favorites 0 likes

#real-time

We got local models to triage the OpenClaw repo for FREE!*

Hugging Face Blog ↗ · 2026-06-22 Cached

The blog post describes using local open-weight models like Gemma and Qwen in an agent harness to automatically triage issues and pull requests in the OpenClaw repository, enabling real-time notifications without relying on costly closed API models.

0 favorites 0 likes

#real-time

Show HN: TownSquare, a tiny presence layer for websites

Hacker News Top ↗ · 2026-06-20 Cached

TownSquare is a tiny presence layer for websites that lets visitors see each other and interact in real-time with no accounts or algorithms, using a single script tag.

0 favorites 0 likes

#real-time

Researchers introduce T-Rex, a framework that unifies vision, language, and tactile sensing so robots can respond to physical contact in real time rather than relying on vision alone

Reddit r/singularity ↗ · 2026-06-20

Researchers introduced T-Rex, a framework that integrates vision, language, and tactile sensing, enabling robots to respond to physical contact in real time rather than relying solely on vision.

0 favorites 0 likes

real-time

Submit Feedback