Tag
A user released an open-source tool for exploring the Kokoro model, with code on GitHub and model data on Hugging Face.
Orkas is an open-source, local-first desktop agent app where a lead agent coordinates specialized sub-agents, each with its own context boundary, using user-provided API keys from various LLM providers.
Introduces the open-source Godot game engine, emphasizing that it is free and MIT-licensed, highlighting the advantages of its 2D engine, and encouraging readers to download and use it.
FLYWHEEL.md introduces a loop-based framework for agentic coding, where AI agents autonomously ship software but stop at human-gated checkpoints for critical decisions, applying Karpathy's AutoResearch loop to real-world software deployment.
An open-source tool for authoring and sharing SKILL.md files, with 150+ community skills across various domains, no accounts required, and bring-your-own-key (BYOK) support.
LongCat released an open-source talking-avatar model (likely state-of-the-art) under the MIT license, with a Hugging Face demo. Possible applications include AI tutors, dubbing, and coding agents.
Llampart 1.0.0 is a standalone local web UI for llama-server with translations, extended settings, and a polished conversation sidebar, released under MIT license.
GBrain is a state-of-the-art retrieval tool for AI agents, released under MIT license, featuring hybrid search, self-wiring knowledge graphs, and temporal question answering, built by Y Combinator's CEO for his own agents.
A CLAUDE.md file is shared to fix long-running coding agents that talk too much without shipping work. It focuses on action over narration and works across models.
The Cal.com team has open-sourced their entire scheduling platform as cal.diy under the MIT license, offering a free, self-hosted alternative to paid services like Calendly and SavvyCal.
An open-source repository called train-llm-from-scratch enables training billion-parameter LLMs on a single GPU. It provides a configurable pipeline from raw text to inference, including dataset streaming and checkpointing, and is released under the MIT license.
PrivateScribe.ai is a fully local, MIT-licensed AI transcription platform with HIPAA safeguards, now featuring a bundled macOS app, onboarding wizard, speaker diarization, and encryption.
Kevin Lin, a postdoctoral fellow at Oxford University, open-sourced Violin, a video translation tool that integrates speech recognition, LLM translation, and speech synthesis into an automated pipeline. It supports multilingual translation and personalized styles, and provides three usage modes: Web, CLI, and Agent.
Violin is an open-source video translation tool that integrates speech recognition, large language model translation, and text-to-speech. It supports over 30 languages and offers three usage modes: CLI, web app, and Claude Code.
Browser-Use, an open-source framework for AI-driven browser automation developed by ETH Zurich students, challenges the traditional RPA industry by offering free, self-healing capabilities that mimic human interaction without relying on brittle HTML parsing.
Cal.com releases Cal.diy, a fully open-source MIT-licensed community fork with all enterprise features stripped out for personal self-hosting.