The author built HeurChain, a memory broker that provides agent-specific, persistent memory storage for AI agents, surviving restarts and supporting structured and semantic retrieval.
A new method for orchestrating agents is being worked on, featuring delegation plans and subagents that can run locally or in Dockerized cloud environments, with message passing between them.
A tutorial explaining secrets management options for NixOS, comparing tools like sops-nix, agenix, and ragenix, with practical examples of using sops-nix for encrypted secrets management.
Vulnerability Garden is a curated list of named vulnerabilities, attack techniques, and exploits, providing references and dates for each entry.
React Doctor v2 is an open-source CLI tool that analyzes React codebases for performance issues, bad patterns, unnecessary re-renders, and broken architecture. It supports Next.js, Vite, and React Native and can be run instantly via npx.
OrcaRouter is a learning-based LLM router that dynamically routes prompts to appropriate models based on quality, cost, speed, and reliability, improving over time with production traffic.
Conductor is a Mac app that enables running multiple coding agents simultaneously on isolated codebase copies, with $22M Series A funding and the launch of Conductor Cloud for continuous agent operation.
Modular published a blog post explaining why traditional HTTP routing doesn't work for LLM inference workloads. The article describes how their distributed inference framework handles stateful, heterogeneous GPU pods with KV caches, specialized prefill/decode backends, and conversation-level routing that traditional stateless routing algorithms cannot address.
Lemonade has added an experimental ROCm backend for vLLM, allowing users to easily run safetensors LLMs on AMD GPUs with a simple command.
Jane Street's Head of Technology presents code that purportedly generates $13B profit, offering a template to build your own AI-powered hedge fund.
Explains how the -ncmoe flag in llama.cpp improves performance for MoE models like Qwen3.6 35B A3B on limited VRAM (8-12GB) by offloading some expert layers to CPU+RAM, with benchmarks showing up to 5x speedup on an RTX 3070Ti.
DriftGuard is a PyPI package that adds a semantic memory layer for AI agents, allowing them to remember past mistakes and avoid repeating them by comparing proposed actions against a graph of past failures.
The author explains why they have switched from writing markdown files to using Claude Code to generate HTML for them, arguing that HTML is the new markdown.
The article argues that HTML is a superior output format for AI agents compared to Markdown due to richer information density, visual clarity, ease of sharing, and two-way interaction, and shares why the author and others at Claude Code prefer HTML.
An open-source desktop tool called udemy-downloader-gui has been released, allowing users to download any Udemy course for free offline use with a single click.
Codex introduces the /goal command, which lets the AI autonomously work toward a defined end state, streamlining long-running tasks like refactors, migrations, and retry loops.
The author built a benchmark harness to evaluate local LLMs for autonomous Go code generation, focusing on log parser generation for SIEM pipelines, and published results comparing quality vs. speed.
OpenAI announces a migration guide for users to switch from ChatGPT to Codex, a dedicated AI coding assistant.
Bjarne Stroustrup answers common questions about memory leaks in C++, providing guidance on modern C++ memory management techniques.
Twitter/X post explaining how the Hermes AI agent's autonomous /goal flow works - users set a goal once and the model executes without supervision, writing files, running commands, building, testing, and iterating until completion or failure.