RiskKernel — self-hosted guardrails + kill switch for AI agents (your keys, no telemetry, Apache-2.0, single Go binary)

Reddit r/AI_Agents 06/01/26, 01:37 PM Tools

self-hosted guardrails kill-switch ai-agents llmops opentelemetry golang open-source

Summary

RiskKernel is a self-hosted, single Go binary that enforces hard per-run budgets (cost, loop count, wall-clock), kill switches, and human approval gates for AI agents, supporting Anthropic and OpenAI providers with no telemetry.

If you self-host anything touching LLMs, the scary failure mode is an agent that loops and quietly burns money (or fires a side-effecting tool) with no ceiling and no kill switch. RiskKernel is a single \~20MB static Go binary you run yourself that enforces hard per-run budgets — cost, loop count, wall-clock — plus a kill switch, crash-resume, and human approval gates on side-effecting tools. Everything is deterministic Go; an LLM is never in the enforcement path. Self-hosted ethos, end to end: BYO provider key (never stored in state, never logged), no telemetry/phone-home (verifiable — outbound is only to your provider, your OTLP endpoint, and your approval webhook; see SECURITY.md), and your state is a SQLite file you own. Adoption is one env var in front of an existing OpenAI-compatible app. Honest limits: single instance + SQLite (no HA yet), one API token (front it with oauth2-proxy/Authelia for auth), no streaming yet. Native providers are Anthropic + OpenAI today. Apache-2.0, docker run quickstart + demo GIF in the README — feedback welcome. opensource selfhosted aiagents llmops opentelemetry observability golang

Original Article

Similar Articles

AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents

Hugging Face Daily Papers

AgentKernelArena is an open-source benchmark for evaluating AI coding agents on GPU kernel optimization, assessing full agent workflows and generalization to unseen configurations across 196 tasks.

OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform

Papers with Code Trending

OpenGuardrails is an open-source platform for AI safety, offering context-aware content-safety and manipulation detection (e.g., prompt injection, jailbreaking) via a unified model, plus a separate NER pipeline for data-leakage identification. It achieves state-of-the-art performance on safety benchmarks and supports private, enterprise-grade deployment.

We built a public red team environment for our AI agent security proxy — submit attacks and get a full security trace back

Reddit r/artificial

Arc Gate is a runtime governance layer for LLM agents that enforces instruction-authority boundaries. The project has launched a public red team environment where users can submit attacks and receive full security traces, with a benchmark showing 100% unsafe action prevention.

Open-sourcing a shell-level security layer for AI agents

Reddit r/AI_Agents

Open-sourcing a shell-level control layer that blocks dangerous commands, exposes fake secrets, and enforces runtime policies to make AI agents safer and more deterministic in developer environments.

I built an AI agent runtime in Go that compiles and tests generated code before delivering it , 35 files, 156 tests, zero dependencies