Tag
Claude Managed Agents can now operate in a user-controlled sandbox on your own infrastructure, with new integration guides for Blaxel AI, e2b, Google Cloud, Namespace Labs, and Superserve AI.
The author explains how they built a compute platform capable of launching millions of sandboxes per second in constant time, focusing on decoupled scheduling and capacity aggregation using Cassandra and S3.
The article discusses the current state of computer-control harnesses that allow local vision language models to securely control a cursor in a sandbox environment.
A guide for running Pi AI agent securely inside a Docker Sandbox while running llama-server on the host machine for local GPU inference.
Monty is a minimal secure Python interpreter written in Rust, designed for AI Agents, offering microsecond-level startup and near-native CPython performance, while strictly restricting access to the host filesystem, network, and environment variables.
This article introduces the five-layer safety guardrail configuration of Claude Code, including OS sandbox, native permission rules, PreToolUse Hook, engineering rules, and remote access control. It also provides a deny/ask/allow configuration and command classification list to ensure the Agent operates autonomously within secure boundaries.
LangChain introduces LangSmith Sandboxes, providing each AI agent with its own isolated computer environment for safe code execution, addressing security risks of running untrusted code in containers or locally.
Simon Willison introduces micropython-wasm, an alpha package that runs Python code in a sandbox using MicroPython compiled to WebAssembly, enabling safe plugin execution within Python applications like Datasette.
A list of six powerful but lesser-known AI developer tools: Instructor for structured JSON output, Octopoda for agent memory, E2B for secure sandboxes, Firecrawl for website-to-markdown, Composio for app integrations, and LiteLLM for multi-model API.
Sage is a local AI inference engine built in Rust that runs entirely on the user's machine with kernel-level sandbox security, no telemetry, and no cloud dependency.
China released OpenSandbox, an open-source sandbox runtime for AI agents, supporting multiple SDKs and secure execution environments with Docker/Kubernetes isolation.
Mukil Loganathan from LangChain introduced LangSmith Sandboxes at the Interrupt conference, enabling safe execution of agent code with low latency, secure isolation, persistence, and snapshot/restore capabilities.
A developer created a cartoon sandbox where AI characters live, text, and autonomously run a town, with user interactions like ordering fights between characters.
Sentinel v0.3.0 is released, an out-of-band AI Agent security framework featuring Shield Sidecar, deterministic shadow sandbox, Red Team Engine with 34 vectors, and EU AI Act compliance reports.
The author introduces ptc_runner_mcp, an MCP server that provides a stateful, sandboxed REPL using a Clojure-like language, allowing AI agents to perform exploratory computations without overwhelming the context window.
The author built an AI sandbox manager using LXC containers that allows Codex agents to have full sudo access and GPU passthrough on headless Linux while keeping the host system safe from catastrophic errors.
AniC_dev introduces box, a simple and powerful sandbox for AI agents, claiming it is the most affordable option.
Presented at arXiv, DeltaBox introduces OS-level mechanisms (DeltaFS and DeltaCR) for millisecond-level checkpoint and rollback in stateful AI agents by only duplicating changes between consecutive states, achieving 14ms checkpoint and 5ms rollback on SWE-bench and enabling significantly deeper tree search within fixed time budgets.
A developer built Terrarium, an open-source sandboxing solution for running multiple AI agents securely with isolated worlds, reverse-proxy management, and state rollback.
Network allow-lists are insufficient to prevent data exfiltration via authorized channels like DNS or allowed endpoints. Canister, a lightweight Linux sandbox, addresses this with a layer-7 egress proxy that performs TLS interception and data-loss prevention.