sandbox

#sandbox

@vintcessun: Just came across this article, pretty impressive. Essentially, when AI agents do parallel exploration or tree search, each checkpoint/rollback requires backing up the entire file and process state, taking hundreds of milliseconds. DeltaBox discovered that consecutive checkpoints are actually highly similar. So instead of copying everything, just record the changes. It introduces two OS-level mechanisms…

X AI KOLs Timeline ↗ · 2026-05-24 Cached

Presented at arXiv, DeltaBox introduces OS-level mechanisms (DeltaFS and DeltaCR) for millisecond-level checkpoint and rollback in stateful AI agents by only duplicating changes between consecutive states, achieving 14ms checkpoint and 5ms rollback on SWE-bench and enabling significantly deeper tree search within fixed time budgets.

0 favorites 0 likes

#sandbox

Proper security and maintenance for agentic environments

Reddit r/openclaw ↗ · 2026-05-24

A developer built Terrarium, an open-source sandboxing solution for running multiple AI agents securely with isolated worlds, reverse-proxy management, and state rollback.

0 favorites 0 likes

#sandbox

A Network Allow-List Won't Stop Exfiltration

Lobsters Hottest ↗ · 2026-05-24 Cached

Network allow-lists are insufficient to prevent data exfiltration via authorized channels like DNS or allowed endpoints. Canister, a lightweight Linux sandbox, addresses this with a layer-7 egress proxy that performs TLS interception and data-loss prevention.

0 favorites 0 likes

#sandbox

Edge Python a 170 KB sandboxed Python subset (WASM) for running agent-generated code client-side (directly in the browser)

Reddit r/ArtificialInteligence ↗ · 2026-05-22

Edge Python is a 170 KB WASM-based sandboxed Python subset that runs agent-generated code directly in the browser without a server, supporting classes, async/await, pattern matching, and more.

0 favorites 0 likes

#sandbox

@peywalt: on the day of modal's series c announcement i am ... getting microsoft word to run in a modal sandbox

X AI KOLs Following ↗ · 2026-05-22 Cached

A user demonstrates running Microsoft Word inside a Modal sandbox on the day of Modal's Series C funding announcement.

0 favorites 0 likes

#sandbox

How to open calc.exe from S&Box

Lobsters Hottest ↗ · 2026-05-21 Cached

A security researcher examines the C# sandboxing in S&Box (Garry's Mod 2), which uses an API whitelist instead of a hardened runtime. By modifying the compile blacklist, they bypass the restrictions and crash the editor, demonstrating that the approach is insecure despite being similar to Space Station 14's system.

0 favorites 0 likes

#sandbox

@hwchase17: https://x.com/hwchase17/status/2057506580447510889

X AI KOLs Timeline ↗ · 2026-05-21 Cached

LangSmith introduces an Auth Proxy to secure network access for agent sandboxes, keeping credentials out of the runtime and enforcing explicit network access policies.

0 favorites 0 likes

#sandbox

We Reverse-Engineered Docker Sandbox's Undocumented MicroVM API

Hacker News Top ↗ · 2026-05-21 Cached

A team reverse-engineered Docker's undocumented MicroVM API used by Docker Sandboxes and built the open-source Sandbox Agent SDK to orchestrate AI coding agents inside microVMs for secure untrusted code execution.

0 favorites 0 likes

#sandbox

@_philschmid: I'm excited to introduce Managed Agents in the Gemini API. One API call gives you a full agent with code execution, web…

X AI KOLs Following ↗ · 2026-05-19 Cached

Phil Schmid announces Managed Agents in the Gemini API, enabling one-call agents with code execution, web browsing, and file management in isolated sandboxes, powered by Gemini 3.5 Flash.

0 favorites 0 likes

#sandbox

@mernit: Introducing Capsule — The Infra Framework for AI Apps Capsule is a Python framework that provides infrastructure primit…

X AI KOLs Following ↗ · 2026-05-19 Cached

Capsule is a Python framework that provides infrastructure primitives like sandboxes, auth, session management, integrations, and payments for AI apps, aiming to simplify deployment and iteration.

0 favorites 0 likes

#sandbox

@LangChain: Agent inside the sandbox or outside it? @Shevchenkoaalex from @TryRamp's answer.

X AI KOLs Timeline ↗ · 2026-05-18 Cached

A tweet from LangChain referencing an answer by Shevchenkoaalex of TryRamp about whether an agent should be inside or outside a sandbox, likely discussing security or deployment patterns.

0 favorites 0 likes

#sandbox

@gregpr07: /goal build GTA 6 Is this the AGI test? One prompt in -> full playable game out? How good can a single prompt get? http…

X AI KOLs Following ↗ · 2026-05-17 Cached

A tweet showcases a demo where a single prompt generates a playable open-city sandbox game reminiscent of GTA 6, hinting at AGI-level capability.

0 favorites 0 likes

#sandbox

How are you all handling state for long-running agents? Stateless sandboxes are eating my evenings

Reddit r/AI_Agents ↗ · 2026-05-17

A developer discusses challenges with state persistence in long-running coding agents using sandbox environments, detailing the costly resume overhead and seeking community solutions for persistent state handling without custom checkpointing layers.

0 favorites 0 likes

#sandbox

@ishaan_jaff: We're open sourcing LiteLLM Agent Platform Run Claude Code, Codex, Hermes or any coding agent in isolated K8s sandboxes…

X AI KOLs Timeline ↗ · 2026-05-16 Cached

LiteLLM is open-sourcing its Agent Platform, allowing developers to run coding agents like Claude Code, Codex, and Hermes in isolated Kubernetes sandboxes without exposing real API keys.

0 favorites 0 likes

#sandbox

@markokraemer: Yes we are building SandboxAgent which is just opencode based runtime running in sandboxes 1 random thing is remote ses…

X AI KOLs Following ↗ · 2026-05-15

Markokraemer announces SandboxAgent, an opencode-based runtime that runs in sandboxes with remote session storage and git-native versioning for centralized data and isolated operation.

0 favorites 0 likes

#sandbox

I built a cloud agent harness that you can train to be specialized at any task

Reddit r/AI_Agents ↗ · 2026-05-15

The author introduces OpenSteer, a cloud agent platform that allows users to create and customize specialized agents for automating tasks across websites and services, with support for cloud browsers, APIs, MCPs, and CLIs, demonstrated through a sales automation agent.

0 favorites 0 likes

#sandbox

Just stumbled across one of the wildest AI experiments I’ve seen in a while.

Reddit r/AI_Agents ↗ · 2026-05-14

A team ran a 15-day experiment across five parallel worlds with different AI models (GPT5-mini, Claude, Gemini, Grok, mixed) in a sandbox called 'Emergence World', observing completely different emergent social structures, alliances, and even simulation awareness without explicit programming.

0 favorites 0 likes

#sandbox

Would an open-source CLI orchestration layer for AI agents make sense?

Reddit r/AI_Agents ↗ · 2026-05-14

The article explores the idea of an open-source layer to orchestrate CLI usage by AI agents, addressing challenges like permissions, sandboxing, and audit trails when agents interact with multiple CLIs.

0 favorites 0 likes

#sandbox

@thsottiaux: We are continuing to invest in making agents work better on Windows. Highly recommend reading David's engineering post …

X AI KOLs Following ↗ · 2026-05-13 Cached

OpenAI is improving agent support on Windows by implementing a custom sandbox for Codex, addressing OS-level isolation challenges to ensure safe and efficient operation.

0 favorites 0 likes

#sandbox

@dzhng: INTRODUCING: Duet Agent A new type of harness we're building at @duetchat Perfect for jobs that don't fit in one chat: …

X AI KOLs Following ↗ · 2026-05-13 Cached

Duetchat introduces Duet Agent, a new harness for running long-duration AI agent tasks with state machine relay, memory compaction, and a stateless runner for sandboxes.

0 favorites 0 likes

sandbox

Submit Feedback