Pi + Docker Sandbox + llama-server setup guide

Reddit r/AI_Agents Tools

Summary

A guide for running Pi AI agent securely inside a Docker Sandbox while running llama-server on the host machine for local GPU inference.

Hi everyone, I put together a guide for running Pi securely inside a Docker Sandbox while running llama-server directly on my host machine. The goal was to keep the agent isolated while still running the model on the local GPU at full speed. Hope someone finds this useful! \-> Link is in the comments below.
Original Article

Similar Articles

Automated AI researcher running locally with llama.cpp

Reddit r/LocalLLaMA

ml-intern is a harness for AI agents that integrates with Hugging Face's libraries and now supports running local models via llama.cpp or ollama, enabling an automated AI researcher to run 24/7 on a laptop.

We have sub-agents at home

Reddit r/LocalLLaMA

A developer shares a forked sub-agent repository for pi coding agent that works with a single local LLM slot and limited VRAM, using llama.cpp server and quantized models. The post also discusses performance with the Apex Qwen variant using MTP.