Tag
This paper presents Autopilot, an execution model for long-horizon LLM agents that enforces honest termination by externalizing agent state into a gated finite-state machine. It provides a theoretical guarantee against fabricated success and, in empirical evaluations, reports significantly lower fabrication rates than Reflexion and StateFlow.
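
To make the idea of a gated finite-state machine concrete, the following is a minimal Python sketch and not the paper's implementation. The state names, the `GatedFSM` class, the `build_fsm` helper, and the evidence keys (`plan`, `output`, `tests_passed`) are all hypothetical, chosen only for illustration. The one property it tries to show is that the agent cannot reach a terminal success state by assertion alone: each transition is guarded by a gate that reads externally recorded evidence.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import Callable, Dict, List, Tuple


class State(Enum):
    # Hypothetical states; the paper's actual state set is not given here.
    PLAN = auto()
    EXECUTE = auto()
    VERIFY = auto()
    DONE = auto()
    FAILED = auto()


# A gate inspects externally recorded evidence, never the agent's own claim.
Gate = Callable[[Dict[str, object]], bool]


@dataclass
class GatedFSM:
    state: State = State.PLAN
    evidence: Dict[str, object] = field(default_factory=dict)
    history: List[Tuple[State, State]] = field(default_factory=list)
    gates: Dict[Tuple[State, State], Gate] = field(default_factory=dict)

    def allow(self, src: State, dst: State, gate: Gate) -> None:
        """Register a guarded transition from src to dst."""
        self.gates[(src, dst)] = gate

    def request(self, dst: State) -> bool:
        """Attempt a transition; refuse it if no gate exists or the gate fails."""
        gate = self.gates.get((self.state, dst))
        if gate is None or not gate(self.evidence):
            return False
        self.history.append((self.state, dst))
        self.state = dst
        return True


def build_fsm() -> GatedFSM:
    """Assemble an illustrative machine with assumed evidence keys."""
    fsm = GatedFSM()
    fsm.allow(State.PLAN, State.EXECUTE, lambda ev: "plan" in ev)
    fsm.allow(State.EXECUTE, State.VERIFY, lambda ev: "output" in ev)
    # Success requires recorded passing checks, not a self-report.
    fsm.allow(State.VERIFY, State.DONE, lambda ev: ev.get("tests_passed") is True)
    fsm.allow(State.VERIFY, State.FAILED, lambda ev: ev.get("tests_passed") is False)
    return fsm
```

In this sketch, a call such as `fsm.request(State.DONE)` made from `PLAN`, or from `VERIFY` before any check result has been recorded, returns `False` and leaves the state unchanged. Under these assumptions, that refusal is the sense in which termination is gated rather than self-declared.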