long-horizon-tasks

Tag

Cards List
#long-horizon-tasks

FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents

arXiv cs.CL · 2026-04-20 Cached

FS-Researcher introduces a file-system-based dual-agent framework that enables LLM agents to conduct deep research beyond context window limits by using persistent external memory as a shared workspace. The framework achieves state-of-the-art results on research benchmarks and demonstrates effective test-time scaling through computation allocation to evidence collection.

0 favorites 0 likes
← Previous
← Back to home

Submit Feedback