Tag
The article introduces Needle, a 26M-parameter model from Cactus-Compute designed for single-shot tool calling. It argues that tool routing should be separated from reasoning and treated as a structured prediction task, which would improve agent efficiency and reduce latency.