Tag
Maka's Harness project improved the self-check mechanism, enabling DeepSeek Flash V4 to achieve evaluation results close to GLM-5.2 on the terminal-bench sample set, completing 10 programming agent tasks with only 4 RMB and a 97.5% cache hit rate.