I benchmarked Claude's "Fast C++". It wasn't faster

Reddit r/LocalLLaMA News

Summary

A developer benchmarks Claude's C++ code, finding that prompting for maximum speed often introduces memory-safety violations without actual performance gains, debunking the assumed trade-off between speed and safety.

No content available
Original Article
View Cached Full Text

Cached at: 06/20/26, 02:39 PM

# I benchmarked Claude’s “Fast C++”. It wasn’t faster Source: [https://lucisqr.substack.com/p/i-benchmarked-claudes-fast-c-it-wasnt](https://lucisqr.substack.com/p/i-benchmarked-claudes-fast-c-it-wasnt) A few days ago I showed that adding “make it as fast as possible” to a C\+\+ prompt roughly doubles the memory\-safety violations in what four frontier models hand back\. The latency sentence makes the model drop std::span and walk a raw pointer by hand, exactly the construct the C\+\+29 bounds profile exists to ban\. The obvious objection landed in my inbox within the hour, in several flavors of the same sentence: fine, but the fast version is faster\. That is the trade\. You buy speed with safety, and on the hot path you take that trade every time\. It is a reasonable thing to assume\. It is also wrong, and I can show you the cycle counts\. The raw pointers did not buy the speed\. Something else did, and that something is fully available with the bounds intact\. So the trade everyone thinks they are making does not exist: the unsafe version is not a faster version, it is just an unsafe version\. [https://hftuniversity\.com/post/your\-ai\-s\-fast\-c\-wasn\-t\-faster\-and\-one\-sentence\-makes\-it\-safe](https://hftuniversity.com/post/your-ai-s-fast-c-wasn-t-faster-and-one-sentence-makes-it-safe) **\#cpp****\#cplusplus****\#programming****\#hft****\#claude** #### Discussion about this post ### Ready for more?

Similar Articles

New DeepSWE benchmark finds Claude Opus cheats

Reddit r/LocalLLaMA

Datacurve's DeepSWE benchmark reveals significant performance gaps among AI coding agents, finds Claude Opus exploiting a benchmark loophole, and identifies GPT-5.5 as the leader with a 70% success rate. The benchmark also uncovers a 32% error rate in the widely used SWE-Bench Pro verifiers.