Opus 4.7 scores lower than 4.6 and 4.5 on SimpleBench

Reddit r/singularity Models

Summary

Claude Opus 4.7 scores lower than Opus 4.6 and Opus 4.5 on the SimpleBench evaluation.

