5.6 Sol is coming to Cerebras at 750 tokens per second in July

Reddit r/singularity Models

Summary

The 5.6 Sol model is coming to Cerebras hardware in July, offering inference at 750 tokens per second.

https://preview.redd.it/8nbr61qjzn9h1.png?width=1853&format=png&auto=webp&s=a223073294a2498e7557061f8b3fc822eb677f96

Absolutely insane

Similar Articles

OpenAI partners with Cerebras

OpenAI Blog

OpenAI is partnering with Cerebras to integrate 750 MW of ultra-low-latency AI compute into its platform, aiming to accelerate inference and enable faster real-time AI responses across a range of workloads.