8-16 MI50s Minimax M3 @19 tps TG (peak)

Reddit r/LocalLLaMA News

Summary

Reports a peak throughput of 19 tokens per second for the Minimax M3 model running on 8-16 MI50 GPUs.

No content available
Original Article

Similar Articles