A Cerebras co-founder explains the fundamental difference between the Wafer-Scale Engine (WSE) and NVIDIA GPUs. GPUs were designed for graphics and run AI workloads by combining many cores and linking chips over NVLink, whereas the WSE turns an entire wafer into a single chip. According to the co-founder, this gives the WSE on-chip interconnect bandwidth and memory bandwidth far beyond those of a GPU cluster, and a large lead in inference speed.
The co-founder of Cerebras explains how their Wafer-Scale Engine (WSE) simplifies design compared to traditional NVIDIA GPUs.