Tag
OpenAI announces a custom chip designed in collaboration with Broadcom to enhance its AI infrastructure.
OpenAI has announced its first custom AI chip designed specifically for running ChatGPT and future AI agents, marking a major step in reducing reliance on external hardware providers.
OpenAI unveiled its first custom-built inference processor, named Jalapeño, developed with Broadcom to improve performance-per-watt and reduce reliance on Nvidia GPUs.
A custom FPGA implementation of a Transformer with KV cache achieves 56,000 tokens per second at 80 MHz, running microGPT on a tiny LCD.
A custom digital chip designed gate-by-gate achieves over 56,000 tokens/sec running a Transformer with KV cache at just 80 MHz, prototyped on an FPGA.