Tag
China has surpassed the US with the world's fastest supercomputer, though the machine is not optimized for AI workloads.
LiteParse is a fast, open-source document parser written in Rust that provides high-quality spatial text extraction with bounding boxes, supporting multiple languages and platforms for AI document workloads.
Intel's discontinued Optane persistent memory technology is finding a second life in AI workloads, enabling a user to run a 1 trillion parameter model locally at ~4 tokens/second using cheap second-hand Optane modules. The article highlights Optane's lower latency compared to SSDs, making it suitable for large model inference despite being slower than DRAM.
Intel launched the Crescent Island GPU at Computex 2026, featuring up to 480GB VRAM and based on the Arc Xe 3P architecture, targeting next-generation AI workloads.
Alibaba unveiled the Zhenwu M890 AI chip, designed to handle the memory and communication demands of AI agent workloads, as part of its push for domestic alternatives.
AMD argues that agentic AI requires rethinking infrastructure planning, with a need for dedicated CPU racks for orchestration and control workloads, shifting the CPU:GPU ratio from 1:8 or 1:4 to 1:1 or higher, rather than simply adding more CPUs to GPU-dense servers.