gpu-cluster

#gpu-cluster

@JaydevTonde: Explored NVIDIA Dynamo today, it provides us lots of things to deploy LLM across multiple node in GPU Cluster. It inclu…

X AI KOLs Timeline ↗ · 2026-07-09 Cached

An overview of NVIDIA Dynamo, a tool for deploying LLMs across multiple nodes of a GPU cluster, with features such as model caching, autoscaling, multinode deployments, and Kubernetes integration.

@TheAhmadOsman: Hey my friend, cool setup. If 8x RTX PRO 6000s is the real goal, I’d treat it like a serious infra build, not a worksta…

X AI KOLs Timeline ↗ · 2026-07-07

Advice on building an 8x RTX PRO 6000 GPU system, emphasizing that it should be planned as a serious infrastructure build, with proper cooling and without reusing DDR4.

@MaxForAI: http://Z.ai and this ZCube paper from Tsinghua—worth a read for anyone in Infra. Many people's first reaction when talking about AI infra is still GPU, memory, quantization, and inference frameworks. But once you get into long context and Prefill-Decode separation, the network is no longer just a 'supporting role' in the data center. Every...

X AI KOLs Timeline ↗ · 2026-05-21

ZCube is a new network architecture that flattens the topology and mixes single-rail and multi-rail access to optimize KV cache transmission in long-context and Prefill-Decode (PD) separation scenarios. In the GLM-5.1 production cluster, it achieved a 33% reduction in switch and optical module costs, a 15% increase in GPU inference throughput, and a 40.6% decrease in TTFT P99.

@Zai_org: https://x.com/Zai_org/status/2057216685040443743

X AI KOLs Timeline ↗ · 2026-05-20 Cached

This paper presents ZCube, a novel network architecture developed by Z.ai, Harnets.AI, and Tsinghua University to address topology-induced congestion in Prefill-Decode disaggregated LLM inference clusters. Production deployments on GLM-5.1 coding workloads achieved a 33% reduction in network CapEx, 15% throughput improvement, and 40.6% reduction in TTFT P99 latency.

@zostaff: 20 years ago Jane Street's entire compute cluster was six Dell boxes stacked on the floor at the end of an office row. …

X AI KOLs Timeline ↗ · 2026-05-18 Cached

Jane Street gave Dwarkesh Patel a tour of its new Texas data center, which houses 4,032 GPUs with each rack drawing 140 kilowatts. The tour highlighted the facility's scale and its unusual networking choices.

@0xCheshire: Jane Street just released inside views of its Texas AI training center: 4,032 GPUs, 8,000 kilometers of fiber optic cable, and a fully deployed liquid cooling system because air cooling couldn't keep up. But what's truly stunning is the origin of this computing behemoth. Technical lead Ron Minsky recalls...

X AI KOLs Timeline ↗ · 2026-05-16 Cached

Jane Street shared inside views of its AI training center in Texas, which houses 4,032 GPUs, 8,000 kilometers of fiber optic cable, and a full liquid cooling system, and recounted its 20-year evolution from a humble start with six Dell hosts to today's large-scale trading infrastructure.

Introducing Stargate UK

OpenAI Blog ↗ · 2025-09-16 Cached

OpenAI is launching Stargate UK, an AI infrastructure partnership with NVIDIA and Nscale to build sovereign compute capabilities in the United Kingdom, with plans for up to 31,000 GPUs over time. The initiative supports the UK's national AI strategy and will enable OpenAI models to run on local UK compute for critical public services, regulated industries, and national security use cases.
