tinygrad

#tinygrad

@__tinygrad__: We are on the MLPerf board with AMD MI350X training Llama 8B. This is with our driver, runtime, kernels, and training l…

X AI KOLs Timeline ↗ · yesterday Cached

tinygrad announces it has achieved a spot on the MLPerf benchmark board using AMD MI350X hardware to train Llama 8B, with its own driver, runtime, kernels, and training loop, and plans to improve the time and tackle 405B next.

#tinygrad

You can do CUDA inference on an Apple Silicon Mac with PCI Passthrough

Reddit r/LocalLLaMA ↗ · 2026-05-08 Cached

This article explores the feasibility of using an external NVIDIA RTX 5090 GPU with an Apple Silicon Mac via Thunderbolt for CUDA inference and gaming, covering methods like tinygrad eGPU drivers and PCI passthrough to a Linux VM.

#tinygrad

Collected the infinity stones

Reddit r/LocalLLaMA ↗ · 2026-05-07

A user proposes building a heterogeneous AI cluster from Blackwell GPUs and high-memory servers connected via RDMA, and seeks collaborators for tinygrad driver development.
