ROCm Status in mid 2026 [D]

Reddit r/MachineLearning News

Summary

The author asks about the current viability of AMD's ROCm ecosystem for AI training in mid-2026, comparing it to NVIDIA's CUDA and asking if it has reached a 'just works' stage for PyTorch.

Hey folks, I'm starting to hear that ROCm works fine for inference now, but I've not seen any reports on how viable it is for training. I have a couple of RTX 3090s that I use for prototyping models, and I'm considering switching to a pair of RX 7900 XTXs instead. On paper at least, the RX 7900 XTX offers about 4 times the FP16 throughput with similar power draw, VRAM, and cost. Based on the PyTorch docs, it seems like ROCm is now fully supported, but I'm struggling to find user reports on how well PyTorch runs with ROCm instead of CUDA. How viable is it to switch over to ROCm at the moment? Is it at the "it just works" stage yet? Or is the AMD ecosystem still significantly behind CUDA?
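
For anyone trying this on their own machine, a quick first check is which backend the installed PyTorch build actually targets. The sketch below is only that, a sanity check and not a measure of training viability. It relies on `torch.version.hip` being set on ROCm builds and on ROCm builds exposing GPUs through the usual `torch.cuda` API. The helper name `describe_torch_backend` is made up for illustration.

```python
import importlib.util


def describe_torch_backend():
    """Report which GPU backend the installed PyTorch build targets.

    Hypothetical helper for illustration; not part of PyTorch.
    """
    info = {"installed": False, "backend": None, "gpu_available": False, "devices": []}

    # Bail out cleanly if PyTorch is not installed in this environment.
    if importlib.util.find_spec("torch") is None:
        return info
    try:
        import torch
    except ImportError:
        return info

    info["installed"] = True

    # ROCm builds set torch.version.hip; CUDA builds set torch.version.cuda.
    hip = getattr(torch.version, "hip", None)
    cuda = getattr(torch.version, "cuda", None)
    info["backend"] = "rocm" if hip else ("cuda" if cuda else "cpu")

    # On ROCm builds, AMD GPUs are still reached through the torch.cuda namespace.
    if torch.cuda.is_available():
        info["gpu_available"] = True
        info["devices"] = [
            torch.cuda.get_device_name(i) for i in range(torch.cuda.device_count())
        ]
    return info


if __name__ == "__main__":
    print(describe_torch_backend())
```

If this reports `rocm` and lists the cards, existing training scripts that use `device="cuda"` should at least find the GPUs. Whether every kernel a given model needs is fast and stable is a separate question that only a real training run answers.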

Similar Articles

ROCm vs Vulkan vs vLLM on Dual R9700's

Reddit r/LocalLLaMA

A comparison of AI inference frameworks ROCm, Vulkan, and vLLM running on dual AMD Radeon 9700 GPUs, likely benchmarking performance for large language models.

Strix Halo ROCm + MTP Notes (May 2026)

Reddit r/LocalLLaMA

Technical benchmark comparing ROCm and Vulkan backends for LLM inference on Strix Halo hardware after MTP merged into llama.cpp, revealing ROCm suffers severe performance drops at full context while Vulkan remains stable.