amd-rocm

#amd-rocm

Luce DFlash + PFlash on AMD Strix Halo: Qwen3.6-27B at 2.23x decode and 3.05x prefill vs llama.cpp HIP

Reddit r/LocalLLaMA ↗ · 2026-05-12

Luce releases DFlash and PFlash support for AMD Strix Halo APUs, achieving 2.23x decode and 3.05x prefill speedups over llama.cpp HIP on Qwen3.6-27B.

0 favorites 0 likes

#amd-rocm

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required

Hugging Face Blog ↗ · 2026-05-08 Cached

A tutorial and project demonstrating LoRA fine-tuning of Qwen3-1.7B on AMD MI300X using ROCm for clinical question answering, providing a CUDA-free alternative for medical AI development.

0 favorites 0 likes

amd-rocm

Luce DFlash + PFlash on AMD Strix Halo: Qwen3.6-27B at 2.23x decode and 3.05x prefill vs llama.cpp HIP

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required

Submit Feedback