@hotschmoe: After reading this post, I decided to get nvfp4 running on my Intel arc b70s just to see, after 12 hours it's running a…

X AI KOLs Following Tools

Summary

A user successfully ran nvfp4 quantization on Intel Arc B70s GPUs, achieving nearly double speed and higher accuracy compared to their best int4 configuration, challenging hardware-specific format assumptions.

After reading this post, I decided to get nvfp4 running on my Intel arc b70s just to see, after 12 hours it's running almost twice as fast, whole being more accurate, than my current best int4 autoround config "none of this was supposed to work"
Original Article
View Cached Full Text

Cached at: 07/04/26, 06:54 PM

After reading this post, I decided to get nvfp4 running on my Intel arc b70s just to see, after 12 hours it’s running almost twice as fast, whole being more accurate, than my current best int4 autoround config

“none of this was supposed to work”

Eric Hartford (@QuixiAI): “none of this was supposed to work”

Pfuh! Power to the people!

No more “gguf/mlx is for mac, bf16 is for ampere, nvfp4 is for nvidia” nonsense.

Similar Articles