block-quantization


@charles_irl: Low-precision floats are weird. I have been building up my intuition by playing with them outside of inference/training…

X AI KOLs Following · yesterday Cached

A tweet thread introduces a visualizer for microscaling (block-quantized) formats such as NVFP4 and MXFP4. It explains how these low-precision floats work and how LLM inference uses them to reduce memory-bandwidth demands.
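To make the block-scaling idea concrete, here is a minimal Python sketch in the style of MXFP4: each block shares one power-of-two scale, and every element is stored as a 4-bit E2M1 value. It is not taken from the thread or the visualizer. The names `quantize_block` and `dequantize_block` are hypothetical, and the scale rule used here (the smallest power of two that avoids clipping) is a simplifying assumption; real implementations may choose the scale differently.

```python
import math

# Magnitudes representable by FP4 E2M1 (1 sign bit, 2 exponent bits, 1 mantissa bit).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
E2M1_MAX = 6.0
MXFP4_BLOCK_SIZE = 32  # elements sharing one scale in MXFP4 (not enforced below)


def quantize_block(block):
    """Return (scale, codes) such that each input is roughly scale * code.

    Assumption: the shared scale is the smallest power of two that keeps the
    largest magnitude in the block at or below E2M1_MAX, so nothing is clipped.
    """
    amax = max(abs(x) for x in block)
    if amax == 0.0:
        return 1.0, [0.0] * len(block)
    scale = 2.0 ** math.ceil(math.log2(amax / E2M1_MAX))
    codes = []
    for x in block:
        magnitude = abs(x) / scale
        # Round to the nearest representable E2M1 magnitude (ties go to the smaller one).
        nearest = min(E2M1_VALUES, key=lambda v: abs(v - magnitude))
        codes.append(math.copysign(nearest, x))
    return scale, codes


def dequantize_block(scale, codes):
    """Reconstruct approximate values from the shared scale and the 4-bit codes."""
    return [scale * c for c in codes]


# A short block for illustration; a real MXFP4 block would hold 32 values.
scale, codes = quantize_block([12.0, 1.0, -3.0, 0.0])
restored = dequantize_block(scale, codes)
```

Because only eight magnitudes are available per element, the shared scale does most of the work of covering the block's dynamic range. This is also where the bandwidth saving comes from: each element costs 4 bits, plus one scale amortized over the whole block.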
