@charles_irl: another page for the @modal LLMEng Almanac: an explorer for low-precision floats, from bf16 to fp4 https://modal.com/ll…

Summary

This page from Modal's LLM Engineer's Almanac provides an interactive explorer for low-precision floating-point formats, from bf16 down to fp4.

another page for the @modal LLMEng Almanac: an explorer for low-precision floats, from bf16 to fp4 https://t.co/yOgLrOFNOY https://t.co/w2u1ND5AQi

Cached at: 05/18/26, 04:34 PM


LLM Engineer’s Almanac - Quant Formats

Source: https://modal.com/llm-almanac/quant-formats/e4::0x38

[Interactive explorer panel; the field values were not captured. Fields: Value; Bit Pattern (Sign, Exponent, Significand); Raw Hexadecimal Integer Value; Raw Decimal Integer Value; Hexadecimal Form (“%a”).]
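
The fields listed above can be approximated offline with a few lines of Python. This is a minimal sketch, not the explorer's own code: the helper name `split_fields` is hypothetical, the input `0x38` is taken from the Source URL, and the 1/4/3 split into sign, exponent, and significand bits is an assumption inferred from the 4-bit exponent and 3-bit fraction that appear in the page's evaluation.

```python
def split_fields(byte, exp_bits=4, man_bits=3):
    """Split an 8-bit pattern into (sign, exponent, significand) bit fields.

    Assumption: 1 sign bit, then exp_bits exponent bits, then man_bits
    significand bits, most significant bit first.
    """
    sign = (byte >> (exp_bits + man_bits)) & 0x1
    exponent = (byte >> man_bits) & ((1 << exp_bits) - 1)
    significand = byte & ((1 << man_bits) - 1)
    return sign, exponent, significand


raw = 0x38  # bit pattern taken from the Source URL
sign, exponent, significand = split_fields(raw)

print(f"bit pattern: {raw:08b}")
print(f"sign={sign:01b} exponent={exponent:04b} significand={significand:03b}")
print(f"raw hexadecimal: {raw:#04x}  raw decimal: {raw}")

# The page reports an exact value of 1 for this pattern; float.hex() is
# Python's counterpart to C's "%a" conversion.
print((1.0).hex())
```

For `0x38` this yields sign `0`, exponent `0111`, and significand `000`, which matches the operands in the base-2 evaluation.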


Evaluation in Base-2

$(-1)^{0} \times 10_2^{\,0111_2 - 0111_2} \times 1.000_2$

Evaluation in Base-10

$1 \times 2^{0} \times 1$

Exact Base-10 Value

1
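
The evaluation above follows the usual sign, exponent, significand recipe, which can be sketched in Python as follows. The function name `decode_e4m3` is hypothetical, and the layout (1 sign bit, 4 exponent bits, 3 significand bits, bias 7) is an assumption read off the operands shown above, where the bias appears as 0111 in base 2. The sketch handles normal and subnormal patterns only; it does not model any NaN or infinity encodings that a particular 8-bit format may reserve.

```python
from fractions import Fraction


def decode_e4m3(byte, exp_bits=4, man_bits=3, bias=7):
    """Return the exact value of an 8-bit float pattern as a Fraction.

    Assumed layout: sign | exponent | significand, with an implicit
    leading 1 for normal numbers. Special encodings are not handled.
    """
    sign = (byte >> (exp_bits + man_bits)) & 0x1
    exponent = (byte >> man_bits) & ((1 << exp_bits) - 1)
    fraction = byte & ((1 << man_bits) - 1)

    if exponent == 0:
        # Subnormal: no implicit leading 1, exponent fixed at 1 - bias.
        significand = Fraction(fraction, 1 << man_bits)
        scale = Fraction(2) ** (1 - bias)
    else:
        # Normal: implicit leading 1, exponent offset by the bias.
        significand = 1 + Fraction(fraction, 1 << man_bits)
        scale = Fraction(2) ** (exponent - bias)

    return (-1) ** sign * scale * significand


print(decode_e4m3(0x38))  # the pattern from the Source URL
```

Using `Fraction` keeps the result exact, in the spirit of the page's "Exact Base-10 Value" field; converting with `float(...)` gives the ordinary floating-point approximation when that is all that is needed.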
