checkpoint-restore

#checkpoint-restore

@charles_irl: Added a smol new section to last week's blog post on the technical internals of @modal's fast cold boots. This section …

X AI KOLs Following · 2026-05-18

Modal explains how it reduces AI inference cold starts by 40x using cloud buffers, a custom filesystem, checkpoint/restore, and CUDA checkpoint/restore, framing cloud buffer management as a linear optimization problem solved with GLOP.


#checkpoint-restore

How to achieve truly serverless GPUs (20 minute read)

TLDR AI · 2026-05-13

Modal explains the four key ingredients it developed to spin up serverless GPU inference replicas in seconds instead of minutes, enabling efficient GPU allocation for variable AI workloads.

