static-batching


@pallavishekhar_: Continuous Batching in LLMs. Read here: https://outcomeschool.com/blog/continuous-batching-in-llms…

X AI KOLs Timeline · yesterday

A blog post explaining continuous batching, a technique for improving LLM serving throughput. Instead of waiting for every request in a batch to finish, the server hands a slot to a waiting request as soon as the request occupying it completes, which keeps the GPU busy and reduces idle time.
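The scheduling idea can be illustrated with a toy simulation. This is a minimal sketch, not code from the linked post: the function names, the fixed `batch_size`, and the model of one decode step producing one token per running request are all assumptions made for illustration. It counts how many decode steps a server needs under static batching (each batch runs until its longest request ends) versus continuous batching (a freed slot is refilled before the next step).

```python
from collections import deque


def static_batching_steps(lengths, batch_size):
    """Decode steps when each batch must wait for its longest request."""
    steps = 0
    for i in range(0, len(lengths), batch_size):
        # The whole batch occupies the GPU until its slowest member finishes.
        steps += max(lengths[i:i + batch_size])
    return steps


def continuous_batching_steps(lengths, batch_size):
    """Decode steps when a finished request's slot is refilled immediately."""
    waiting = deque(lengths)   # requests not yet started (tokens to generate)
    running = []               # tokens remaining for each in-flight request
    steps = 0
    while waiting or running:
        # Fill any free slots from the queue before running the next step.
        while waiting and len(running) < batch_size:
            running.append(waiting.popleft())
        steps += 1
        # One step yields one token per running request; drop finished ones.
        running = [r - 1 for r in running if r > 1]
    return steps


# Hypothetical workload: output lengths in tokens, all assumed to be >= 1.
lengths = [2, 8, 3, 5, 1, 4]
print(static_batching_steps(lengths, batch_size=2))      # 17
print(continuous_batching_steps(lengths, batch_size=2))  # 13
```

With this made-up workload, static batching spends 17 steps because short requests sit idle next to long ones, while continuous batching needs 13. A real serving engine also has to handle prefill cost, memory limits, and request arrival times, which this sketch ignores.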
