@raydistributed: We just released Ray 2.56! This includes - Ray Data stability improvements: reduced object store spilling, automatic ba…

X AI KOLs Following Tools

Summary

Ray 2.56 has been released with improvements to Ray Data, Ray Serve for LLMs, GPU-domain-aware placement groups, and Kubernetes integration.

We just released Ray 2.56! This includes - Ray Data stability improvements: reduced object store spilling, automatic batch size selection - Ray Serve LLM re-architecture: decoupling request handling from the token streaming response path, LLM serving performance improvements, new routing policies like session-sticky routing via consistent hashing - Ray Core GPU-domain-aware placement groups: enables placement groups to pack bundles onto nodes that share a http://ray.io/gpu-domain label instead of only packing at the single-node level - Kubernetes integration: initial Kubernetes in-place pod resizing support for Autoscaler v2
Original Article

Similar Articles

@robertnishihara: Try Ray 2.56!

X AI KOLs Following

Ray 2.56 is released with stability improvements for Ray Data and a re-architecture of Ray Serve for better LLM serving performance.