Tag
FlashLib updates to support ANN search with IVF-Flat, achieving up to 6.5× faster performance than cuVS on real-world vector workloads. LEANN now integrates FlashLib as a backend, offering substantial speedups in build and search operations.
The Flash-KMeans team releases FlashLib, a GPU library for classical ML operators that achieves up to 208x speedups over cuML on Hopper GPUs, with a focus on fast, predictable performance for agentic AI workloads.