worst-case-complexity

Incremental BPE Tokenization

arXiv cs.CL · 2026-06-01

This paper introduces an incremental algorithm for Byte Pair Encoding (BPE) tokenization that processes each byte in O(log^2 t) time, enabling efficient partial tokenization in streaming settings and achieving speedups over existing implementations.
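For context, the streaming problem can be illustrated with a naive baseline that re-encodes the whole prefix each time a byte arrives. This is a minimal sketch and not the paper's algorithm: the function names `bpe_encode` and `stream_tokenize_naive` and the toy merge table are hypothetical, chosen only for illustration. The baseline does work at least proportional to the prefix length for every new byte, which is the per-byte cost an incremental method aims to avoid.

```python
def bpe_encode(data: bytes, merges: list[tuple[bytes, bytes]]) -> list[bytes]:
    """Naive BPE: repeatedly apply the earliest-learned merge that is present."""
    # Lower rank means the merge was learned earlier and has higher priority.
    rank = {pair: i for i, pair in enumerate(merges)}
    tokens = [bytes([b]) for b in data]
    while len(tokens) > 1:
        # Pick the adjacent pair with the lowest rank, if any pair is mergeable.
        best = min(
            (pair for pair in zip(tokens, tokens[1:]) if pair in rank),
            key=rank.__getitem__,
            default=None,
        )
        if best is None:
            break
        # Merge every non-overlapping occurrence of that pair, left to right.
        merged = []
        i = 0
        while i < len(tokens):
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == best:
                merged.append(tokens[i] + tokens[i + 1])
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return tokens


def stream_tokenize_naive(data: bytes, merges: list[tuple[bytes, bytes]]):
    """Yield the tokenization of each prefix by re-encoding it from scratch."""
    for end in range(1, len(data) + 1):
        yield bpe_encode(data[:end], merges)


if __name__ == "__main__":
    toy_merges = [(b"a", b"b"), (b"ab", b"c")]  # hypothetical merge table
    for prefix_tokens in stream_tokenize_naive(b"abcab", toy_merges):
        print(prefix_tokens)
```

Note that the tokenization of a prefix can change as later bytes arrive (here `ab` followed by `c` becomes the single token `abc`), which is why partial tokenization in a streaming setting is not simply a matter of appending tokens.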
