toki-pona

Tag

Cards List
#toki-pona

Examining the Limits of Word2Vec with Toki Pona

arXiv cs.CL · 2026-06-17 Cached

This paper investigates whether Word2Vec can generate meaningful semantic embeddings for Toki Pona, a constructed language with only ~130 words, using a corpus of 1.4 million sentences, and examines the effect of non-Toki Pona tokens on embedding quality.

0 favorites 0 likes
← Back to home

Submit Feedback