brazilian-portuguese

TOTEN: Knowledge-Based Ontological Tokenization of Physical Quantities and Technical Notation in Brazilian Portuguese

arXiv cs.AI · 2026-06-20

TOTEN is a knowledge-based ontological tokenization framework for physical quantities and technical notation in Brazilian Portuguese. It replaces statistical tokenization with declarative classification grounded in a formal ontology of engineering entities, and it achieves high scores for ontological atomicity and numerical reconstruction.
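
To make the idea of declarative, ontology-grounded tokenization concrete, here is a minimal Python sketch. It is not the TOTEN implementation: the unit table `UNIT_CLASSES`, the `Token` dataclass, and the functions `parse_br_number` and `tokenize` are hypothetical names invented for illustration. It assumes that "ontological atomicity" means a quantity such as `9,81 m/s²` stays a single token, and that "numerical reconstruction" means recovering the numeric value from Brazilian number formatting (`.` as thousands separator, `,` as decimal separator).

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical mini-ontology: unit symbol -> physical quantity class.
# A real ontology of engineering entities would be far richer.
UNIT_CLASSES = {
    "m/s²": "acceleration",
    "kPa": "pressure",
    "mm": "length",
    "°C": "temperature",
    "kN": "force",
}


@dataclass(frozen=True)
class Token:
    text: str                              # surface form as written
    kind: str                              # "quantity" or "word"
    value: Optional[float] = None          # reconstructed numeric value
    unit: Optional[str] = None             # unit symbol, if any
    quantity_class: Optional[str] = None   # class looked up in UNIT_CLASSES


def parse_br_number(text: str) -> float:
    """Convert Brazilian formatting ('1.250,5') to a float (1250.5)."""
    return float(text.replace(".", "").replace(",", "."))


# Longest unit symbols first, so longer symbols win over shorter prefixes.
_UNITS = "|".join(
    re.escape(u) for u in sorted(UNIT_CLASSES, key=len, reverse=True)
)
# Either digit groups with '.' thousands separators, or a plain integer,
# each optionally followed by a ',' decimal part.
_NUMBER = r"\d{1,3}(?:\.\d{3})+(?:,\d+)?|\d+(?:,\d+)?"
_PATTERN = re.compile(
    rf"(?P<num>{_NUMBER})\s*(?P<unit>{_UNITS})(?!\w)|(?P<word>\w+)"
)


def tokenize(sentence: str) -> list[Token]:
    """Classify spans by rule: number+unit is one atomic quantity token."""
    tokens = []
    for m in _PATTERN.finditer(sentence):
        if m.group("num"):
            unit = m.group("unit")
            tokens.append(
                Token(
                    text=m.group(0),
                    kind="quantity",
                    value=parse_br_number(m.group("num")),
                    unit=unit,
                    quantity_class=UNIT_CLASSES[unit],
                )
            )
        else:
            tokens.append(Token(text=m.group("word"), kind="word"))
    return tokens


if __name__ == "__main__":
    for tok in tokenize("A pressão é de 1.250,5 kPa e a aceleração 9,81 m/s²"):
        print(tok)
```

A statistical subword tokenizer could split `1.250,5 kPa` into several fragments, whereas this rule-based sketch keeps it as one token that carries its value, unit, and class. Punctuation and numbers without a known unit are handled only crudely here; they are outside the scope of the illustration.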
