treebank

Tag

Cards List
#treebank

Meet UD_Czech-PDTC: A Large and Genre-Rich Treebank in Universal Dependencies

arXiv cs.CL · 16h ago Cached

This paper introduces UD_Czech-PDTC, a large and genre-diverse treebank for Czech in the Universal Dependencies framework, derived from the Prague Dependency Treebank-Consolidated. It describes the conversion process and differences between annotation schemes.

0 favorites 0 likes
#treebank

Prague Dependency Treebank -- Consolidated 2.0: Enriching a Complex Annotation Scheme

arXiv cs.CL · 16h ago Cached

We present the second consolidated version of the Prague Dependency Treebank, a 4-million-token manual multilingual annotation resource covering morphology, syntax, semantics, coreference, and discourse, along with compatible lexicons.

0 favorites 0 likes
#treebank

AthDGC: An Open Diachronic Greek Treebank with Indo-European Parallels

arXiv cs.CL · 2026-06-16 Cached

This paper introduces AthDGC, the first openly licensed dependency-parsed treebank of Greek spanning eight diachronic periods, with verse-level cross-alignment to four ancient Indo-European languages using NLP tools like Stanza, LaBSE, and multilingual-BERT.

0 favorites 0 likes
#treebank

AfriSUD: A Dependency Treebank Collection for Evaluating Models on African Languages

arXiv cs.CL · 2026-06-12 Cached

AfriSUD is a new dependency treebank collection for African languages, following the Surface-Syntactic Universal Dependencies (SUD) framework, designed to evaluate NLP models on languages like Naija, Wolof, and Yorùbá.

0 favorites 0 likes
← Back to home

Submit Feedback