Dependency Treebank derived from the Mbyá Guaraní collection of Robert Dooley

Corpus sintácticamente anotado derivado de la colección Mbyá Guaraní de Robert Dooley

Object Details

Collection LanguageGuaraní, Mbyá
Language PIDailla:119734
Title [Indigenous]
Language of Indigenous Title
TitleDependency Treebank derived from the Mbyá Guaraní collection of Robert Dooley
Country(ies)Canada
Brazil
Collector(s)Thomas, Guillaume
Dooley, Robert A.
Depositor(s)Thomas, Guillaume
Project/Collector Websitehttps://www.gpythomas.com/
Description [Indigenous]
Language of Indigenous Description
DescriptionSyntactically annotated corpus of Mbyá Guaraní. The corpus consists of a text file that contains syntactic annotations of the texts contained in the Mbyá Guarani Collection of Robert Dooley, together with annotation guidelines. Syntactic annotation was realized in dependency grammar, following Universal Dependencies v.2 guidelines (Nivre et al. 2016).

The creation of this collection was supported by a New Researcher Award (2016-2017) to Guillaume Thomas from the Connaught Fund, University of Toronto. The texts were annotated using SIL Fieldworks (Black & Simons 2006) and Arborator (Gerdres 2013).

  • Andrew Black and Gary Simons. 2006. The SIL FieldWorks Language Explorer Approach to Morphological Parsing. Computational Linguistics for Less studied Languages: Texas Linguistics Society, 10. SIL.
  • Kim Gerdes, 2013. Collaborative dependency annotation. In Journal Proceedings of the second international conference on dependency linguistics (DepLing 2013), 88-97.
  • Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman. 2016. Universal Dependencies v1: A Multilingual Treebank Collection. In Proceedings of LREC.
References