This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the training of four annotators, and the evaluation of the syntactic annotation in terms of inter-annotator agreement, label accuracy (LA), unlabeled attachment score (UAS) and labeled attachment score (LAS). The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.
Cecchini, F. M., Sprugnoli, R., Moretti, G., Passarotti, M. C., UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works, in Proceedings of the Seventh Italian Conference on Computational Linguistics (Bologna, Italy, 1-3 March 2021), CEUR-WS.org, Bologna 2020: 1-7 [http://hdl.handle.net/10807/164927]
UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works
Cecchini, Flavio Massimiliano;Sprugnoli, Rachele;Passarotti, Marco Carlo
2020
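The three accuracy metrics named in the abstract can be illustrated with a minimal sketch. It assumes the usual definitions (LA: share of tokens with the correct dependency label; UAS: share with the correct head; LAS: share with both correct), which may differ in detail from the evaluation setup used in the paper. The `Token` tuples, the `attachment_scores` function and the toy sentence are hypothetical and stand in for the HEAD and DEPREL columns of a CoNLL-U file.

```python
from typing import Dict, List, Tuple

# Hypothetical simplification of a CoNLL-U token: (HEAD index, DEPREL label).
Token = Tuple[int, str]


def attachment_scores(gold: List[Token], pred: List[Token]) -> Dict[str, float]:
    """Compare a predicted annotation against a gold one, token by token."""
    if len(gold) != len(pred):
        raise ValueError("gold and predicted annotations must have the same length")
    if not gold:
        raise ValueError("cannot score an empty annotation")

    n = len(gold)
    # LA: the dependency label matches, regardless of the head.
    la = sum(g[1] == p[1] for g, p in zip(gold, pred))
    # UAS: the head matches, regardless of the label.
    uas = sum(g[0] == p[0] for g, p in zip(gold, pred))
    # LAS: both head and label match.
    las = sum(g == p for g, p in zip(gold, pred))
    return {"LA": la / n, "UAS": uas / n, "LAS": las / n}


# Toy four-token sentence (invented for illustration, not taken from the treebank).
gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (2, "punct")]
pred = [(2, "nsubj"), (0, "root"), (2, "obl"), (3, "punct")]
scores = attachment_scores(gold, pred)
print(scores)
```

In this toy case the third token has the right head but the wrong label, and the fourth has the right label but the wrong head, so LA and UAS are both 0.75 while LAS drops to 0.5.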