This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.

Cecchini, F. M., Sprugnoli, R., Moretti, G., Passarotti, M. C., UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works, in Proceedings of the Seventh Italian Conference on Computational Linguistics. Bologna, Italy, March 1-3, (Bologna, 01-03 March 2021), CEUR-WS.org, Bologna 2020: 1-7 [http://hdl.handle.net/10807/164927]

UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works

Cecchini, Flavio Massimiliano;Sprugnoli, Rachele;Passarotti, Marco Carlo
2020

Abstract

This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.
2020
Inglese
Proceedings of the Seventh Italian Conference on Computational Linguistics. Bologna, Italy, March 1-3
Seventh Italian Conference on Computational Linguistics
Bologna
1-mar-2021
3-mar-2021
NA
CEUR-WS.org
Cecchini, F. M., Sprugnoli, R., Moretti, G., Passarotti, M. C., UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works, in Proceedings of the Seventh Italian Conference on Computational Linguistics. Bologna, Italy, March 1-3, (Bologna, 01-03 March 2021), CEUR-WS.org, Bologna 2020: 1-7 [http://hdl.handle.net/10807/164927]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/164927
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact