This paper presents the initial stages of a project focused on coreference and anaphora resolution in Latin texts. By building a corpus enhanced with coreference/anaphora annotation, the project wants to explore empirically a layer of metalinguistic analysis that has not been yet extensively investigated in linguistic resources and natural language processing for Latin. After reviewing the related work on this NLP task, the paper discusses annotation criteria and data analysis, providing examples about a few issues that emerged during the annotation process.
Delfino, E., Leotta, R. G., Passarotti, M. C., Moretti, G., Building CorefLat. a Linguistic Resource for Coreference and Anaphora Resolution in Latin, in Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), (Pisa, 04-06 December 2024), CEUR Workshop Proceedings, Pisa 2024: 273-279 [https://hdl.handle.net/10807/308719]
Building CorefLat. a Linguistic Resource for Coreference and Anaphora Resolution in Latin
Leotta, Roberta Grazia;Passarotti, Marco Carlo;
2024
Abstract
This paper presents the initial stages of a project focused on coreference and anaphora resolution in Latin texts. By building a corpus enhanced with coreference/anaphora annotation, the project wants to explore empirically a layer of metalinguistic analysis that has not been yet extensively investigated in linguistic resources and natural language processing for Latin. After reviewing the related work on this NLP task, the paper discusses annotation criteria and data analysis, providing examples about a few issues that emerged during the annotation process.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.