In this paper, we conduct parsing experiments on Dante Alighieri{'}s Divine Comedy, an Old Italian poem composed between 1306-1321 and organized into three Cantiche {---}Inferno, Purgatorio, and Paradiso. We perform parsing on subsets of the poem using both a Modern Italian training set and sections of the Divine Comedy itself to evaluate under which scenarios parsers achieve higher scores. We find that employing in-domain training data supports better results, leading to an increase of approximately +17{\%} in Unlabeled Attachment Score (UAS) and +25-30{\%} in Labeled Attachment Score (LAS). Subsequently, we provide brief commentary on the differences in scores achieved among subsections of Cantiche, and we conduct experimental parsing on a text from the same period and style as the Divine Comedy.
Corbetta, C., Passarotti, M. C., Moretti, G., The Rise and Fall of Dependency Parsing in Dante Alighieri's Divine Comedy, in Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, (TORINO -- ITA, 25-25 May 2024), ELRA and ICCL, TORINO -- ITA 2024: 50-56 [https://hdl.handle.net/10807/278622]
The Rise and Fall of Dependency Parsing in Dante Alighieri's Divine Comedy
Passarotti, Marco Carlo;
2024
Abstract
In this paper, we conduct parsing experiments on Dante Alighieri{'}s Divine Comedy, an Old Italian poem composed between 1306-1321 and organized into three Cantiche {---}Inferno, Purgatorio, and Paradiso. We perform parsing on subsets of the poem using both a Modern Italian training set and sections of the Divine Comedy itself to evaluate under which scenarios parsers achieve higher scores. We find that employing in-domain training data supports better results, leading to an increase of approximately +17{\%} in Unlabeled Attachment Score (UAS) and +25-30{\%} in Labeled Attachment Score (LAS). Subsequently, we provide brief commentary on the differences in scores achieved among subsections of Cantiche, and we conduct experimental parsing on a text from the same period and style as the Divine Comedy.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.