Passarotti, M. C., Gonzalez Saavedra, B., Onambele Manga, C. L., Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin, in Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), (Portorož, 23-28 May 2016), European Language Resources Association (ELRA), Portorož 2016: 2599-2606 [http://hdl.handle.net/10807/78696]
Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Passarotti, Marco Carlo; Gonzalez Saavedra, Berta; Onambele Manga, Christophe Ledoux
2016
Abstract
Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is closely related to the still quite limited amount of linguistically annotated textual data for Latin, which could support the building of new lexical resources by providing empirical evidence. However, projects for creating new language resources for Latin have been launched over the last decade to fill this gap. In this paper, we present Latin Vallex, a valency lexicon for Latin built in mutual connection with the semantic and pragmatic annotation of two Latin treebanks featuring texts of different eras. On the one hand, this connection between the lexicon and the empirical evidence provided by the treebanks makes it possible to enrich each frame entry in the lexicon with its frequency in real data. On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon.