
Mambrini, F., Passarotti, M., Moretti, G., Pellegrini, M., The Index Thomisticus Treebank as Linked Data in the LiLa Knowledge Base, in Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), (Marseille, 20-25 June 2022), European Language Resources Association (ELRA), Marseille 2022: 4022-4029. [10.5281/zenodo.6664693] [http://hdl.handle.net/10807/210702]

The Index Thomisticus Treebank as Linked Data in the LiLa Knowledge Base

Mambrini, Francesco; Passarotti, Marco; Pellegrini, Matteo
2022

Abstract

Although the Universal Dependencies initiative now allows for cross-linguistically consistent annotation of morphology and syntax in treebanks for several languages, syntactically annotated corpora are not yet interoperable with many of the lexical resources that describe properties of the words occurring in them. To address this limitation, we propose adopting the principles of the Linguistic Linked Open Data (LLOD) community to describe and publish dependency treebanks as LLOD. In particular, this paper illustrates the approach pursued in the LiLa Knowledge Base, which enables interoperability between corpora and lexical resources for Latin, to publish as LLOD the annotation layers of two versions of a Medieval Latin treebank, the Index Thomisticus Treebank.
English
Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022)
13th Conference on Language Resources and Evaluation (LREC 2022)
Marseille
20 June 2022
25 June 2022
979-10-95546-72-6
European Language Resources Association (ELRA)


Use this identifier to cite or link to this document: https://hdl.handle.net/10807/210702