In this paper we describe the process of inclusion of etymological information in a knowledge base of interoperable Latin linguistic resources developed in the context of the LiLa: Linking Latin project. Interoperability is obtained by applying the Linked Open Data principles. Particularly, an extensive collection of Latin lemmas is used to link the (distributed) resources. For the etymology, we rely on the Ontolex-lemon ontology and the lemonEty extension to model the information, while the source data are taken from a recent etymological dictionary of Latin. As a result, the collection of lemmas LiLa is built around now includes 1,465 Proto-Italic and 1,393 Proto-Indo-European reconstructed forms that are used to explain the history of 1,400 Latin words. We discuss the motivation, methodology and modeling strategies of the work, as well as its possible applications and potential future developments.

Mambrini, F., Passarotti, M. C., Representing Etymology in the LiLa Knowledge Base of Linguistic Resources for Latin, in Proceedings of the Globalex Workshop on Linked Lexicography. LREC 2020 Workshop, (Marseille, 12-12 May 2020), European Language Resources Association (ELRA), Paris 2020: 20-28. [10.5281/zenodo.3862156] [http://hdl.handle.net/10807/153771]

Representing Etymology in the LiLa Knowledge Base of Linguistic Resources for Latin

Mambrini, Francesco;Passarotti, Marco Carlo
2020

Abstract

In this paper we describe the process of inclusion of etymological information in a knowledge base of interoperable Latin linguistic resources developed in the context of the LiLa: Linking Latin project. Interoperability is obtained by applying the Linked Open Data principles. Particularly, an extensive collection of Latin lemmas is used to link the (distributed) resources. For the etymology, we rely on the Ontolex-lemon ontology and the lemonEty extension to model the information, while the source data are taken from a recent etymological dictionary of Latin. As a result, the collection of lemmas LiLa is built around now includes 1,465 Proto-Italic and 1,393 Proto-Indo-European reconstructed forms that are used to explain the history of 1,400 Latin words. We discuss the motivation, methodology and modeling strategies of the work, as well as its possible applications and potential future developments.
2020
Inglese
Proceedings of the Globalex Workshop on Linked Lexicography. LREC 2020 Workshop
Globalex Workshop on Linked Lexicography
Marseille
12-mag-2020
12-mag-2020
979-10-95546-46-7
European Language Resources Association (ELRA)
Mambrini, F., Passarotti, M. C., Representing Etymology in the LiLa Knowledge Base of Linguistic Resources for Latin, in Proceedings of the Globalex Workshop on Linked Lexicography. LREC 2020 Workshop, (Marseille, 12-12 May 2020), European Language Resources Association (ELRA), Paris 2020: 20-28. [10.5281/zenodo.3862156] [http://hdl.handle.net/10807/153771]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/153771
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact