IRIS UniCatt

This paper presents LatInfLexi, a large inflected lexicon of Latin providing information on all the inflected wordforms of 3,348 verbs and 1,038 nouns. After a description of the structure of the resource and some data on its size, the procedure followed to obtain the lexicon from the database of the Lemlat 3.0 morphological analyzer is detailed, as well as the choices made regarding overabundant and defective cells. The way in which the data of LatInfLexi can be exploited in order to perform a quantitative assessment of predictability in Latin verb inflection is then illustrated: results obtained by computing the conditional entropy of guessing the content of a paradigm cell assuming knowledge of one wordform or multiple wordforms are presented in turn, highlighting the descriptive and theoretical relevance of the analysis. Lastly, the paper envisages the advantages of an inclusion of LatInfLexi into the LiLa knowledge base, both for the presented resource and for the knowledge base itself.

Pellegrini, M., Using LatInfLexi for an Entropy-Based Assessment of Predictability in Latin Inflection, Paper, in Proceedings of LT4HALA 2020-1st Workshop on Language Technologies for Historical and Ancient Languages, (Marseille, 12-12 May 2020), European Language Resources Association (ELRA), Marseille 2020: 37-46 [https://hdl.handle.net/10807/166160]

Using LatInfLexi for an Entropy-Based Assessment of Predictability in Latin Inflection

Pellegrini, Matteo

2020

Abstract

This paper presents LatInfLexi, a large inflected lexicon of Latin providing information on all the inflected wordforms of 3,348 verbs and 1,038 nouns. After a description of the structure of the resource and some data on its size, the procedure followed to obtain the lexicon from the database of the Lemlat 3.0 morphological analyzer is detailed, as well as the choices made regarding overabundant and defective cells. The way in which the data of LatInfLexi can be exploited in order to perform a quantitative assessment of predictability in Latin verb inflection is then illustrated: results obtained by computing the conditional entropy of guessing the content of a paradigm cell assuming knowledge of one wordform or multiple wordforms are presented in turn, highlighting the descriptive and theoretical relevance of the analysis. Lastly, the paper envisages the advantages of an inclusion of LatInfLexi into the LiLa knowledge base, both for the presented resource and for the knowledge base itself.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2020
			
	Lingua del contenuto
	
				Inglese
			
	Titolo del volume che raccoglie gli atti
	
				Proceedings of LT4HALA 2020-1st Workshop on Language Technologies for Historical and Ancient Languages
			
	Denominazione evento
	
				LT4HLA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages
			
	Luogo dell'evento
	
				Marseille
			
	Tipo di contributo
	
				Paper
			
	Data inizio evento
	
				12-mag-2020
			
	Data fine evento
	
				12-mag-2020
			
	ISBN della pubblicazione
	
				979-10-95546-53-5
			
	Editore
	
				European Language Resources Association (ELRA)
			
	Citazione
	
				Pellegrini, M., Using LatInfLexi for an Entropy-Based Assessment of Predictability in Latin Inflection,  Paper, in Proceedings of LT4HALA 2020-1st Workshop on Language Technologies for Historical and Ancient Languages, (Marseille,  12-12 May 2020), European Language Resources Association (ELRA), Marseille 2020: 37-46 [https://hdl.handle.net/10807/166160]
			
	Appare nelle tipologie:
	
				Paper, Selected paper, Contributed paper, Working paper, Poster, Poster paper, Comunicazione, Relazione (in volume)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/166160

Citazioni

ND

ND

ND

social impact