IRIS UniCatt

In this paper we introduce the DaDoEval shared task at EVALITA 2020, aimed at automatically assigning temporal information to documents written in Italian. The evaluation exercise comprises three levels of temporal granularity, from coarse-grained to year-based, and includes two types of test sets, either having the same genre of the training set, or a different one. More specifically, DaDoEval deals with the corpus of Alcide De Gasperi's documents, providing both public documents and letters as test sets. Two systems participated in the competition, achieving results always above the baseline in all subtasks. As expected, coarse-grained classification into five periods is rather easy to perform automatically, while the year-based one is still an unsolved problem also due to the lack of enough training data for some years. Results showed also that, although De Gasperi's letters in our test set were written in standard Italian and in a style which was not too colloquial, cross-genre classification yields remarkably lower results than the same-genre setting.

Menini, S., Moretti, G., Sprugnoli, R., Tonelli, S., DaDoEval @ EVALITA 2020: Same-genre and cross-genre dating of historical documents, in Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), (Online, 17-17 December 2020), Accademia University Press, Torino 2020: 391-397 [http://hdl.handle.net/10807/165686]

DaDoEval @ EVALITA 2020: Same-genre and cross-genre dating of historical documents

Menini S.^Primo;Moretti G.^Secondo;Sprugnoli, Rachele^Penultimo;Tonelli S.^Ultimo

2020

Abstract

In this paper we introduce the DaDoEval shared task at EVALITA 2020, aimed at automatically assigning temporal information to documents written in Italian. The evaluation exercise comprises three levels of temporal granularity, from coarse-grained to year-based, and includes two types of test sets, either having the same genre of the training set, or a different one. More specifically, DaDoEval deals with the corpus of Alcide De Gasperi's documents, providing both public documents and letters as test sets. Two systems participated in the competition, achieving results always above the baseline in all subtasks. As expected, coarse-grained classification into five periods is rather easy to perform automatically, while the year-based one is still an unsolved problem also due to the lack of enough training data for some years. Results showed also that, although De Gasperi's letters in our test set were written in standard Italian and in a style which was not too colloquial, cross-genre classification yields remarkably lower results than the same-genre setting.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2020
			
	Lingua del contenuto
	
				Inglese
			
	Titolo del volume che raccoglie gli atti
	
				Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020)
			
	Denominazione evento
	
				7th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. EVALITA 2020
			
	Luogo dell'evento
	
				Online
			
	Data inizio evento
	
				17-dic-2020
			
	Data fine evento
	
				17-dic-2020
			
	ISBN del volume
	
				9791280136275
			
	Editore
	
				Accademia University Press
			
	URL alternativo
	
				http://ceur-ws.org/Vol-2765/paper152.pdf
			
	Citazione
	
				Menini, S., Moretti, G., Sprugnoli, R., Tonelli, S.,  DaDoEval @ EVALITA 2020: Same-genre and cross-genre dating of historical documents, in Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), (Online,  17-17 December 2020), Accademia University Press, Torino 2020: 391-397 [http://hdl.handle.net/10807/165686]
			
	Appare nelle tipologie:
	
				Atti di Convegno, Congresso, Giornate di studio, ecc., Workshop (in volume)

File in questo prodotto:

File	Dimensione	Formato
paper152.pdf accesso aperto Tipologia file ?: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 320.8 kB Formato Adobe PDF Visualizza/Apri	320.8 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/165686

Citazioni

ND

6

ND

social impact