We present the results of our attempt to use NLP tools in order to identify named entities in the publications of the Deutsches Archäologisches Institute (DAI) and link the identified locations to entries in the iDAI.gazetteer. Our case study focuses on articles written in German and published in the journal Chiron between 1971 and 2014. We describe the annotation pipeline that starts from the digitized texts published in the new portal of the DAI. We evaluate the performances of geoparsing and NER and test an approach to improve the accuracy of the latter.

Mambrini, F., The iDAI.publication: Extracting and Linking Information in the Publications of the German Archaeological Institute (DAI), in Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018). 10-12 December 2018, Torino, (ita, 10-12 December 2018), CEUR-WS, Torino 2018:<<CEUR WORKSHOP PROCEEDINGS>>,2253 253-257. [10.4000/books.aaccademia.3456] [http://hdl.handle.net/10807/133011]

The iDAI.publication: Extracting and Linking Information in the Publications of the German Archaeological Institute (DAI)

Mambrini, Francesco
2018

Abstract

We present the results of our attempt to use NLP tools in order to identify named entities in the publications of the Deutsches Archäologisches Institute (DAI) and link the identified locations to entries in the iDAI.gazetteer. Our case study focuses on articles written in German and published in the journal Chiron between 1971 and 2014. We describe the annotation pipeline that starts from the digitized texts published in the new portal of the DAI. We evaluate the performances of geoparsing and NER and test an approach to improve the accuracy of the latter.
2018
Inglese
Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018). 10-12 December 2018, Torino
5th Italian Conference on Computational Linguistics, CLiC-it 2018
ita
10-dic-2018
12-dic-2018
978-88-31978-41-5
CEUR-WS
Mambrini, F., The iDAI.publication: Extracting and Linking Information in the Publications of the German Archaeological Institute (DAI), in Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018). 10-12 December 2018, Torino, (ita, 10-12 December 2018), CEUR-WS, Torino 2018:<<CEUR WORKSHOP PROCEEDINGS>>,2253 253-257. [10.4000/books.aaccademia.3456] [http://hdl.handle.net/10807/133011]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/133011
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact