Moretti, G., Sprugnoli, R., Tonelli, S., Digging in the Dirt: Extracting Keyphrases from Texts with KD, Paper, in Proceedings of the Second Italian Conference on Computational Linguistics CLiC-it 2015, (Trento, 03-04 December 2015), Accademia University Press srl, Torino 2015: 198-203 [http://hdl.handle.net/10807/132952]

Digging in the Dirt: Extracting Keyphrases from Texts with KD

Sprugnoli, Rachele (second author)

Abstract

In this paper we present a keyphrase extraction system called Keyphrase Digger (KD). The tool uses both statistical measures and linguistic information to detect a weighted list of n-grams representing the most important concepts of a text. KD is a reimplementation of an existing tool, extended with new features, a high level of customizability and a shorter processing time, and evaluated extensively on different text genres in English and Italian (i.e., scientific articles and historical texts).
Year: 2015
Language: English
Proceedings: Proceedings of the Second Italian Conference on Computational Linguistics CLiC-it 2015
Conference: CLiC-it 2015
Location: Trento
Type: Paper
Conference dates: 3-4 December 2015
ISBN: 978-88-99200-62-6
Publisher: Accademia University Press srl
Use this identifier to cite or link to this document: https://hdl.handle.net/10807/132952