This paper presents the automatic extension to other languages of TERSEO, a knowledge-based system for the recognition and normalization of temporal expressions originally developed for Spanish. TERSEO was first extended to English through the automatic translation of the temporal expressions. Then, an improved porting process was applied to Italian, where the automatic translation of the temporal expressions from English and from Spanish was combined with the extraction of new expressions from an Italian annotated corpus. Experimental results demonstrate how, while still adhering to the rule-based paradigm, the development of automatic rule translation procedures allowed us to minimize the effort required for porting to new languages. Relying on such procedures, and without any manual effort or previous knowledge of the target language, TERSEO recognizes and normalizes temporal expressions in Italian with good results (72% precision and 83% recall for recognition).

Saquete, E., Martinez-Barco, P., Munoz, R., Negri, M., Speranza, M., Sprugnoli, R., Multilingual Extension of a Temporal Expression Normalizer using Annotated Corpora, Paper, in EACL 2006 Workshop on Cross-Language Knowledge Induction, (Trento, Italy, 03-03 April 2006), Association for Computational Linguistics, Trento 2006: 1-8 [http://hdl.handle.net/10807/133009]

Multilingual Extension of a Temporal Expression Normalizer using Annotated Corpora

Sprugnoli, Rachele
2006

Abstract

This paper presents the automatic extension to other languages of TERSEO, a knowledge-based system for the recognition and normalization of temporal expressions originally developed for Spanish. TERSEO was first extended to English through the automatic translation of the temporal expressions. Then, an improved porting process was applied to Italian, where the automatic translation of the temporal expressions from English and from Spanish was combined with the extraction of new expressions from an Italian annotated corpus. Experimental results demonstrate how, while still adhering to the rule-based paradigm, the development of automatic rule translation procedures allowed us to minimize the effort required for porting to new languages. Relying on such procedures, and without any manual effort or previous knowledge of the target language, TERSEO recognizes and normalizes temporal expressions in Italian with good results (72% precision and 83% recall for recognition).
2006
Inglese
EACL 2006 Workshop on Cross-Language Knowledge Induction
EACL 2006 Workshop on Cross-Language Knowledge Induction
Trento, Italy
Paper
3-apr-2006
3-apr-2006
Association for Computational Linguistics
Saquete, E., Martinez-Barco, P., Munoz, R., Negri, M., Speranza, M., Sprugnoli, R., Multilingual Extension of a Temporal Expression Normalizer using Annotated Corpora, Paper, in EACL 2006 Workshop on Cross-Language Knowledge Induction, (Trento, Italy, 03-03 April 2006), Association for Computational Linguistics, Trento 2006: 1-8 [http://hdl.handle.net/10807/133009]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/133009
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact