The knowledge-based system TERSEO was originally developed for the recognition and normalization of temporal expressions in Spanish and then extended to other languages: to English first, through the automatic translation of the temporal expressions, and then to Italian, applying a porting process where the automatic translation of the rules was combined with the extraction of expressions from an annotated corpus. In this paper we present a new automatic porting procedure, where resolution rules are automatically assigned to the temporal expressions that have been acquired in a new language, thus eliminating the need for automatic translation and consequently minimizing the errors produced. This is achieved by exploiting the rules of the temporal model, which are language independent, and the information extracted from the annotated corpus. Evaluation results of the updated version of TERSEO for English show a considerable improvement in recognition performance (+ 14% F-measure) with respect to the original system.
Saquete, E., Martinez-Barco, P., Munoz, R., Negri, M., Speranza, M., Sprugnoli, R., Automatic resolution rule assignment to multilingual temporal expressions using annotated corpora, Paper, in TIME 2006 International Symposium on Temporal Representation and Reasoning, (Budapest, Hungary, 15-17 June 2006), IEEE, Budapest 2006: 218-224. 10.1109/TIME.2006.9 [http://hdl.handle.net/10807/132987]
Automatic resolution rule assignment to multilingual temporal expressions using annotated corpora
Sprugnoli, Rachele
2006
Abstract
The knowledge-based system TERSEO was originally developed for the recognition and normalization of temporal expressions in Spanish and then extended to other languages: to English first, through the automatic translation of the temporal expressions, and then to Italian, applying a porting process where the automatic translation of the rules was combined with the extraction of expressions from an annotated corpus. In this paper we present a new automatic porting procedure, where resolution rules are automatically assigned to the temporal expressions that have been acquired in a new language, thus eliminating the need for automatic translation and consequently minimizing the errors produced. This is achieved by exploiting the rules of the temporal model, which are language independent, and the information extracted from the annotated corpus. Evaluation results of the updated version of TERSEO for English show a considerable improvement in recognition performance (+ 14% F-measure) with respect to the original system.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.