We present an overview of the Index Thomisticus Treebank project (IT-TB). The ITTB consists of around 60,000 tokens from the Index Thomisticus by Roberto Busa SJ, an 11- million-token Latin corpus of the texts by Thomas Aquinas. We briefly describe the annotation guidelines, shared with the Latin Dependency Treebank (LDT). The application of data-driven dependency parsers on IT-TB and LDT data is reported on. We present training and parsing results on several datasets and provide evaluation of learning algorithms and techniques. Furthermore, we introduce the IT-TB valency lexicon extracted from the treebank. We report on quantitative data of the lexicon and provide some statistical measures on subcategorisation structures.

Mcgillivray, B., Passarotti, M. C., Ruffolo, P., The Index Thomisticus Treebank Project: Annotation, Parsing and Valency Lexicon, <<REVUE TAL>>, 2009; 50(2) (2): 103-127 [http://hdl.handle.net/10807/1401]

The Index Thomisticus Treebank Project: Annotation, Parsing and Valency Lexicon

Mcgillivray, Barbara;Passarotti, Marco Carlo;
2009

Abstract

We present an overview of the Index Thomisticus Treebank project (IT-TB). The ITTB consists of around 60,000 tokens from the Index Thomisticus by Roberto Busa SJ, an 11- million-token Latin corpus of the texts by Thomas Aquinas. We briefly describe the annotation guidelines, shared with the Latin Dependency Treebank (LDT). The application of data-driven dependency parsers on IT-TB and LDT data is reported on. We present training and parsing results on several datasets and provide evaluation of learning algorithms and techniques. Furthermore, we introduce the IT-TB valency lexicon extracted from the treebank. We report on quantitative data of the lexicon and provide some statistical measures on subcategorisation structures.
2009
Inglese
Mcgillivray, B., Passarotti, M. C., Ruffolo, P., The Index Thomisticus Treebank Project: Annotation, Parsing and Valency Lexicon, <<REVUE TAL>>, 2009; 50(2) (2): 103-127 [http://hdl.handle.net/10807/1401]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/1401
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact