We apply word hierarchical clustering techniques to collect the occurrences of the lemma forma that show a similar contextual behaviour in the works of Thomas Aquinas into the same or closely related groups. Our results will support the lexicographers of a data-driven new lexicon of Thomas Aquinas in their task of writing the lexical entry of forma. We use two datasets: the Index Thomisticus (IT), a corpus containing the opera omnia of Thomas Aquinas, and the Index Thomisticus Treebank, a syntactically annotated subset of the IT. Results are evaluated against a manually labeled subset of the occurrences of forma.
Cantaluppi, G., Passarotti, M. C., The Meaning of forma in Thomas Aquinas. Hierarchical Clustering from the Index Thomisticus Treebank, in Vicari, D., Okada, A., Ragozini, G., Weihs, C. (ed.), Analysis and Modeling of Complex Data in Behavioral and Social Sciences, Springer, Cham 2014: <<STUDIES IN CLASSIFICATION, DATA ANALYSIS, AND KNOWLEDGE ORGANIZATION>>, 83- 91. 10.1007/978-3-319-06692-9_10 [http://hdl.handle.net/10807/61186]
The Meaning of forma in Thomas Aquinas. Hierarchical Clustering from the Index Thomisticus Treebank
Cantaluppi, Gabriele;Passarotti, Marco Carlo
2014
Abstract
We apply word hierarchical clustering techniques to collect the occurrences of the lemma forma that show a similar contextual behaviour in the works of Thomas Aquinas into the same or closely related groups. Our results will support the lexicographers of a data-driven new lexicon of Thomas Aquinas in their task of writing the lexical entry of forma. We use two datasets: the Index Thomisticus (IT), a corpus containing the opera omnia of Thomas Aquinas, and the Index Thomisticus Treebank, a syntactically annotated subset of the IT. Results are evaluated against a manually labeled subset of the occurrences of forma.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.