This paper presents a structured framework for WordNet synset selection applied to Ancient Greek lexical material. Starting from synonym definitions extracted from the Liddell–Scott–Jones (LSJ) lexicon, we compare two strategies: hierarchy-driven aggregation via bounded hypernym trees and LLM-based definitional matching with pairwise ranking. Graded human evaluation shows that structure-aware methods provide a robust baseline, particularly for nouns and verbs, while LLM-based reranking does not consistently improve performance, especially for highly ploysemous groups of synonyms. Beyond supporting the development of an Ancient Greek WordNet, the study highlights the methodological portability of the framework to other languages and lexical resources.
Brigada Villa, L., Passarotti, M. C., Zanchi, C., Ginevra, R., Fratellini, E., Litta Modignani Picozzi, E. M. G., Evaluating Hierarchical Aggregation and LLM-Based Matching for Synset Selection in Ancient Greek, in Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @LREC 2026, (Palma De Mallorca, 11-11 May 2026), European Language Resources Association (ELRA), Palma De Mallorca 2026: 368-379 [https://hdl.handle.net/10807/335518]
Evaluating Hierarchical Aggregation and LLM-Based Matching for Synset Selection in Ancient Greek
Passarotti, Marco Carlo;Zanchi, Chiara;Ginevra, Riccardo;Fratellini, Erica;Litta Modignani Picozzi, Eleonora Maria Gabriella
2026
Abstract
This paper presents a structured framework for WordNet synset selection applied to Ancient Greek lexical material. Starting from synonym definitions extracted from the Liddell–Scott–Jones (LSJ) lexicon, we compare two strategies: hierarchy-driven aggregation via bounded hypernym trees and LLM-based definitional matching with pairwise ranking. Graded human evaluation shows that structure-aware methods provide a robust baseline, particularly for nouns and verbs, while LLM-based reranking does not consistently improve performance, especially for highly ploysemous groups of synonyms. Beyond supporting the development of an Ancient Greek WordNet, the study highlights the methodological portability of the framework to other languages and lexical resources.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



