This paper presents a structured framework for WordNet synset selection applied to Ancient Greek lexical material. Starting from synonym definitions extracted from the Liddell–Scott–Jones (LSJ) lexicon, we compare two strategies: hierarchy-driven aggregation via bounded hypernym trees and LLM-based definitional matching with pairwise ranking. Graded human evaluation shows that structure-aware methods provide a robust baseline, particularly for nouns and verbs, while LLM-based reranking does not consistently improve performance, especially for highly ploysemous groups of synonyms. Beyond supporting the development of an Ancient Greek WordNet, the study highlights the methodological portability of the framework to other languages and lexical resources.

Brigada Villa, L., Passarotti, M. C., Zanchi, C., Ginevra, R., Fratellini, E., Litta Modignani Picozzi, E. M. G., Evaluating Hierarchical Aggregation and LLM-Based Matching for Synset Selection in Ancient Greek, in Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @LREC 2026, (Palma De Mallorca, 11-11 May 2026), European Language Resources Association (ELRA), Palma De Mallorca 2026: 368-379 [https://hdl.handle.net/10807/335518]

Evaluating Hierarchical Aggregation and LLM-Based Matching for Synset Selection in Ancient Greek

Passarotti, Marco Carlo;Zanchi, Chiara;Ginevra, Riccardo;Fratellini, Erica;Litta Modignani Picozzi, Eleonora Maria Gabriella
2026

Abstract

This paper presents a structured framework for WordNet synset selection applied to Ancient Greek lexical material. Starting from synonym definitions extracted from the Liddell–Scott–Jones (LSJ) lexicon, we compare two strategies: hierarchy-driven aggregation via bounded hypernym trees and LLM-based definitional matching with pairwise ranking. Graded human evaluation shows that structure-aware methods provide a robust baseline, particularly for nouns and verbs, while LLM-based reranking does not consistently improve performance, especially for highly ploysemous groups of synonyms. Beyond supporting the development of an Ancient Greek WordNet, the study highlights the methodological portability of the framework to other languages and lexical resources.
2026
Inglese
Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @LREC 2026
Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026)
Palma De Mallorca
11-mag-2026
11-mag-2026
978-2-493814-58-6
European Language Resources Association (ELRA)
Brigada Villa, L., Passarotti, M. C., Zanchi, C., Ginevra, R., Fratellini, E., Litta Modignani Picozzi, E. M. G., Evaluating Hierarchical Aggregation and LLM-Based Matching for Synset Selection in Ancient Greek, in Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @LREC 2026, (Palma De Mallorca, 11-11 May 2026), European Language Resources Association (ELRA), Palma De Mallorca 2026: 368-379 [https://hdl.handle.net/10807/335518]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/335518
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact