Santoro, D., Marchesi, B., Zampetta, S., Del Tredici, M., Biagetti, E., Litta Modignani Picozzi, E. M. G., Combei, C. R., Rocchi, S., Facchinetti, T., Ginevra, R., Zanchi, C., Exploring Latin WordNet synset annotation with LLMs, in Proceedings of the 13th Global Wordnet Conference, (Pavia, 27-31 January 2025), Global Wordnet Association, Pavia 2025: 66-76 [https://hdl.handle.net/10807/324821]
Exploring Latin WordNet synset annotation with LLMs
Litta Modignani Picozzi, Eleonora Maria Gabriella; Ginevra, Riccardo
2025
Abstract
This study explores the application of Large Language Models to populate synsets in the Latin WordNet, keeping a human-in-the-loop approach. We compare zero-shot, few-shot, and fine-tuning methods against an English baseline. Quantitative analysis reveals significant improvements from zero-shot to fine-tuned approaches, with the latter outperforming the baseline. Qualitative assessment indicates better performance with verbs and polysemous lemmas. While results are encouraging, human oversight remains crucial for accuracy. Future research could focus on improving performance across different parts of speech and degrees of polysemy, potentially incorporating etymological information or cross-linguistic data.



