IRIS UniCatt

This paper describes the inclusion of Sicilian in the Semantic Web through the development of new resources aligned with Linguistic Linked Open Data principles. More specifically, we model and publish the first Sicilian Lemma Bank and a bilingual Sicilian–Italian glossary extracted from the Sicilian Wiktionary (Wikizziunariu). These resources are formalized using the OntoLex-Lemon and LiLa (Linking Latin) ontologies with the aim of enabling cross-lingual interoperability. The glossary is also linked to the LiITA (Linking Italian) knowledge base. In addition, two preliminary experiments are reported: the first evaluates the translation capabilities of commercial Large Language Models (LLMs) from Sicilian into Italian; the second investigates bilingual lexicon induction through cross-lingual embedding alignment, with results indicating the challenges posed by low-resource dialects. This work aims to demonstrate the feasibility and importance of integrating under-resourced languages into broader Computational Linguistics and Semantic Web infrastructures.

Sprugnoli, R., Moretti, G., Muscianisi, D. G., Litta Modignani Picozzi, E. M. G., Ciallabacialla! Modeling and Linking a Regional Lexical Resource to Include Sicilian in the Semantic Web, in Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025), (Cagliari (Italia), 24-26 September 2025), CEUR Workshop Proceedings (CEUR-WS.org), Cagliari 2025:<<CEUR WORKSHOP PROCEEDINGS>>,4112 1093-1101 [https://hdl.handle.net/10807/337630]

Ciallabacialla! Modeling and Linking a Regional Lexical Resource to Include Sicilian in the Semantic Web

Sprugnoli, Rachele^{Primo

Writing – Original Draft Preparation};Muscianisi D. G.;Litta Modignani Picozzi, Eleonora Maria Gabriella^{Ultimo

Writing – Review & Editing}

2025

Abstract

This paper describes the inclusion of Sicilian in the Semantic Web through the development of new resources aligned with Linguistic Linked Open Data principles. More specifically, we model and publish the first Sicilian Lemma Bank and a bilingual Sicilian–Italian glossary extracted from the Sicilian Wiktionary (Wikizziunariu). These resources are formalized using the OntoLex-Lemon and LiLa (Linking Latin) ontologies with the aim of enabling cross-lingual interoperability. The glossary is also linked to the LiITA (Linking Italian) knowledge base. In addition, two preliminary experiments are reported: the first evaluates the translation capabilities of commercial Large Language Models (LLMs) from Sicilian into Italian; the second investigates bilingual lexicon induction through cross-lingual embedding alignment, with results indicating the challenges posed by low-resource dialects. This work aims to demonstrate the feasibility and importance of integrating under-resourced languages into broader Computational Linguistics and Semantic Web infrastructures.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2025
			
	Lingua del contenuto
	
				Inglese
			
	Titolo del volume che raccoglie gli atti
	
				Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
			
	Denominazione evento
	
				Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
			
	Luogo dell'evento
	
				Cagliari (Italia)
			
	Data inizio evento
	
				24-set-2025
			
	Data fine evento
	
				26-set-2025
			
	ISBN del volume
	
				979-12-243-0587-3
			
	Nome della collana/serie
	
				CEUR WORKSHOP PROCEEDINGS
			
	Editore
	
				CEUR Workshop Proceedings (CEUR-WS.org)
			
	URL alternativo
	
				https://aclanthology.org/2025.clicit-1.103/
			
	Citazione
	
				Sprugnoli, R., Moretti, G., Muscianisi, D. G., Litta Modignani Picozzi, E. M. G.,  Ciallabacialla! Modeling and Linking a Regional Lexical Resource to Include Sicilian in the Semantic Web, in Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025), (Cagliari (Italia),  24-26 September 2025), CEUR Workshop Proceedings (CEUR-WS.org), Cagliari 2025:<<CEUR WORKSHOP PROCEEDINGS>>,4112 1093-1101 [https://hdl.handle.net/10807/337630]
			
	Appare nelle tipologie:
	
				Atti di Convegno, Congresso, Giornate di studio, ecc., Workshop (in volume)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/337630

Citazioni

ND

0

ND

social impact