IRIS UniCatt

Glaucoma is a leading cause of irreversible blindness worldwide, affecting millions annually. Early diagnosis is crucial but challenging due to the asymptomatic nature of the disease and the complexity of clinical signs. Advanced ocular imaging generates complex, voluminous data, demanding significant expertise for interpretation. Artificial intelligence (AI) is transforming this field by enabling efficient data analysis. We present a comprehensive approach that combines advanced image preprocessing techniques, leveraging the Discrete Fourier Transform (DFT) implemented via the Fast Fourier Transform (FFT) algorithm, with state-of-the-art AI models to enhance diagnostic accuracy. Using the REFUGE2 dataset, this pre-processing achieved high visual fidelity, with SSIM values of 0.9225 for training and 0.8247 for testing. For classification, we employed Transformer architectures, including Masked Image Modeling (MIM) and Masked Autoencoder (MAE), integrated with Self-Supervised Learning (SimCLR). This multi-model approach yielded an accuracy of 90.06% with MIM and 90.14% with MAE on preprocessed images, surpassing the performance achieved on non-preprocessed data. The MIM model demonstrated a strong discriminative capability, highlighted by an AUC-ROC of 0.9298. Our findings show that combining robust preprocessing with Transformer networks significantly improves efficiency and diagnostic accuracy in the early detection of glaucoma.

Frasca, M., Lin, J., La Torre, D., Self-supervised learning model leveraging structural similarity index for glaucoma recognition, in 2025 International Conference on Decision Aid Sciences and Applications (DASA), (Manama, Bahrain, 01-02 December 2025), IEEE, New York 2025: 1496-1500. [10.1109/dasa68193.2025.11498861] [https://hdl.handle.net/10807/337464]

Self-supervised learning model leveraging structural similarity index for glaucoma recognition

Frasca, Maria;Lin, Jianyi;La Torre, Davide

2025

Abstract

Glaucoma is a leading cause of irreversible blindness worldwide, affecting millions annually. Early diagnosis is crucial but challenging due to the asymptomatic nature of the disease and the complexity of clinical signs. Advanced ocular imaging generates complex, voluminous data, demanding significant expertise for interpretation. Artificial intelligence (AI) is transforming this field by enabling efficient data analysis. We present a comprehensive approach that combines advanced image preprocessing techniques, leveraging the Discrete Fourier Transform (DFT) implemented via the Fast Fourier Transform (FFT) algorithm, with state-of-the-art AI models to enhance diagnostic accuracy. Using the REFUGE2 dataset, this pre-processing achieved high visual fidelity, with SSIM values of 0.9225 for training and 0.8247 for testing. For classification, we employed Transformer architectures, including Masked Image Modeling (MIM) and Masked Autoencoder (MAE), integrated with Self-Supervised Learning (SimCLR). This multi-model approach yielded an accuracy of 90.06% with MIM and 90.14% with MAE on preprocessed images, surpassing the performance achieved on non-preprocessed data. The MIM model demonstrated a strong discriminative capability, highlighted by an AUC-ROC of 0.9298. Our findings show that combining robust preprocessing with Transformer networks significantly improves efficiency and diagnostic accuracy in the early detection of glaucoma.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2025
			
	Lingua del contenuto
	
				Inglese
			
	Titolo del volume che raccoglie gli atti
	
				2025 International Conference on Decision Aid Sciences and Applications (DASA)
			
	Denominazione evento
	
				2025 International Conference on Decision Aid Sciences and Applications (DASA)
			
	Luogo dell'evento
	
				Manama, Bahrain
			
	Data inizio evento
	
				1-dic-2025
			
	Data fine evento
	
				2-dic-2025
			
	ISBN del volume
	
				979-8-3315-8859-5
			
	Editore
	
				IEEE
			
	DOI del contributo
	
				https://dx.doi.org/10.1109/dasa68193.2025.11498861
			
	Citazione
	
				Frasca, M., Lin, J., La Torre, D.,  Self-supervised learning model leveraging structural similarity index for glaucoma recognition, in 2025 International Conference on Decision Aid Sciences and Applications (DASA), (Manama, Bahrain,  01-02 December 2025), IEEE, New York 2025: 1496-1500. [10.1109/dasa68193.2025.11498861] [https://hdl.handle.net/10807/337464]
			
	Appare nelle tipologie:
	
				Atti di Convegno, Congresso, Giornate di studio, ecc., Workshop (in volume)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/337464

Citazioni

ND

ND

ND

social impact