Glaucoma is a leading cause of irreversible blindness worldwide, affecting millions annually. Early diagnosis is crucial but challenging due to the asymptomatic nature of the disease and the complexity of clinical signs. Advanced ocular imaging generates complex, voluminous data, demanding significant expertise for interpretation. Artificial intelligence (AI) is transforming this field by enabling efficient data analysis. We present a comprehensive approach that combines advanced image preprocessing techniques, leveraging the Discrete Fourier Transform (DFT) implemented via the Fast Fourier Transform (FFT) algorithm, with state-of-the-art AI models to enhance diagnostic accuracy. Using the REFUGE2 dataset, this pre-processing achieved high visual fidelity, with SSIM values of 0.9225 for training and 0.8247 for testing. For classification, we employed Transformer architectures, including Masked Image Modeling (MIM) and Masked Autoencoder (MAE), integrated with Self-Supervised Learning (SimCLR). This multi-model approach yielded an accuracy of 90.06% with MIM and 90.14% with MAE on preprocessed images, surpassing the performance achieved on non-preprocessed data. The MIM model demonstrated a strong discriminative capability, highlighted by an AUC-ROC of 0.9298. Our findings show that combining robust preprocessing with Transformer networks significantly improves efficiency and diagnostic accuracy in the early detection of glaucoma.

Frasca, M., Lin, J., La Torre, D., Self-supervised learning model leveraging structural similarity index for glaucoma recognition, in 2025 International Conference on Decision Aid Sciences and Applications (DASA), (Manama, Bahrain, 01-02 December 2025), IEEE, New York 2025: 1496-1500. [10.1109/dasa68193.2025.11498861] [https://hdl.handle.net/10807/337464]

Self-supervised learning model leveraging structural similarity index for glaucoma recognition

Lin, Jianyi;
2025

Abstract

Glaucoma is a leading cause of irreversible blindness worldwide, affecting millions annually. Early diagnosis is crucial but challenging due to the asymptomatic nature of the disease and the complexity of clinical signs. Advanced ocular imaging generates complex, voluminous data, demanding significant expertise for interpretation. Artificial intelligence (AI) is transforming this field by enabling efficient data analysis. We present a comprehensive approach that combines advanced image preprocessing techniques, leveraging the Discrete Fourier Transform (DFT) implemented via the Fast Fourier Transform (FFT) algorithm, with state-of-the-art AI models to enhance diagnostic accuracy. Using the REFUGE2 dataset, this pre-processing achieved high visual fidelity, with SSIM values of 0.9225 for training and 0.8247 for testing. For classification, we employed Transformer architectures, including Masked Image Modeling (MIM) and Masked Autoencoder (MAE), integrated with Self-Supervised Learning (SimCLR). This multi-model approach yielded an accuracy of 90.06% with MIM and 90.14% with MAE on preprocessed images, surpassing the performance achieved on non-preprocessed data. The MIM model demonstrated a strong discriminative capability, highlighted by an AUC-ROC of 0.9298. Our findings show that combining robust preprocessing with Transformer networks significantly improves efficiency and diagnostic accuracy in the early detection of glaucoma.
2025
Inglese
2025 International Conference on Decision Aid Sciences and Applications (DASA)
2025 International Conference on Decision Aid Sciences and Applications (DASA)
Manama, Bahrain
1-dic-2025
2-dic-2025
979-8-3315-8859-5
IEEE
Frasca, M., Lin, J., La Torre, D., Self-supervised learning model leveraging structural similarity index for glaucoma recognition, in 2025 International Conference on Decision Aid Sciences and Applications (DASA), (Manama, Bahrain, 01-02 December 2025), IEEE, New York 2025: 1496-1500. [10.1109/dasa68193.2025.11498861] [https://hdl.handle.net/10807/337464]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/337464
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact