This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features: center dot Produces a 128 x 7 x 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 x 4 kernels and 128 and 64 filters for upsampling. center dot Uses 3 x 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.

Rana, S., Gatti, M., Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain, <<METHODSX (AMSTERDAM)>>, 2025; 14 (N/A): 1-15. [doi:10.1016/j.mex.2025.103309] [https://hdl.handle.net/10807/313073]

Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain

Rana, Shubham
Primo
;
Gatti, Matteo
Ultimo
2025

Abstract

This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features: center dot Produces a 128 x 7 x 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 x 4 kernels and 128 and 64 filters for upsampling. center dot Uses 3 x 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.
2025
Inglese
Rana, S., Gatti, M., Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain, <<METHODSX (AMSTERDAM)>>, 2025; 14 (N/A): 1-15. [doi:10.1016/j.mex.2025.103309] [https://hdl.handle.net/10807/313073]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S2215016125001554-main.pdf

accesso aperto

Tipologia file ?: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 7.17 MB
Formato Adobe PDF
7.17 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/313073
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact