Rana, S., Gatti, M., Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain, MethodsX (Amsterdam), 2025; 14: 1-15. [doi:10.1016/j.mex.2025.103309] [https://hdl.handle.net/10807/313073]
Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain
Rana, Shubham (first author); Gatti, Matteo (last author)
2025
Abstract
This study investigates the use of a modified Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets that meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, which degrades data quality. Customizations to WGAN-GP improved synthetic image quality, especially in maintaining the structural similarity index (SSIM) for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, as reflected in lower SSIM scores. Architectural enhancements, including transposed convolutions, dropout, and selective batch normalization, improved SSIM scores from 0.5364 to 0.6615 for RGB images and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features:
• Produces a 128 × 7 × 7 tensor and optimizes feature map size for subsequent layers, with two upsampling layers using 4 × 4 kernels and 128 and 64 filters, respectively.
• Uses 3 × 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.

File | Size | Format |
---|---|---|
1-s2.0-S2215016125001554-main.pdf (open access; file type: publisher's version (PDF); license: Creative Commons) | 7.17 MB | Adobe PDF |
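The abstract describes a 128 × 7 × 7 tensor that is upsampled by two transposed-convolution layers with 4 × 4 kernels and 128 and 64 filters. The sketch below traces how the feature map shape could evolve through those two layers. The stride of 2 and padding of 1 are assumptions (the abstract does not state them), and the function names are hypothetical, chosen only for illustration; this is not the authors' implementation.

```python
def conv_transpose_out(size, kernel, stride, padding, output_padding=0):
    """Output length of a transposed convolution along one spatial axis.

    Standard relation (dilation = 1):
    out = (in - 1) * stride - 2 * padding + kernel + output_padding
    """
    return (size - 1) * stride - 2 * padding + kernel + output_padding


def trace_generator_shapes(start=(128, 7, 7), filters=(128, 64),
                           kernel=4, stride=2, padding=1):
    """Return the (channels, height, width) shape after each upsampling layer.

    start   : initial tensor shape, taken from the abstract (128 x 7 x 7)
    filters : output channels of the two layers, taken from the abstract
    kernel  : kernel size, taken from the abstract (4 x 4)
    stride, padding : ASSUMED values, not given in the abstract
    """
    _, height, width = start
    shapes = [start]
    for out_channels in filters:
        height = conv_transpose_out(height, kernel, stride, padding)
        width = conv_transpose_out(width, kernel, stride, padding)
        shapes.append((out_channels, height, width))
    return shapes


if __name__ == "__main__":
    for shape in trace_generator_shapes():
        print(shape)
```

Under these assumed settings each layer doubles the spatial resolution, so the 7 × 7 map grows to 14 × 14 and then 28 × 28. A different stride or padding would give different sizes, and the final image resolution used in the paper is not stated in the abstract.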
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.