Accurate pre-harvest prediction of yield and quality (tenderometric reading, TR) is crucial for the processing pea industry due to a narrow optimal harvest window. Machine learning (ML) models offer potential, but their real-world utility depends on their performance stability across different years. This ability, known as temporal transferability, is often overestimated by standard validation methods, creating a gap between theoretical performance and operational reliability. This study aims to rigorously quantify this temporal transferability gap for both pea yield and TR prediction. Four ML algorithms (RF, XGBoost, GPR, SVMr) were evaluated using Sentinel-2 and ERA5-Land data from 2018 to 2024 in northern Italy. A comparison was made between a standard group-based cross-validation (LOGOCV) and a temporally rigorous Leave-One-Year-Out Cross-Validation (LOYOCV). For yield prediction, ML models outperformed a baseline (NullModel) under LOGOCV (SVMr nRMSE = 18.4 %), but performance degraded significantly under LOYOCV, revealing a clear transferability gap. TR prediction was more challenging while RF showed promising results in LOGOCV (nRMSE = 22.1 %), all ML models were outperformed by the NullModel in the more realistic LOYOCV scenario. The findings highlight a critical temporal transferability gap, especially for the TR parameter, limiting the current operational readiness of standard ML models. It is recommended that future work focus on more robust approaches, such as models designed for temporal data (e.g., RNNs, Transformers) and higher-resolution data, to bridge the gap towards reliable real-world application.

Croci, M., Ragazzi, M., Grassi, A., Impollonia, G., Amaducci, S., Assessing the temporal transferability of machine learning models for predicting processing pea yield and quality using Sentinel-2 and ERA5-land data, <<SMART AGRICULTURAL TECHNOLOGY>>, 2025; 12 (1): 1-20. [doi:10.1016/j.atech.2025.101207] [https://hdl.handle.net/10807/327716]

Assessing the temporal transferability of machine learning models for predicting processing pea yield and quality using Sentinel-2 and ERA5-land data

Croci, Michele
;
Ragazzi, Manuele;Impollonia, Giorgio;Amaducci, Stefano
2025

Abstract

Accurate pre-harvest prediction of yield and quality (tenderometric reading, TR) is crucial for the processing pea industry due to a narrow optimal harvest window. Machine learning (ML) models offer potential, but their real-world utility depends on their performance stability across different years. This ability, known as temporal transferability, is often overestimated by standard validation methods, creating a gap between theoretical performance and operational reliability. This study aims to rigorously quantify this temporal transferability gap for both pea yield and TR prediction. Four ML algorithms (RF, XGBoost, GPR, SVMr) were evaluated using Sentinel-2 and ERA5-Land data from 2018 to 2024 in northern Italy. A comparison was made between a standard group-based cross-validation (LOGOCV) and a temporally rigorous Leave-One-Year-Out Cross-Validation (LOYOCV). For yield prediction, ML models outperformed a baseline (NullModel) under LOGOCV (SVMr nRMSE = 18.4 %), but performance degraded significantly under LOYOCV, revealing a clear transferability gap. TR prediction was more challenging while RF showed promising results in LOGOCV (nRMSE = 22.1 %), all ML models were outperformed by the NullModel in the more realistic LOYOCV scenario. The findings highlight a critical temporal transferability gap, especially for the TR parameter, limiting the current operational readiness of standard ML models. It is recommended that future work focus on more robust approaches, such as models designed for temporal data (e.g., RNNs, Transformers) and higher-resolution data, to bridge the gap towards reliable real-world application.
2025
Inglese
Croci, M., Ragazzi, M., Grassi, A., Impollonia, G., Amaducci, S., Assessing the temporal transferability of machine learning models for predicting processing pea yield and quality using Sentinel-2 and ERA5-land data, <<SMART AGRICULTURAL TECHNOLOGY>>, 2025; 12 (1): 1-20. [doi:10.1016/j.atech.2025.101207] [https://hdl.handle.net/10807/327716]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S2772375525004381-main.pdf

accesso aperto

Tipologia file ?: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 5.31 MB
Formato Adobe PDF
5.31 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/327716
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
social impact