Timely yield prediction is crucial for the agri-food supply chain as a whole. However, different stakeholders in the agri-food sector require different levels of accuracy and lead times in which a yield prediction should be available. For the producers, predictions during the growing season are essential to ensure that information is available early enough for the timely implementation of agronomic decisions, while industries can wait until later in the season to optimize their production process and increase their production traceability. In this study, we used machine learning algorithms, dynamic and static predictors, and a phenology approach to determine the time for issuing the yield prediction. In addition, the effect of data reduction was evaluated by comparing results obtained with and without principal component analysis (PCA). Gaussian process regression (GPR) was the best for predicting maize yield. Its best performance (nRMSE of 13.31%) was obtained late in the season and with the full set of predictors (vegetation indices, meteorological and soil predictors). In contrast, neural network (NNET) and support vector machines linear basis function (SVMl) achieved their best accuracy with only vegetation indices and at the tasseling phenological stage. Only slight differences in performance were observed between the algorithms considered, highlighting that the main factors influencing performance are the timing of the yield prediction and the predictors with which the machine learning algorithms are fed. Interestingly, PCA was instrumental in increasing the performances of NNET after this stage. An additional benefit of the application of PCA was the overall reduction between 12 and 30.20% in the standard deviation of the maize yield prediction performance from the leave one-year outer-loop cross-validation, depending on the feature set.

Croci, M., Impollonia, G., Meroni, M., Amaducci, S., Dynamic Maize Yield Predictions Using Machine Learning on Multi-Source Data, <<REMOTE SENSING>>, N/A; 15 (1): 100-100. [doi:10.3390/rs15010100] [https://hdl.handle.net/10807/230861]

Dynamic Maize Yield Predictions Using Machine Learning on Multi-Source Data

Croci, Michele
;
Impollonia, Giorgio;Amaducci, Stefano
2022

Abstract

Timely yield prediction is crucial for the agri-food supply chain as a whole. However, different stakeholders in the agri-food sector require different levels of accuracy and lead times in which a yield prediction should be available. For the producers, predictions during the growing season are essential to ensure that information is available early enough for the timely implementation of agronomic decisions, while industries can wait until later in the season to optimize their production process and increase their production traceability. In this study, we used machine learning algorithms, dynamic and static predictors, and a phenology approach to determine the time for issuing the yield prediction. In addition, the effect of data reduction was evaluated by comparing results obtained with and without principal component analysis (PCA). Gaussian process regression (GPR) was the best for predicting maize yield. Its best performance (nRMSE of 13.31%) was obtained late in the season and with the full set of predictors (vegetation indices, meteorological and soil predictors). In contrast, neural network (NNET) and support vector machines linear basis function (SVMl) achieved their best accuracy with only vegetation indices and at the tasseling phenological stage. Only slight differences in performance were observed between the algorithms considered, highlighting that the main factors influencing performance are the timing of the yield prediction and the predictors with which the machine learning algorithms are fed. Interestingly, PCA was instrumental in increasing the performances of NNET after this stage. An additional benefit of the application of PCA was the overall reduction between 12 and 30.20% in the standard deviation of the maize yield prediction performance from the leave one-year outer-loop cross-validation, depending on the feature set.
2022
Inglese
Croci, M., Impollonia, G., Meroni, M., Amaducci, S., Dynamic Maize Yield Predictions Using Machine Learning on Multi-Source Data, <<REMOTE SENSING>>, N/A; 15 (1): 100-100. [doi:10.3390/rs15010100] [https://hdl.handle.net/10807/230861]
File in questo prodotto:
File Dimensione Formato  
remotesensing-15-00100-v3.pdf

accesso aperto

Licenza: Creative commons
Dimensione 3.46 MB
Formato Adobe PDF
3.46 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/230861
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 14
social impact