Knowledge Resource Center for Ecological Environment in Arid Area
DOI | 10.3390/w14223647 |
Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments | |
Attia, Ahmed; Govind, Ajit; Qureshi, Asad Sarwar; Feike, Til; Rizk, Mosa Sayed; Shabana, Mahmoud M. A.; Kheir, Ahmed M. S. | |
Corresponding author | Kheir, AMS |
Source journal | WATER |
EISSN | 2073-4441 |
Publication year | 2022 |
Volume | 14 Issue: 22 |
Abstract | Crop yield prediction is critical for investigating the yield gap and potential adaptations to environmental and management factors in arid regions. Crop models (CMs) are powerful tools for predicting yield and water use, but they still have some limitations and uncertainties; therefore, combining them with machine learning algorithms (MLs) could improve predictions and reduce uncertainty. To that end, the DSSAT-CERES-Maize model was calibrated in one location and validated in others across Egypt with varying agro-climatic zones. Following that, the dynamic model (CERES-Maize) was used for long-term simulation (1990-2020) of maize grain yield (GY) and evapotranspiration (ET) under a wide range of management and environmental factors. Detailed outputs from three growing seasons of field experiments in Egypt, as well as CERES-Maize outputs, were used to train and test six machine learning algorithms (linear regression, ridge regression, lasso regression, K-nearest neighbors, random forest, and XGBoost), resulting in more than 1.5 million simulated yield and evapotranspiration scenarios. Seven warming years (i.e., 1991, 1998, 2002, 2005, 2010, 2013, and 2020) were chosen from a 31-year dataset to test MLs, while the remaining 23 years were used to train the models. The Ensemble model (super learner) and XGBoost outperform other models in predicting GY and ET for maize, as evidenced by R² values greater than 0.82 and RRMSE less than 9%. The broad range of management practices, when averaged across all locations and 31 years of simulation, not only reduced the hazard impact of environmental factors but also increased GY and reduced ET. 
Moving beyond prediction and interpreting the outputs from Lasso and XGBoost, and using global and local SHAP values, we found that the most important features for predicting GY and ET are maximum temperature, minimum temperature, available water content, soil organic carbon, irrigation, cultivars, soil texture, solar radiation, and planting date. Determining the most important features is critical for assisting farmers and agronomists in prioritizing such features over other factors in order to increase yield and resource efficiency values. The combination of CMs and ML algorithms is a powerful tool for predicting yield and water use in arid regions, which are particularly vulnerable to climate change and water scarcity. |
Keywords | DSSAT models ; random forest ; XGBoost ; super learner ; lasso regression ; hyperparameters tuning ; water use ; feature importance |
Type | Article |
Language | English |
Open access type | Green Published, gold |
Indexed in | SCI-E |
WOS accession number | WOS:000887803800001 |
WOS keywords | WATER-USE EFFICIENCY ; FOOD SECURITY ; WHEAT GROWTH |
WOS categories | Environmental Sciences ; Water Resources |
WOS research areas | Environmental Sciences & Ecology ; Water Resources |
Resource type | Journal article |
Item identifier | http://119.78.100.177/qdio/handle/2XILL650/394853 |
Recommended citation (GB/T 7714) | Attia, Ahmed,Govind, Ajit,Qureshi, Asad Sarwar,et al. Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments[J],2022,14(22). |
APA | Attia, Ahmed.,Govind, Ajit.,Qureshi, Asad Sarwar.,Feike, Til.,Rizk, Mosa Sayed.,...&Kheir, Ahmed M. S..(2022).Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments.WATER,14(22). |
MLA | Attia, Ahmed,et al."Coupling Process-Based Models and Machine Learning Algorithms for Predicting Yield and Evapotranspiration of Maize in Arid Environments".WATER 14.22(2022). |
Files in this item | No files are associated with this item. |
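
The abstract describes holding out seven named years for testing and scoring models with R² and RRMSE. The sketch below illustrates one way such a year-based split and the two metrics could be computed. It is not the authors' code: the function names are hypothetical, and the RRMSE definition (RMSE as a percentage of the observed mean) is an assumption, since the record does not define it.

```python
import numpy as np

# Test years as listed in the abstract; every other year goes to training.
TEST_YEARS = (1991, 1998, 2002, 2005, 2010, 2013, 2020)


def split_by_year(years, test_years=TEST_YEARS):
    """Return boolean masks (train, test) for an array of simulation years."""
    years = np.asarray(years)
    test_mask = np.isin(years, test_years)
    return ~test_mask, test_mask


def r2_score(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    ss_res = np.sum((observed - predicted) ** 2)
    ss_tot = np.sum((observed - observed.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


def rrmse_percent(observed, predicted):
    """RMSE as a percentage of the observed mean (assumed definition)."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    rmse = np.sqrt(np.mean((observed - predicted) ** 2))
    return 100.0 * rmse / observed.mean()
```

Splitting by whole years rather than by random rows keeps all scenarios from a held-out season out of the training set, so the test scores reflect performance on weather the model has not seen.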