A method to avoid spatial overfitting in estimation of grassland above-ground biomass on the Tibetan Plateau

Accurate assessments of grassland above-ground biomass (AGB) are crucial for the sustainable utilization and protection of grassland resources and the eco-environment. In this study, a random forest (RF) model combined with the forward feature selection (FFS) and leave-location-out cross-validation...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Hui Yu, Yufeng Wu, Liting Niu, Yafan Chai, Qisheng Feng, Wei Wang, Tiangang Liang
Formato: article
Lenguaje:EN
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://doaj.org/article/d1a70b3efee3483494c24b3188492737
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:Accurate assessments of grassland above-ground biomass (AGB) are crucial for the sustainable utilization and protection of grassland resources and the eco-environment. In this study, a random forest (RF) model combined with the forward feature selection (FFS) and leave-location-out cross-validation (LLO-CV) methods was trained to predict the dry weight (DW) of grassland AGB based on multiple factors. The final model exhibited a performance of R2 = 0.66, root mean square error (RMSE) of 503.86 kg DW/ha and mean absolute error (MAE) of 376.51 kg DW/ha. The spatial distribution of grassland AGB increased from northwest to southeast over the entire Tibetan Plateau (TP) from 2001 to 2018. Grassland AGB increased more than it decreased (70.6% vs 29.4%, respectively) during the study period. Using a combination of FFS and LLO-CV, spatial overfitting was reduced, and the predictive accuracy of the RF was improved, thus enhancing the ability to predict the AGB in unknown locations from training data. This study proposes a robust methodology with which to improve the transferability of machine learning algorithms to predict grassland AGB in unknown locations.