Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping
Digital soil mapping approaches related to soil organic matter (SOM) are crucial to quantify the process of the carbon cycle in terrestrial ecosystems and thus, can better manage soil fertility. Recently, many studies have compared machine learning (ML) models with traditional statistical models in...
Guardado en:
Autores principales: | , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/56f5059ed42145c7b13d73dc757c01df |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:56f5059ed42145c7b13d73dc757c01df |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:56f5059ed42145c7b13d73dc757c01df2021-12-01T04:57:15ZIncorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping1470-160X10.1016/j.ecolind.2021.107975https://doaj.org/article/56f5059ed42145c7b13d73dc757c01df2021-10-01T00:00:00Zhttp://www.sciencedirect.com/science/article/pii/S1470160X21006403https://doaj.org/toc/1470-160XDigital soil mapping approaches related to soil organic matter (SOM) are crucial to quantify the process of the carbon cycle in terrestrial ecosystems and thus, can better manage soil fertility. Recently, many studies have compared machine learning (ML) models with traditional statistical models in digital soil mapping. However, few studies focused on the application of hybrid models that combine ML with statistical models to map SOM content, especially in loess areas, which have a complicated geomorphologic landscape. In this study, the trend prediction used two ML models, i.e., gradient boosting modeling and random forest (RF), and a traditional stepwise multiple linear regression plus interpolated residuals generated from two classic geostatistical models, i.e., ordinary kriging and inverse distance weighting, and a high accuracy surface modeling (HASM) were implemented to map SOM content in the Dongzhi Loess Tableland area of China. A total of 145 topsoil samples and heterogeneous environmental variables were collected to develop the hybrid models. Results showed that 18 variables related to soil properties, climate variables, terrain attributes, vegetation indices, and location attributes played an important role in SOM mapping. The models that incorporate ML algorithms and interpolated residuals to predict SOM variation were found to have a better ability to handle complex environment relationships. The HASM model outperformed traditional geostatistical models in interpolating the residuals. In contrast, RF combined with HASM residuals (RF_HASM) gave the best performance, with the lowest mean absolute error (1.69 g/kg), root mean square error (2.30 g/kg), and the highest coefficient of determination (0.57) and concordance correlation coefficient (0.69) values. Moreover, the spatial distribution pattern obtained with RF_HASM yielded a spatial distribution of SOM that better fit the actual distribution pattern of the study area. In conclusion, these results suggest that RF_HASM is particularly capable of improving the mapping accuracy of SOM content at the regional scale.Zong WangZhengping DuXiaoyan LiZhengyi BaoNa ZhaoTianxiang YueElsevierarticleSoil organic matterDongzhi Loess TablelandGradient boosting modelingRandom forestHigh accuracy surface modelingEcologyQH540-549.5ENEcological Indicators, Vol 129, Iss , Pp 107975- (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Soil organic matter Dongzhi Loess Tableland Gradient boosting modeling Random forest High accuracy surface modeling Ecology QH540-549.5 |
spellingShingle |
Soil organic matter Dongzhi Loess Tableland Gradient boosting modeling Random forest High accuracy surface modeling Ecology QH540-549.5 Zong Wang Zhengping Du Xiaoyan Li Zhengyi Bao Na Zhao Tianxiang Yue Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
description |
Digital soil mapping approaches related to soil organic matter (SOM) are crucial to quantify the process of the carbon cycle in terrestrial ecosystems and thus, can better manage soil fertility. Recently, many studies have compared machine learning (ML) models with traditional statistical models in digital soil mapping. However, few studies focused on the application of hybrid models that combine ML with statistical models to map SOM content, especially in loess areas, which have a complicated geomorphologic landscape. In this study, the trend prediction used two ML models, i.e., gradient boosting modeling and random forest (RF), and a traditional stepwise multiple linear regression plus interpolated residuals generated from two classic geostatistical models, i.e., ordinary kriging and inverse distance weighting, and a high accuracy surface modeling (HASM) were implemented to map SOM content in the Dongzhi Loess Tableland area of China. A total of 145 topsoil samples and heterogeneous environmental variables were collected to develop the hybrid models. Results showed that 18 variables related to soil properties, climate variables, terrain attributes, vegetation indices, and location attributes played an important role in SOM mapping. The models that incorporate ML algorithms and interpolated residuals to predict SOM variation were found to have a better ability to handle complex environment relationships. The HASM model outperformed traditional geostatistical models in interpolating the residuals. In contrast, RF combined with HASM residuals (RF_HASM) gave the best performance, with the lowest mean absolute error (1.69 g/kg), root mean square error (2.30 g/kg), and the highest coefficient of determination (0.57) and concordance correlation coefficient (0.69) values. Moreover, the spatial distribution pattern obtained with RF_HASM yielded a spatial distribution of SOM that better fit the actual distribution pattern of the study area. In conclusion, these results suggest that RF_HASM is particularly capable of improving the mapping accuracy of SOM content at the regional scale. |
format |
article |
author |
Zong Wang Zhengping Du Xiaoyan Li Zhengyi Bao Na Zhao Tianxiang Yue |
author_facet |
Zong Wang Zhengping Du Xiaoyan Li Zhengyi Bao Na Zhao Tianxiang Yue |
author_sort |
Zong Wang |
title |
Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
title_short |
Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
title_full |
Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
title_fullStr |
Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
title_full_unstemmed |
Incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
title_sort |
incorporation of high accuracy surface modeling into machine learning to improve soil organic matter mapping |
publisher |
Elsevier |
publishDate |
2021 |
url |
https://doaj.org/article/56f5059ed42145c7b13d73dc757c01df |
work_keys_str_mv |
AT zongwang incorporationofhighaccuracysurfacemodelingintomachinelearningtoimprovesoilorganicmattermapping AT zhengpingdu incorporationofhighaccuracysurfacemodelingintomachinelearningtoimprovesoilorganicmattermapping AT xiaoyanli incorporationofhighaccuracysurfacemodelingintomachinelearningtoimprovesoilorganicmattermapping AT zhengyibao incorporationofhighaccuracysurfacemodelingintomachinelearningtoimprovesoilorganicmattermapping AT nazhao incorporationofhighaccuracysurfacemodelingintomachinelearningtoimprovesoilorganicmattermapping AT tianxiangyue incorporationofhighaccuracysurfacemodelingintomachinelearningtoimprovesoilorganicmattermapping |
_version_ |
1718405661226696704 |