A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest

Remote sensing technology is becoming mainstream for mapping the growing stem volume (GSV) and overcoming the shortage of traditional labor-consumed approaches. Naturally, the GSV estimation accuracy utilizing remote sensing imagery is highly related to the variable selection methods and algorithms....

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Xiaodong Xu, Hui Lin, Zhaohua Liu, Zilin Ye, Xinyu Li, Jiangping Long
Formato: article
Lenguaje:EN
Publicado: MDPI AG 2021
Materias:
Q
Acceso en línea:https://doaj.org/article/362c77df3cf240a9b98af9821f064afd
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:362c77df3cf240a9b98af9821f064afd
record_format dspace
spelling oai:doaj.org-article:362c77df3cf240a9b98af9821f064afd2021-11-25T18:55:00ZA Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest10.3390/rs132246312072-4292https://doaj.org/article/362c77df3cf240a9b98af9821f064afd2021-11-01T00:00:00Zhttps://www.mdpi.com/2072-4292/13/22/4631https://doaj.org/toc/2072-4292Remote sensing technology is becoming mainstream for mapping the growing stem volume (GSV) and overcoming the shortage of traditional labor-consumed approaches. Naturally, the GSV estimation accuracy utilizing remote sensing imagery is highly related to the variable selection methods and algorithms. Thus, to reduce the uncertainty caused by variables and models, this paper proposes a combined strategy involving improved variable selection with the collinearity test and the secondary ensemble algorithm to obtain the optimally combined variables and extract a reliable GSV from several base models. Our study extracted four types of alternative variables from the Sentinel-1A and Sentinel-2A image datasets, including vegetation indices, spectral reflectance variables, backscattering coefficients, and texture features. Then, an improved variable selection criterion with the collinearity test was developed and evaluated based on machine learning algorithms (classification and regression trees (CART), k-nearest neighbors (KNN), support vector regression (SVR), and artificial neural network (ANN)) considering the correlation between variables and GSV (with random forest (RF), distance correlation coefficient (DC), maximal information coefficient (MIC), and Pearson correlation coefficient (PCC) as evaluation metrics), and the collinearity among the variables. Additionally, we proposed a secondary ensemble with an improved weighted average approach (IWA) to estimate the reliable forest GSV using the first ensemble models constructed by Bagging and AdaBoost. The experimental results demonstrated that the proposed variable selection criterion efficiently obtained the optimal combined variable set without affecting the forest GSV mapping accuracy. Specifically, considering the first ensemble, the relative root mean square error (rRMSE) values ranged from 21.91% to 30.28% for Bagging and 23.33% to 31.49% for AdaBoost, respectively. After the secondary ensemble involving the IWA, the rRMSE values ranged from 18.89% to 21.34%. Furthermore, the variance of the GSV mapped by the secondary ensemble with various ranking methods was significantly reduced. The results prove that the proposed combined strategy has great potential to reduce the GSV mapping uncertainty imposed by current variable selection approaches and algorithms.Xiaodong XuHui LinZhaohua LiuZilin YeXinyu LiJiangping LongMDPI AGarticlegrowing stem volumesentinelvariable selectionensemble algorithmScienceQENRemote Sensing, Vol 13, Iss 4631, p 4631 (2021)
institution DOAJ
collection DOAJ
language EN
topic growing stem volume
sentinel
variable selection
ensemble algorithm
Science
Q
spellingShingle growing stem volume
sentinel
variable selection
ensemble algorithm
Science
Q
Xiaodong Xu
Hui Lin
Zhaohua Liu
Zilin Ye
Xinyu Li
Jiangping Long
A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest
description Remote sensing technology is becoming mainstream for mapping the growing stem volume (GSV) and overcoming the shortage of traditional labor-consumed approaches. Naturally, the GSV estimation accuracy utilizing remote sensing imagery is highly related to the variable selection methods and algorithms. Thus, to reduce the uncertainty caused by variables and models, this paper proposes a combined strategy involving improved variable selection with the collinearity test and the secondary ensemble algorithm to obtain the optimally combined variables and extract a reliable GSV from several base models. Our study extracted four types of alternative variables from the Sentinel-1A and Sentinel-2A image datasets, including vegetation indices, spectral reflectance variables, backscattering coefficients, and texture features. Then, an improved variable selection criterion with the collinearity test was developed and evaluated based on machine learning algorithms (classification and regression trees (CART), k-nearest neighbors (KNN), support vector regression (SVR), and artificial neural network (ANN)) considering the correlation between variables and GSV (with random forest (RF), distance correlation coefficient (DC), maximal information coefficient (MIC), and Pearson correlation coefficient (PCC) as evaluation metrics), and the collinearity among the variables. Additionally, we proposed a secondary ensemble with an improved weighted average approach (IWA) to estimate the reliable forest GSV using the first ensemble models constructed by Bagging and AdaBoost. The experimental results demonstrated that the proposed variable selection criterion efficiently obtained the optimal combined variable set without affecting the forest GSV mapping accuracy. Specifically, considering the first ensemble, the relative root mean square error (rRMSE) values ranged from 21.91% to 30.28% for Bagging and 23.33% to 31.49% for AdaBoost, respectively. After the secondary ensemble involving the IWA, the rRMSE values ranged from 18.89% to 21.34%. Furthermore, the variance of the GSV mapped by the secondary ensemble with various ranking methods was significantly reduced. The results prove that the proposed combined strategy has great potential to reduce the GSV mapping uncertainty imposed by current variable selection approaches and algorithms.
format article
author Xiaodong Xu
Hui Lin
Zhaohua Liu
Zilin Ye
Xinyu Li
Jiangping Long
author_facet Xiaodong Xu
Hui Lin
Zhaohua Liu
Zilin Ye
Xinyu Li
Jiangping Long
author_sort Xiaodong Xu
title A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest
title_short A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest
title_full A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest
title_fullStr A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest
title_full_unstemmed A Combined Strategy of Improved Variable Selection and Ensemble Algorithm to Map the Growing Stem Volume of Planted Coniferous Forest
title_sort combined strategy of improved variable selection and ensemble algorithm to map the growing stem volume of planted coniferous forest
publisher MDPI AG
publishDate 2021
url https://doaj.org/article/362c77df3cf240a9b98af9821f064afd
work_keys_str_mv AT xiaodongxu acombinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT huilin acombinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT zhaohualiu acombinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT zilinye acombinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT xinyuli acombinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT jiangpinglong acombinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT xiaodongxu combinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT huilin combinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT zhaohualiu combinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT zilinye combinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT xinyuli combinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
AT jiangpinglong combinedstrategyofimprovedvariableselectionandensemblealgorithmtomapthegrowingstemvolumeofplantedconiferousforest
_version_ 1718410515945881600