Application of RR-XGBoost combined model in data calibration of micro air quality detector

Abstract Grid monitoring is the current development direction of atmospheric monitoring. The micro air quality detector is of great help to the grid monitoring of the atmosphere, so higher requirements are put forward for the accuracy of the micro air quality detector. This paper presents a model to...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Bing Liu, Xianghua Tan, Yueqiang Jin, Wangwang Yu, Chaoyang Li
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/d2ba713e260348619d7a2fed89eb14b9
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:d2ba713e260348619d7a2fed89eb14b9
record_format dspace
spelling oai:doaj.org-article:d2ba713e260348619d7a2fed89eb14b92021-12-02T14:53:34ZApplication of RR-XGBoost combined model in data calibration of micro air quality detector10.1038/s41598-021-95027-12045-2322https://doaj.org/article/d2ba713e260348619d7a2fed89eb14b92021-08-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-95027-1https://doaj.org/toc/2045-2322Abstract Grid monitoring is the current development direction of atmospheric monitoring. The micro air quality detector is of great help to the grid monitoring of the atmosphere, so higher requirements are put forward for the accuracy of the micro air quality detector. This paper presents a model to calibrate the measurement data of the micro air quality detector using the monitoring data of the air quality monitoring station. The concentration of six types of air pollutants is the research object of this study to establish a calibration model for the measurement data of the micro air quality detector. The first step is to use correlation analysis to find out the main factors affecting the concentration of the six types of pollutants. The second step uses Ridge Regression (RR) to select variables, find out the factors that have significant effects on the concentration of pollutants, and give the quantitative relationship between these factors and the pollutants. Finally, the predicted value of the ridge regression model and the measurement data of the micro air quality detector are used as input variables, and the Extreme Gradient Boosting (XGBoost) algorithm is used to give the final pollutant concentration prediction model. We named the combined model of ridge regression and XGBoost algorithm RR-XGBoost model. Relative Mean Absolute Percent Error (MAPE), Mean Absolute Error (MAE), goodness of fit (R 2), and Root Mean Square Error (RMSE) were used to evaluate the prediction accuracy of the RR-XGBoost model. The results show that the model is superior to some commonly used pollutant prediction methods such as random forest, support vector machine, and multilayer perceptron neural network in the evaluation of various indicators. The model not only has a good prediction effect on the training set but also on the test set, indicating that the model has good generalization ability. Using the RR-XGBoost model to calibrate the data of the micro air quality detector can make up for the shortcomings of the data monitoring accuracy of the micro air quality detector. The model plays an active role in the deployment of micro air quality detectors and grid monitoring of the atmosphere.Bing LiuXianghua TanYueqiang JinWangwang YuChaoyang LiNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-14 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Bing Liu
Xianghua Tan
Yueqiang Jin
Wangwang Yu
Chaoyang Li
Application of RR-XGBoost combined model in data calibration of micro air quality detector
description Abstract Grid monitoring is the current development direction of atmospheric monitoring. The micro air quality detector is of great help to the grid monitoring of the atmosphere, so higher requirements are put forward for the accuracy of the micro air quality detector. This paper presents a model to calibrate the measurement data of the micro air quality detector using the monitoring data of the air quality monitoring station. The concentration of six types of air pollutants is the research object of this study to establish a calibration model for the measurement data of the micro air quality detector. The first step is to use correlation analysis to find out the main factors affecting the concentration of the six types of pollutants. The second step uses Ridge Regression (RR) to select variables, find out the factors that have significant effects on the concentration of pollutants, and give the quantitative relationship between these factors and the pollutants. Finally, the predicted value of the ridge regression model and the measurement data of the micro air quality detector are used as input variables, and the Extreme Gradient Boosting (XGBoost) algorithm is used to give the final pollutant concentration prediction model. We named the combined model of ridge regression and XGBoost algorithm RR-XGBoost model. Relative Mean Absolute Percent Error (MAPE), Mean Absolute Error (MAE), goodness of fit (R 2), and Root Mean Square Error (RMSE) were used to evaluate the prediction accuracy of the RR-XGBoost model. The results show that the model is superior to some commonly used pollutant prediction methods such as random forest, support vector machine, and multilayer perceptron neural network in the evaluation of various indicators. The model not only has a good prediction effect on the training set but also on the test set, indicating that the model has good generalization ability. Using the RR-XGBoost model to calibrate the data of the micro air quality detector can make up for the shortcomings of the data monitoring accuracy of the micro air quality detector. The model plays an active role in the deployment of micro air quality detectors and grid monitoring of the atmosphere.
format article
author Bing Liu
Xianghua Tan
Yueqiang Jin
Wangwang Yu
Chaoyang Li
author_facet Bing Liu
Xianghua Tan
Yueqiang Jin
Wangwang Yu
Chaoyang Li
author_sort Bing Liu
title Application of RR-XGBoost combined model in data calibration of micro air quality detector
title_short Application of RR-XGBoost combined model in data calibration of micro air quality detector
title_full Application of RR-XGBoost combined model in data calibration of micro air quality detector
title_fullStr Application of RR-XGBoost combined model in data calibration of micro air quality detector
title_full_unstemmed Application of RR-XGBoost combined model in data calibration of micro air quality detector
title_sort application of rr-xgboost combined model in data calibration of micro air quality detector
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/d2ba713e260348619d7a2fed89eb14b9
work_keys_str_mv AT bingliu applicationofrrxgboostcombinedmodelindatacalibrationofmicroairqualitydetector
AT xianghuatan applicationofrrxgboostcombinedmodelindatacalibrationofmicroairqualitydetector
AT yueqiangjin applicationofrrxgboostcombinedmodelindatacalibrationofmicroairqualitydetector
AT wangwangyu applicationofrrxgboostcombinedmodelindatacalibrationofmicroairqualitydetector
AT chaoyangli applicationofrrxgboostcombinedmodelindatacalibrationofmicroairqualitydetector
_version_ 1718389477588598784