Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model

The concentration series of PM<sub>2.5</sub> (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM<sub>2.5</sub> concentration prediction method based on a hybrid model of complete ensembl...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Xuchu Jiang, Peiyao Wei, Yiwen Luo, Ying Li
Formato: article
Lenguaje:EN
Publicado: MDPI AG 2021
Materias:
FE
Acceso en línea:https://doaj.org/article/236884f9cfc947ae826f6af5349f0bbc
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:236884f9cfc947ae826f6af5349f0bbc
record_format dspace
spelling oai:doaj.org-article:236884f9cfc947ae826f6af5349f0bbc2021-11-25T16:44:57ZAir Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model10.3390/atmos121114522073-4433https://doaj.org/article/236884f9cfc947ae826f6af5349f0bbc2021-11-01T00:00:00Zhttps://www.mdpi.com/2073-4433/12/11/1452https://doaj.org/toc/2073-4433The concentration series of PM<sub>2.5</sub> (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM<sub>2.5</sub> concentration prediction method based on a hybrid model of complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and bi-directional long short-term memory (BiLSTM). The new method was applied to predict the same kind of particulate pollutant PM<sub>10</sub> and heterogeneous gas pollutant O<sub>3</sub>, proving that the prediction method has strong generalization ability. First, CEEMDAN was used to decompose PM<sub>2.5</sub> concentrations at different frequencies. Then, the fuzzy entropy (FE) value of each decomposed wave was calculated, and the near waves were combined by K-means clustering to generate the input sequence. Finally, the combined sequences were put into the BiLSTM model with multiple hidden layers for training. We predicted the PM<sub>2.5</sub> concentrations of Seoul Station 116 by the hour, with values of the root mean square error (<i>RMSE</i>), the mean absolute error (<i>MAE</i>), and the symmetric mean absolute percentage error (<i>SMAPE</i>) as low as 2.74, 1.90, and 13.59%, respectively, and an <i>R</i><sup>2</sup> value as high as 96.34%. The “CEEMDAN-FE” decomposition-merging technology proposed in this paper can effectively reduce the instability and high volatility of the original data, overcome data noise, and significantly improve the model’s performance in predicting the real-time concentrations of PM<sub>2.5</sub>.Xuchu JiangPeiyao WeiYiwen LuoYing LiMDPI AGarticlePM<sub>2.5</sub>PM<sub>10</sub>O<sub>3</sub>CEEMDANFEBiLSTMMeteorology. ClimatologyQC851-999ENAtmosphere, Vol 12, Iss 1452, p 1452 (2021)
institution DOAJ
collection DOAJ
language EN
topic PM<sub>2.5</sub>
PM<sub>10</sub>
O<sub>3</sub>
CEEMDAN
FE
BiLSTM
Meteorology. Climatology
QC851-999
spellingShingle PM<sub>2.5</sub>
PM<sub>10</sub>
O<sub>3</sub>
CEEMDAN
FE
BiLSTM
Meteorology. Climatology
QC851-999
Xuchu Jiang
Peiyao Wei
Yiwen Luo
Ying Li
Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
description The concentration series of PM<sub>2.5</sub> (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM<sub>2.5</sub> concentration prediction method based on a hybrid model of complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and bi-directional long short-term memory (BiLSTM). The new method was applied to predict the same kind of particulate pollutant PM<sub>10</sub> and heterogeneous gas pollutant O<sub>3</sub>, proving that the prediction method has strong generalization ability. First, CEEMDAN was used to decompose PM<sub>2.5</sub> concentrations at different frequencies. Then, the fuzzy entropy (FE) value of each decomposed wave was calculated, and the near waves were combined by K-means clustering to generate the input sequence. Finally, the combined sequences were put into the BiLSTM model with multiple hidden layers for training. We predicted the PM<sub>2.5</sub> concentrations of Seoul Station 116 by the hour, with values of the root mean square error (<i>RMSE</i>), the mean absolute error (<i>MAE</i>), and the symmetric mean absolute percentage error (<i>SMAPE</i>) as low as 2.74, 1.90, and 13.59%, respectively, and an <i>R</i><sup>2</sup> value as high as 96.34%. The “CEEMDAN-FE” decomposition-merging technology proposed in this paper can effectively reduce the instability and high volatility of the original data, overcome data noise, and significantly improve the model’s performance in predicting the real-time concentrations of PM<sub>2.5</sub>.
format article
author Xuchu Jiang
Peiyao Wei
Yiwen Luo
Ying Li
author_facet Xuchu Jiang
Peiyao Wei
Yiwen Luo
Ying Li
author_sort Xuchu Jiang
title Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
title_short Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
title_full Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
title_fullStr Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
title_full_unstemmed Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
title_sort air pollutant concentration prediction based on a ceemdan-fe-bilstm model
publisher MDPI AG
publishDate 2021
url https://doaj.org/article/236884f9cfc947ae826f6af5349f0bbc
work_keys_str_mv AT xuchujiang airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel
AT peiyaowei airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel
AT yiwenluo airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel
AT yingli airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel
_version_ 1718413005580926976