Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model
The concentration series of PM<sub>2.5</sub> (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM<sub>2.5</sub> concentration prediction method based on a hybrid model of complete ensembl...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
MDPI AG
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/236884f9cfc947ae826f6af5349f0bbc |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:236884f9cfc947ae826f6af5349f0bbc |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:236884f9cfc947ae826f6af5349f0bbc2021-11-25T16:44:57ZAir Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model10.3390/atmos121114522073-4433https://doaj.org/article/236884f9cfc947ae826f6af5349f0bbc2021-11-01T00:00:00Zhttps://www.mdpi.com/2073-4433/12/11/1452https://doaj.org/toc/2073-4433The concentration series of PM<sub>2.5</sub> (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM<sub>2.5</sub> concentration prediction method based on a hybrid model of complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and bi-directional long short-term memory (BiLSTM). The new method was applied to predict the same kind of particulate pollutant PM<sub>10</sub> and heterogeneous gas pollutant O<sub>3</sub>, proving that the prediction method has strong generalization ability. First, CEEMDAN was used to decompose PM<sub>2.5</sub> concentrations at different frequencies. Then, the fuzzy entropy (FE) value of each decomposed wave was calculated, and the near waves were combined by K-means clustering to generate the input sequence. Finally, the combined sequences were put into the BiLSTM model with multiple hidden layers for training. We predicted the PM<sub>2.5</sub> concentrations of Seoul Station 116 by the hour, with values of the root mean square error (<i>RMSE</i>), the mean absolute error (<i>MAE</i>), and the symmetric mean absolute percentage error (<i>SMAPE</i>) as low as 2.74, 1.90, and 13.59%, respectively, and an <i>R</i><sup>2</sup> value as high as 96.34%. The “CEEMDAN-FE” decomposition-merging technology proposed in this paper can effectively reduce the instability and high volatility of the original data, overcome data noise, and significantly improve the model’s performance in predicting the real-time concentrations of PM<sub>2.5</sub>.Xuchu JiangPeiyao WeiYiwen LuoYing LiMDPI AGarticlePM<sub>2.5</sub>PM<sub>10</sub>O<sub>3</sub>CEEMDANFEBiLSTMMeteorology. ClimatologyQC851-999ENAtmosphere, Vol 12, Iss 1452, p 1452 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
PM<sub>2.5</sub> PM<sub>10</sub> O<sub>3</sub> CEEMDAN FE BiLSTM Meteorology. Climatology QC851-999 |
spellingShingle |
PM<sub>2.5</sub> PM<sub>10</sub> O<sub>3</sub> CEEMDAN FE BiLSTM Meteorology. Climatology QC851-999 Xuchu Jiang Peiyao Wei Yiwen Luo Ying Li Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model |
description |
The concentration series of PM<sub>2.5</sub> (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM<sub>2.5</sub> concentration prediction method based on a hybrid model of complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and bi-directional long short-term memory (BiLSTM). The new method was applied to predict the same kind of particulate pollutant PM<sub>10</sub> and heterogeneous gas pollutant O<sub>3</sub>, proving that the prediction method has strong generalization ability. First, CEEMDAN was used to decompose PM<sub>2.5</sub> concentrations at different frequencies. Then, the fuzzy entropy (FE) value of each decomposed wave was calculated, and the near waves were combined by K-means clustering to generate the input sequence. Finally, the combined sequences were put into the BiLSTM model with multiple hidden layers for training. We predicted the PM<sub>2.5</sub> concentrations of Seoul Station 116 by the hour, with values of the root mean square error (<i>RMSE</i>), the mean absolute error (<i>MAE</i>), and the symmetric mean absolute percentage error (<i>SMAPE</i>) as low as 2.74, 1.90, and 13.59%, respectively, and an <i>R</i><sup>2</sup> value as high as 96.34%. The “CEEMDAN-FE” decomposition-merging technology proposed in this paper can effectively reduce the instability and high volatility of the original data, overcome data noise, and significantly improve the model’s performance in predicting the real-time concentrations of PM<sub>2.5</sub>. |
format |
article |
author |
Xuchu Jiang Peiyao Wei Yiwen Luo Ying Li |
author_facet |
Xuchu Jiang Peiyao Wei Yiwen Luo Ying Li |
author_sort |
Xuchu Jiang |
title |
Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model |
title_short |
Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model |
title_full |
Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model |
title_fullStr |
Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model |
title_full_unstemmed |
Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model |
title_sort |
air pollutant concentration prediction based on a ceemdan-fe-bilstm model |
publisher |
MDPI AG |
publishDate |
2021 |
url |
https://doaj.org/article/236884f9cfc947ae826f6af5349f0bbc |
work_keys_str_mv |
AT xuchujiang airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel AT peiyaowei airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel AT yiwenluo airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel AT yingli airpollutantconcentrationpredictionbasedonaceemdanfebilstmmodel |
_version_ |
1718413005580926976 |