Seasonal prediction of PM2.5 based on the PCA-BP neural network

ZHANG Yiwen, GUO Aodong, WU Hailong, YUAN Hongwu, DONG Yunchun

JOURNAL OF NANJING FORESTRY UNIVERSITY ›› 2020, Vol. 44 ›› Issue (5) : 231-238.

PDF(1849 KB)
PDF(1849 KB)
JOURNAL OF NANJING FORESTRY UNIVERSITY ›› 2020, Vol. 44 ›› Issue (5) : 231-238. DOI: 10.3969/j.issn.1000-2006.201806011

Seasonal prediction of PM2.5 based on the PCA-BP neural network

Author information +
History +

Abstract

【Objective】 We predict the concentration of PM2.5 during different seasons and use the Principal Component Analysis (PCA) method to reduce the dimensionality of the data, while also improving the accuracy of the prediction and reducing the time complexity, to serve as a reference for travel by people and government decision-making. 【Method】 The PM2.5 concentration was forecast based on the values of PM10, SO2, CO2, CO and O3 concentration in Hefei from 2014 to 2017, and the meteorological factors during the same period. The data analysis found that the concentration of PM2.5 varies greatly across seasons; therefore, this study is focused on the forecasting during different seasons. To improve the accuracy of forecasting, influencing meteorological factors such as wind, temperature, humidity and air pressure were added to the forecasting. The PCA method was used for data dimensionality reduction, and then, the data were input into the BP neural network model for prediction. 【Result】 The experiment used three groups of assessments for comparison: five types of pollutant indicators to predict PM2.5 (PM2.5-5), addition of twelve comprehensive indicators of meteorological factors to predict PM2.5 (PM2.5-12), and use of comprehensive indicators which were processed by PCA (PM2.5-PCA) to predict PM2.5. The experimental results showed that the PM2.5 concentration in the four seasons had large changes, and the difference in Root Mean Square Error(RMSE) is large, when using the PM2.5-PCA method. We found that if the RMSE is reduced in any season, the correlation coefficient (r) value is increased. 【Conclusion】 The PM2.5 concentration value has seasonal characteristics, and the seasonal prediction method can improve prediction accuracy. Moreover, adopting the PCA method to reduce data dimensionality can ensure accuracy and decrease the time complexity at the same time.

Key words

PM2.5 / neural networks / prediction / principal component analysis(PCA)

Cite this article

Download Citations
ZHANG Yiwen , GUO Aodong , WU Hailong , et al . Seasonal prediction of PM2.5 based on the PCA-BP neural network[J]. JOURNAL OF NANJING FORESTRY UNIVERSITY. 2020, 44(5): 231-238 https://doi.org/10.3969/j.issn.1000-2006.201806011

References

[1]
李令军, 王占山, 张大伟 , 等. 2013—2014年北京大气重污染特征研究[J]. 中国环境科学, 2016,36(1):27-35.
LI L J, WANG Z S, ZHANG D W , et al. Analysis of heavy air pollution episodes in Beijing during 20132014[J]. China Environ Sci, 2016,36(1):27-35.DOI: 10.3969/j.issn.1000-6923.2016.01.005.
[2]
MAKKONEN U, HELLÉN H, ANTTILA P , et al. Size distribution and chemical composition of airborne particles in south-eastern Finland during different seasons and wildfire episodes in 2006[J]. Sci Total Environ, 2010,408(3):644-651. DOI: 10.1016/j.scitotenv.2009.10.050.
The inorganic main elements, trace elements and PAHs were determined from selected PM(1), PM(2.5) and PM(10) samples collected at the Nordic background station in Virolahti during different seasons and during the wildfire episodes in 2006. Submicron particles are those most harmful to human beings, as they are able to penetrate deep into the human respiratory system and may cause severe health effects. About 70-80%, of the toxic trace elements, like lead, cadmium, arsenic and nickel, as well as PAH compounds, were found in particles smaller than 1 microm. Furthermore, the main part of the copper, zinc, and vanadium was associated with submicron particles. In practice, all the PAHs found in PM(10) were actually in PM(2.5). For PAHs and trace elements, it is more beneficial to analyse the PM(2.5) or even the PM(1) fraction instead of PM(10), because exclusion of the large particles reduces the need for sample cleaning to minimize the matrix effects during the analysis. During the wildfire episodes, the concentrations of particles smaller than 2.5 microm, as well as those of submicron particles, increased, and also the ratio PM(1)/PM(10) increased to about 50%. On the fire days, the mean potassium concentration was higher in all particle fractions, but ammonium and nitrate concentrations rose only in particles smaller than 1.0 microm. PAH concentrations rose even to the same level as in winter.
[3]
杨孝文, 周颖, 程水源 , 等. 北京冬季一次重污染过程的污染特征及成因分析[J]. 中国环境科学, 2016,36(3):679-686.
YANG X W, ZHOU Y, CHENG S Y , et al. Characteristics and formation mechanism of a heavy winter air pollution event in Beijing[J]. China Environ Sci, 2016,36(3):679-686.DOI: 10.3969/j.issn.1000-6923.2016.03.007.
[4]
张晓茹, 孔少飞, 银燕 , 等. 亚青会期间南京大气PM2.5中重金属来源及风险[J]. 中国环境科学, 2016,36(1):1-11.
ZHANG X R, KONG S F, YIN Y , et al. Sources and risk assessment of heavy metals in ambient PM2.5 during Youth Asian Game period in Nanjing[J]. China Environ Sci, 2016,36(1):1-11.DOI: 10.3969/j.issn.1000-6923.2016.01.001.
[5]
刘小生, 李胜, 赵相博 . 基于基因表达式编程的PM2.5浓度预测模型研究[J]. 江西理工大学学报, 2013,34(5):1-5.
LIU X S, LI S, ZHAO X B . A study on the prediction model of PM2.5 concentration based on gene expression programming[J]. J Jiangxi Univ Sci Technol, 2013,34(5):1-5.DOI: 10.13265/j.cnki.jxlgdxxb.2013.05.019.
[6]
DUNEAD, POHOATA A, IORDACHE S . Using wavelet-feedforward neural networks to improve air pollution forecasting in urban environments[J]. Environ Monit Assess, 2015,187(7):1-16. DOI: 10.1007/s10661-015-4697-x.
[7]
彭斯俊, 沈加超, 朱雪 . 基于ARIMA模型的PM2.5预测[J]. 安全与环境工程, 2014,21(6):125-128.
PENG S J, SHEN J C, ZHU X . Forecast of PM2.5 based on the ARIMA model[J]. Saf Environ Eng, 2014,21(6):125-128.DOI: 10.13578/j.cnki.issn.1671-1556.2014.06.023.
[8]
贺祥, 林振山 . 基于GAM模型分析影响因素交互作用对PM2.5浓度变化的影响[J]. 环境科学, 2017,38(1):22-32.
HE X, LIN Z S . Interactive effects of the influencing factors on the changes of PM2.5 concentration based on GAM model[J]. Environ Sci, 2017,38(1):22-32.DOI: 10.13227/j.hjkx.201606061.
[9]
刘杰, 杨鹏, 吕文生 , 等. 基于气象因素的PM2.5质量浓度预测模型[J]. 山东大学学报(工学版), 2015,45(6):76-83.
LIU J, YANG P, LV W S , et al. Prediction models of PM2.5 mass concentration based on meteorological factors[J]. J Shandong Univ (Eng Sci), 2015,45(6):76-83.DOI: 10.6040/j.issn.1672-3961.0.2014.214.
[10]
COBOURN W G . An enhanced PM2.5 air quality forecast model based on nonlinear regression and back-trajectory concentrations[J]. Atmos Environ, 2010,44(25):3015-3023.DOI: 10.1016/j.atmosenv.2010.05.009.
[11]
沈剑波, 雷相东, 李玉堂 , 等. 基于BP神经网络的长白落叶松人工林林分平均高预测[J]. 南京林业大学学报(自然科学版), 2018,42(2):147-154.
SHEN J B, LEI X D, LI Y T , et al. Prediction mean height for Larix olgensis plantation based on Bayesian-regularization BP neural network[J]. J Nanjing For Univ (Nat Sci Ed), 2018,42(2):147-154.DOI: 10.3969/j.issn.1000-2006.201706012.
[12]
王嫣然, 张学霞, 赵静瑶 , 等. 北京地区不同季节PM2.5和PM10浓度对地面气象因素的响应[J]. 中国环境监测, 2017,33(2):34-41.
WANG Y R, ZHANG X X, ZHAO J Y , et al. Study on the response of PM2.5 and PM10 concentrations to the ground meteorological conditions in different seasons in Beijing[J]. Environ Monit China, 2017,33(2):34-41. DOI: 10.19316/j.issn.1002-6002.2017.02.06.
[13]
姚达文, 刘永红, 丁卉 , 等. 气象参数对基于BP神经网络的PM2.5日均值预报模型的影响[J]. 安全与环境学报, 2015,15(6):324-328.
YAO D W, LIU Y H, DING H , et al. Effect of meteorological parameters on the PM2.5 daily concentration forecasting model based on the BP neural network[J]. J Saf Environ, 2015,15(6):324-328.DOI: 10.13637/j.issn.1009-6094.2015.06.067.
[14]
杨笑笑, 汤莉莉, 张运江 , 等. 南京夏季市区VOCs特征及O3生成潜势的相关性分析[J]. 环境科学, 2016,37(2):443-451.
YANG X X, TANG L L, ZHANG Y J , et al. Correlation analysis between characteristics of VOCs and ozone formation potential in summer in Nanjing urban district[J]. Environ Sci, 2016,37(2):443-451.DOI: 10.13227/j.hjkx.2016.02.006.
[15]
陈刚, 刘佳媛, 皇甫延琦 , 等. 合肥城区PM10及PM2.5季节污染特征及来源解析[J]. 中国环境科学, 2016,36(7):1938-1946.
CHEN G, LIU J Y, HUANGPU Y Q , et al. Seasonal variations and source apportionment of ambient PM10 and PM2.5 at urban area of Hefei,China[J]. China Environ Sci, 2016,36(7):1938-1946.DOI: 10.3969/j.issn.1000-6923.2016.07.003
[16]
ZHOU F N, PARK J H, LIU Y J . Differential feature based hierarchical PCA fault detection method for dynamic fault[J]. Neurocomputing, 2016,202:27-35.DOI: 10.1016/j.neucom.2016.03.007.
[17]
REN T, LIU S, MU H P , et al. Temperature prediction of the molten salt collector tube using BP neural network[J]. IET Renew Power Gener, 2016,10(2):212-220.DOI: 10.1049/iet-rpg.2015.0065.
[18]
肖小兵, 刘宏立, 马子骥 . 基于奇异谱分析的经验模态分解去噪方法[J]. 计算机工程与科学, 2017,39(5):919-924.
XIAO X B, LIU H L, MA Z J . An empirical mode decomposition de-noising method based on singular spectrum analysis[J]. Comput Eng Sci, 2017,39(5):919-924.DOI: 10.3969/j.issn.1007-130X.2017.05.015.
[19]
XIEX, LAM K M . Gabor-based kernel PCA with doubly nonlinear mapping for face recognition with a single face image[J]. IEEE Trans Image Process, 2006,15(9):2481-2492. DOI: 10.1109/TIP.2006.877435.
In this paper, a novel Gabor-based kernel principal component analysis (PCA) with doubly nonlinear mapping is proposed for human face recognition. In our approach, the Gabor wavelets are used to extract facial features, then a doubly nonlinear mapping kernel PCA (DKPCA) is proposed to perform feature transformation and face recognition. The conventional kernel PCA nonlinearly maps an input image into a high-dimensional feature space in order to make the mapped features linearly separable. However, this method does not consider the structural characteristics of the face images, and it is difficult to determine which nonlinear mapping is more effective for face recognition. In this paper, a new method of nonlinear mapping, which is performed in the original feature space, is defined. The proposed nonlinear mapping not only considers the statistical property of the input features, but also adopts an eigenmask to emphasize those important facial feature points. Therefore, after this mapping, the transformed features have a higher discriminating power, and the relative importance of the features adapts to the spatial importance of the face images. This new nonlinear mapping is combined with the conventional kernel PCA to be called
[20]
关蓓蓓, 郑思俊, 崔心红 . 城市人工林空气负离子变化特征及其主要影响因子[J]. 南京林业大学学报(自然科学版), 2016,40(1):73-79.
GUAN B B, ZHENG S J, CUI X H . The variation and main influencing factors of negative air ions in urban plantation[J]. J Nanjing For Univ (Nat Sci Ed), 2016,40(1):73-79.DOI: 10.3969/j.issn.1000-2006.2016.01.012.
[21]
污染物数据采集[EB /OL].http://www.pm25.in/.2013-08-06/2018-2-20.
[22]
气数据采集[EB /OL].http://www.weather.com.cn/.2008-05-16/2018-5-10.

RIGHTS & PERMISSIONS

Copyright reserved © 2020.
PDF(1849 KB)

Accesses

Citation

Detail

Sections
Recommended
The full text is translated into English by AI, aiming to facilitate reading and comprehension. The core content is subject to the explanation in Chinese.

/