基于PCA-BP神经网络的PM2.5季节性预测方法研究

张怡文, 郭傲东, 吴海龙, 袁宏武, 董云春

南京林业大学学报(自然科学版) ›› 2020, Vol. 44 ›› Issue (5) : 231-238.

PDF(1849 KB)
PDF(1849 KB)
南京林业大学学报(自然科学版) ›› 2020, Vol. 44 ›› Issue (5) : 231-238. DOI: 10.3969/j.issn.1000-2006.201806011
研究论文

基于PCA-BP神经网络的PM2.5季节性预测方法研究

作者信息 +

Seasonal prediction of PM2.5 based on the PCA-BP neural network

Author information +
文章历史 +

摘要

【目的】分季节预测PM2.5浓度值,利用PCA方法对数据进行降维,分析季节及气象因素对PM2.5的影响,在提高预测准确率的同时降低时间复杂度。【方法】以合肥市2014—2017年的PM10、SO2、CO2、CO、O3浓度值,以及同时段的气象因素值,对PM2.5浓度进行预测。数据分析中发现PM2.5在不同季节浓度差异较大,故本研究选择分季节进行预测;为了提高预测准确率,加入如风力、温度、湿度、气压等气象因素进行预测,同时采用主成分分析(PCA)的方法进行数据降维,将降维后的数据再输入BP神经网络模型进行预测。【结果】实验采用3组实验进行对比:5种污染物指标(PM2.5-5)预测PM2.5、加入气象因素的综合12项指标(PM2.5-12)预测PM2.5、对综合指标进行PCA处理后的(PM2.5-PCA)预测PM2.5。实验结果表明:4个季节的PM2.5浓度值有较大变化,均方根误差(RMSE)的差值较大;采用PM2.5-PCA的方法,在任何季节的RMSE均有降低,相关系数(r)均有所提高。【结论】PM2.5浓度具有季节性特征,采用季节性预测方法可以提高预测准确率;同时采用PCA方法进行降维,可以在保证准确率的同时降低预测时间复杂度。

Abstract

【Objective】 We predict the concentration of PM2.5 during different seasons and use the Principal Component Analysis (PCA) method to reduce the dimensionality of the data, while also improving the accuracy of the prediction and reducing the time complexity, to serve as a reference for travel by people and government decision-making. 【Method】 The PM2.5 concentration was forecast based on the values of PM10, SO2, CO2, CO and O3 concentration in Hefei from 2014 to 2017, and the meteorological factors during the same period. The data analysis found that the concentration of PM2.5 varies greatly across seasons; therefore, this study is focused on the forecasting during different seasons. To improve the accuracy of forecasting, influencing meteorological factors such as wind, temperature, humidity and air pressure were added to the forecasting. The PCA method was used for data dimensionality reduction, and then, the data were input into the BP neural network model for prediction. 【Result】 The experiment used three groups of assessments for comparison: five types of pollutant indicators to predict PM2.5 (PM2.5-5), addition of twelve comprehensive indicators of meteorological factors to predict PM2.5 (PM2.5-12), and use of comprehensive indicators which were processed by PCA (PM2.5-PCA) to predict PM2.5. The experimental results showed that the PM2.5 concentration in the four seasons had large changes, and the difference in Root Mean Square Error(RMSE) is large, when using the PM2.5-PCA method. We found that if the RMSE is reduced in any season, the correlation coefficient (r) value is increased. 【Conclusion】 The PM2.5 concentration value has seasonal characteristics, and the seasonal prediction method can improve prediction accuracy. Moreover, adopting the PCA method to reduce data dimensionality can ensure accuracy and decrease the time complexity at the same time.

关键词

PM2.5 / 神经网络 / 预测 / 主成分分析

Key words

PM2.5 / neural networks / prediction / principal component analysis(PCA)

引用本文

导出引用
张怡文, 郭傲东, 吴海龙, . 基于PCA-BP神经网络的PM2.5季节性预测方法研究[J]. 南京林业大学学报(自然科学版). 2020, 44(5): 231-238 https://doi.org/10.3969/j.issn.1000-2006.201806011
ZHANG Yiwen, GUO Aodong, WU Hailong, et al. Seasonal prediction of PM2.5 based on the PCA-BP neural network[J]. JOURNAL OF NANJING FORESTRY UNIVERSITY. 2020, 44(5): 231-238 https://doi.org/10.3969/j.issn.1000-2006.201806011
中图分类号: X831   

参考文献

[1]
李令军, 王占山, 张大伟 , 等. 2013—2014年北京大气重污染特征研究[J]. 中国环境科学, 2016,36(1):27-35.
LI L J, WANG Z S, ZHANG D W , et al. Analysis of heavy air pollution episodes in Beijing during 20132014[J]. China Environ Sci, 2016,36(1):27-35.DOI: 10.3969/j.issn.1000-6923.2016.01.005.
[2]
MAKKONEN U, HELLÉN H, ANTTILA P , et al. Size distribution and chemical composition of airborne particles in south-eastern Finland during different seasons and wildfire episodes in 2006[J]. Sci Total Environ, 2010,408(3):644-651. DOI: 10.1016/j.scitotenv.2009.10.050.
The inorganic main elements, trace elements and PAHs were determined from selected PM(1), PM(2.5) and PM(10) samples collected at the Nordic background station in Virolahti during different seasons and during the wildfire episodes in 2006. Submicron particles are those most harmful to human beings, as they are able to penetrate deep into the human respiratory system and may cause severe health effects. About 70-80%, of the toxic trace elements, like lead, cadmium, arsenic and nickel, as well as PAH compounds, were found in particles smaller than 1 microm. Furthermore, the main part of the copper, zinc, and vanadium was associated with submicron particles. In practice, all the PAHs found in PM(10) were actually in PM(2.5). For PAHs and trace elements, it is more beneficial to analyse the PM(2.5) or even the PM(1) fraction instead of PM(10), because exclusion of the large particles reduces the need for sample cleaning to minimize the matrix effects during the analysis. During the wildfire episodes, the concentrations of particles smaller than 2.5 microm, as well as those of submicron particles, increased, and also the ratio PM(1)/PM(10) increased to about 50%. On the fire days, the mean potassium concentration was higher in all particle fractions, but ammonium and nitrate concentrations rose only in particles smaller than 1.0 microm. PAH concentrations rose even to the same level as in winter.
[3]
杨孝文, 周颖, 程水源 , 等. 北京冬季一次重污染过程的污染特征及成因分析[J]. 中国环境科学, 2016,36(3):679-686.
YANG X W, ZHOU Y, CHENG S Y , et al. Characteristics and formation mechanism of a heavy winter air pollution event in Beijing[J]. China Environ Sci, 2016,36(3):679-686.DOI: 10.3969/j.issn.1000-6923.2016.03.007.
[4]
张晓茹, 孔少飞, 银燕 , 等. 亚青会期间南京大气PM2.5中重金属来源及风险[J]. 中国环境科学, 2016,36(1):1-11.
ZHANG X R, KONG S F, YIN Y , et al. Sources and risk assessment of heavy metals in ambient PM2.5 during Youth Asian Game period in Nanjing[J]. China Environ Sci, 2016,36(1):1-11.DOI: 10.3969/j.issn.1000-6923.2016.01.001.
[5]
刘小生, 李胜, 赵相博 . 基于基因表达式编程的PM2.5浓度预测模型研究[J]. 江西理工大学学报, 2013,34(5):1-5.
LIU X S, LI S, ZHAO X B . A study on the prediction model of PM2.5 concentration based on gene expression programming[J]. J Jiangxi Univ Sci Technol, 2013,34(5):1-5.DOI: 10.13265/j.cnki.jxlgdxxb.2013.05.019.
[6]
DUNEAD, POHOATA A, IORDACHE S . Using wavelet-feedforward neural networks to improve air pollution forecasting in urban environments[J]. Environ Monit Assess, 2015,187(7):1-16. DOI: 10.1007/s10661-015-4697-x.
[7]
彭斯俊, 沈加超, 朱雪 . 基于ARIMA模型的PM2.5预测[J]. 安全与环境工程, 2014,21(6):125-128.
PENG S J, SHEN J C, ZHU X . Forecast of PM2.5 based on the ARIMA model[J]. Saf Environ Eng, 2014,21(6):125-128.DOI: 10.13578/j.cnki.issn.1671-1556.2014.06.023.
[8]
贺祥, 林振山 . 基于GAM模型分析影响因素交互作用对PM2.5浓度变化的影响[J]. 环境科学, 2017,38(1):22-32.
HE X, LIN Z S . Interactive effects of the influencing factors on the changes of PM2.5 concentration based on GAM model[J]. Environ Sci, 2017,38(1):22-32.DOI: 10.13227/j.hjkx.201606061.
[9]
刘杰, 杨鹏, 吕文生 , 等. 基于气象因素的PM2.5质量浓度预测模型[J]. 山东大学学报(工学版), 2015,45(6):76-83.
LIU J, YANG P, LV W S , et al. Prediction models of PM2.5 mass concentration based on meteorological factors[J]. J Shandong Univ (Eng Sci), 2015,45(6):76-83.DOI: 10.6040/j.issn.1672-3961.0.2014.214.
[10]
COBOURN W G . An enhanced PM2.5 air quality forecast model based on nonlinear regression and back-trajectory concentrations[J]. Atmos Environ, 2010,44(25):3015-3023.DOI: 10.1016/j.atmosenv.2010.05.009.
[11]
沈剑波, 雷相东, 李玉堂 , 等. 基于BP神经网络的长白落叶松人工林林分平均高预测[J]. 南京林业大学学报(自然科学版), 2018,42(2):147-154.
SHEN J B, LEI X D, LI Y T , et al. Prediction mean height for Larix olgensis plantation based on Bayesian-regularization BP neural network[J]. J Nanjing For Univ (Nat Sci Ed), 2018,42(2):147-154.DOI: 10.3969/j.issn.1000-2006.201706012.
[12]
王嫣然, 张学霞, 赵静瑶 , 等. 北京地区不同季节PM2.5和PM10浓度对地面气象因素的响应[J]. 中国环境监测, 2017,33(2):34-41.
WANG Y R, ZHANG X X, ZHAO J Y , et al. Study on the response of PM2.5 and PM10 concentrations to the ground meteorological conditions in different seasons in Beijing[J]. Environ Monit China, 2017,33(2):34-41. DOI: 10.19316/j.issn.1002-6002.2017.02.06.
[13]
姚达文, 刘永红, 丁卉 , 等. 气象参数对基于BP神经网络的PM2.5日均值预报模型的影响[J]. 安全与环境学报, 2015,15(6):324-328.
YAO D W, LIU Y H, DING H , et al. Effect of meteorological parameters on the PM2.5 daily concentration forecasting model based on the BP neural network[J]. J Saf Environ, 2015,15(6):324-328.DOI: 10.13637/j.issn.1009-6094.2015.06.067.
[14]
杨笑笑, 汤莉莉, 张运江 , 等. 南京夏季市区VOCs特征及O3生成潜势的相关性分析[J]. 环境科学, 2016,37(2):443-451.
YANG X X, TANG L L, ZHANG Y J , et al. Correlation analysis between characteristics of VOCs and ozone formation potential in summer in Nanjing urban district[J]. Environ Sci, 2016,37(2):443-451.DOI: 10.13227/j.hjkx.2016.02.006.
[15]
陈刚, 刘佳媛, 皇甫延琦 , 等. 合肥城区PM10及PM2.5季节污染特征及来源解析[J]. 中国环境科学, 2016,36(7):1938-1946.
CHEN G, LIU J Y, HUANGPU Y Q , et al. Seasonal variations and source apportionment of ambient PM10 and PM2.5 at urban area of Hefei,China[J]. China Environ Sci, 2016,36(7):1938-1946.DOI: 10.3969/j.issn.1000-6923.2016.07.003
[16]
ZHOU F N, PARK J H, LIU Y J . Differential feature based hierarchical PCA fault detection method for dynamic fault[J]. Neurocomputing, 2016,202:27-35.DOI: 10.1016/j.neucom.2016.03.007.
[17]
REN T, LIU S, MU H P , et al. Temperature prediction of the molten salt collector tube using BP neural network[J]. IET Renew Power Gener, 2016,10(2):212-220.DOI: 10.1049/iet-rpg.2015.0065.
[18]
肖小兵, 刘宏立, 马子骥 . 基于奇异谱分析的经验模态分解去噪方法[J]. 计算机工程与科学, 2017,39(5):919-924.
XIAO X B, LIU H L, MA Z J . An empirical mode decomposition de-noising method based on singular spectrum analysis[J]. Comput Eng Sci, 2017,39(5):919-924.DOI: 10.3969/j.issn.1007-130X.2017.05.015.
[19]
XIEX, LAM K M . Gabor-based kernel PCA with doubly nonlinear mapping for face recognition with a single face image[J]. IEEE Trans Image Process, 2006,15(9):2481-2492. DOI: 10.1109/TIP.2006.877435.
In this paper, a novel Gabor-based kernel principal component analysis (PCA) with doubly nonlinear mapping is proposed for human face recognition. In our approach, the Gabor wavelets are used to extract facial features, then a doubly nonlinear mapping kernel PCA (DKPCA) is proposed to perform feature transformation and face recognition. The conventional kernel PCA nonlinearly maps an input image into a high-dimensional feature space in order to make the mapped features linearly separable. However, this method does not consider the structural characteristics of the face images, and it is difficult to determine which nonlinear mapping is more effective for face recognition. In this paper, a new method of nonlinear mapping, which is performed in the original feature space, is defined. The proposed nonlinear mapping not only considers the statistical property of the input features, but also adopts an eigenmask to emphasize those important facial feature points. Therefore, after this mapping, the transformed features have a higher discriminating power, and the relative importance of the features adapts to the spatial importance of the face images. This new nonlinear mapping is combined with the conventional kernel PCA to be called
[20]
关蓓蓓, 郑思俊, 崔心红 . 城市人工林空气负离子变化特征及其主要影响因子[J]. 南京林业大学学报(自然科学版), 2016,40(1):73-79.
GUAN B B, ZHENG S J, CUI X H . The variation and main influencing factors of negative air ions in urban plantation[J]. J Nanjing For Univ (Nat Sci Ed), 2016,40(1):73-79.DOI: 10.3969/j.issn.1000-2006.2016.01.012.
[21]
污染物数据采集[EB /OL].http://www.pm25.in/.2013-08-06/2018-2-20.
[22]
气数据采集[EB /OL].http://www.weather.com.cn/.2008-05-16/2018-5-10.

基金

安徽省高校优秀青年人才支持计划项目(gxyqZD2018087)
安徽省高校自然科学重点项目(KJ2019A0877)

编辑: 刘昌来

版权

版权所有,未经授权,不得转载、摘编本刊文章,不得使用本刊的版式设计。
PDF(1849 KB)

Accesses

Citation

Detail

段落导航
相关文章

/