基于PCA-BP神经网络的PM2.5季节性预测方法研究

张怡文; 郭傲东; 吴海龙; 袁宏武; 董云春

doi:10.3969/j.issn.1000-2006.201806011

基于PCA-BP神经网络的PM_2.5季节性预测方法研究

张怡文, 郭傲东, 吴海龙, 袁宏武, 董云春

南京林业大学学报（自然科学版） ›› 2020, Vol. 44 ›› Issue (5) : 231-238.

PDF(1849 KB)

国家林草科技领军期刊
中国精品科技期刊
中国高校百佳科技期刊
江苏省新闻出版政府奖期刊奖
RCCSE林学权威期刊（A+）
CSCD核心期刊
Scopus数据库收录期刊
中文核心期刊
SCD核心期刊

作者加群：102861116

微信公众号：南京林业大学学报

高级检索

PDF(1849 KB)

南京林业大学学报（自然科学版） ›› 2020, Vol. 44 ›› Issue (5) : 231-238. DOI: 10.3969/j.issn.1000-2006.201806011

研究论文

基于PCA-BP神经网络的PM_2.5季节性预测方法研究

张怡文 ¹ ,
郭傲东 ² ,
吴海龙 ¹ ,
袁宏武 ¹ ,
董云春 ¹

作者信息 +

Seasonal prediction of PM_2.5 based on the PCA-BP neural network

Author information +

文章历史 +

摘要

【目的】分季节预测PM_2.5浓度值,利用PCA方法对数据进行降维,分析季节及气象因素对PM_2.5的影响,在提高预测准确率的同时降低时间复杂度。【方法】以合肥市2014—2017年的PM₁₀、SO₂、CO₂、CO、O₃浓度值,以及同时段的气象因素值,对PM_2.5浓度进行预测。数据分析中发现PM_2.5在不同季节浓度差异较大,故本研究选择分季节进行预测;为了提高预测准确率,加入如风力、温度、湿度、气压等气象因素进行预测,同时采用主成分分析(PCA)的方法进行数据降维,将降维后的数据再输入BP神经网络模型进行预测。【结果】实验采用3组实验进行对比:5种污染物指标(PM_2.5-5)预测PM_2.5、加入气象因素的综合12项指标(PM_2.5-12)预测PM_2.5、对综合指标进行PCA处理后的(PM_2.5-PCA)预测PM_2.5。实验结果表明:4个季节的PM_2.5浓度值有较大变化,均方根误差(RMSE)的差值较大;采用PM_2.5-PCA的方法,在任何季节的RMSE均有降低,相关系数(r)均有所提高。【结论】PM_2.5浓度具有季节性特征,采用季节性预测方法可以提高预测准确率;同时采用PCA方法进行降维,可以在保证准确率的同时降低预测时间复杂度。

Abstract

【Objective】 We predict the concentration of PM_2.5 during different seasons and use the Principal Component Analysis (PCA) method to reduce the dimensionality of the data, while also improving the accuracy of the prediction and reducing the time complexity, to serve as a reference for travel by people and government decision-making. 【Method】 The PM_2.5 concentration was forecast based on the values of PM₁₀, SO₂, CO₂, CO and O₃ concentration in Hefei from 2014 to 2017, and the meteorological factors during the same period. The data analysis found that the concentration of PM_2.5 varies greatly across seasons; therefore, this study is focused on the forecasting during different seasons. To improve the accuracy of forecasting, influencing meteorological factors such as wind, temperature, humidity and air pressure were added to the forecasting. The PCA method was used for data dimensionality reduction, and then, the data were input into the BP neural network model for prediction. 【Result】 The experiment used three groups of assessments for comparison: five types of pollutant indicators to predict PM_2.5 (PM_2.5-5), addition of twelve comprehensive indicators of meteorological factors to predict PM_2.5 (PM_2.5-12), and use of comprehensive indicators which were processed by PCA (PM_2.5-PCA) to predict PM_2.5. The experimental results showed that the PM_2.5 concentration in the four seasons had large changes, and the difference in Root Mean Square Error(RMSE) is large, when using the PM_2.5-PCA method. We found that if the RMSE is reduced in any season, the correlation coefficient (r) value is increased. 【Conclusion】 The PM_2.5 concentration value has seasonal characteristics, and the seasonal prediction method can improve prediction accuracy. Moreover, adopting the PCA method to reduce data dimensionality can ensure accuracy and decrease the time complexity at the same time.

导出引用

张怡文, 郭傲东, 吴海龙, 等. 基于PCA-BP神经网络的PM_2.5季节性预测方法研究[J]. 南京林业大学学报（自然科学版）. 2020, 44(5): 231-238 https://doi.org/10.3969/j.issn.1000-2006.201806011

ZHANG Yiwen, GUO Aodong, WU Hailong, et al. Seasonal prediction of PM_2.5 based on the PCA-BP neural network[J]. Journal of Nanjing Forestry University (Natural Sciences Edition）. 2020, 44(5): 231-238 https://doi.org/10.3969/j.issn.1000-2006.201806011

中图分类号： X831

参考文献

列表( 原文顺序 | 文献年度倒序 | 文中引用次数倒序 ) 可视化分析

[1]	李令军, 王占山, 张大伟 , 等. 2013—2014年北京大气重污染特征研究[J]. 中国环境科学, 2016,36(1):27-35. LI L J, WANG Z S, ZHANG D W , et al. Analysis of heavy air pollution episodes in Beijing during 20132014[J]. China Environ Sci, 2016,36(1):27-35.DOI: 10.3969/j.issn.1000-6923.2016.01.005. 本文引用 [1]

[2]

MAKKONEN

, HELLÉN

, ANTTILA

, et al. Size distribution and chemical composition of airborne particles in south-eastern Finland during different seasons and wildfire episodes in 2006[J]. Sci Total Environ, 2010,408(3):644-651. DOI: 10.1016/j.scitotenv.2009.10.050.

https://doi.org/10.1016/j.scitotenv.2009.10.050

https://www.ncbi.nlm.nih.gov/pubmed/19903567

本文引用 [1] 摘要

The inorganic main elements, trace elements and PAHs were determined from selected PM(1), PM(2.5) and PM(10) samples collected at the Nordic background station in Virolahti during different seasons and during the wildfire episodes in 2006. Submicron particles are those most harmful to human beings, as they are able to penetrate deep into the human respiratory system and may cause severe health effects. About 70-80%, of the toxic trace elements, like lead, cadmium, arsenic and nickel, as well as PAH compounds, were found in particles smaller than 1 microm. Furthermore, the main part of the copper, zinc, and vanadium was associated with submicron particles. In practice, all the PAHs found in PM(10) were actually in PM(2.5). For PAHs and trace elements, it is more beneficial to analyse the PM(2.5) or even the PM(1) fraction instead of PM(10), because exclusion of the large particles reduces the need for sample cleaning to minimize the matrix effects during the analysis. During the wildfire episodes, the concentrations of particles smaller than 2.5 microm, as well as those of submicron particles, increased, and also the ratio PM(1)/PM(10) increased to about 50%. On the fire days, the mean potassium concentration was higher in all particle fractions, but ammonium and nitrate concentrations rose only in particles smaller than 1.0 microm. PAH concentrations rose even to the same level as in winter.

[3]	杨孝文, 周颖, 程水源 , 等. 北京冬季一次重污染过程的污染特征及成因分析[J]. 中国环境科学, 2016,36(3):679-686. YANG X W, ZHOU Y, CHENG S Y , et al. Characteristics and formation mechanism of a heavy winter air pollution event in Beijing[J]. China Environ Sci, 2016,36(3):679-686.DOI: 10.3969/j.issn.1000-6923.2016.03.007. 本文引用 [1]

[4]

张晓茹, 孔少飞, 银燕 , 等. 亚青会期间南京大气PM2.5中重金属来源及风险[J]. 中国环境科学, 2016,36(1):1-11.

ZHANG X

, KONG S

, YIN

, et al. Sources and risk assessment of heavy metals in ambient PM2.5 during Youth Asian Game period in Nanjing[J]. China Environ Sci, 2016,36(1):1-11.DOI: 10.3969/j.issn.1000-6923.2016.01.001.

本文引用 [1]

[5]	刘小生, 李胜, 赵相博 . 基于基因表达式编程的PM_2.5浓度预测模型研究[J]. 江西理工大学学报, 2013,34(5):1-5. LIU X S, LI S, ZHAO X B . A study on the prediction model of PM_2.5 concentration based on gene expression programming[J]. J Jiangxi Univ Sci Technol, 2013,34(5):1-5.DOI: 10.13265/j.cnki.jxlgdxxb.2013.05.019. 本文引用 [1]

[6]	DUNEAD, POHOATA A, IORDACHE S . Using wavelet-feedforward neural networks to improve air pollution forecasting in urban environments[J]. Environ Monit Assess, 2015,187(7):1-16. DOI: 10.1007/s10661-015-4697-x. 本文引用 [1]

[7]	彭斯俊, 沈加超, 朱雪 . 基于ARIMA模型的PM_2.5预测[J]. 安全与环境工程, 2014,21(6):125-128. PENG S J, SHEN J C, ZHU X . Forecast of PM_2.5 based on the ARIMA model[J]. Saf Environ Eng, 2014,21(6):125-128.DOI: 10.13578/j.cnki.issn.1671-1556.2014.06.023. 本文引用 [1]

[8]	贺祥, 林振山 . 基于GAM模型分析影响因素交互作用对PM_2.5浓度变化的影响[J]. 环境科学, 2017,38(1):22-32. HE X, LIN Z S . Interactive effects of the influencing factors on the changes of PM_2.5 concentration based on GAM model[J]. Environ Sci, 2017,38(1):22-32.DOI: 10.13227/j.hjkx.201606061. 本文引用 [1]

[9]	刘杰, 杨鹏, 吕文生 , 等. 基于气象因素的PM_2.5质量浓度预测模型[J]. 山东大学学报(工学版), 2015,45(6):76-83. LIU J, YANG P, LV W S , et al. Prediction models of PM_2.5 mass concentration based on meteorological factors[J]. J Shandong Univ (Eng Sci), 2015,45(6):76-83.DOI: 10.6040/j.issn.1672-3961.0.2014.214. 本文引用 [1]

[10]	COBOURN W G . An enhanced PM_2.5 air quality forecast model based on nonlinear regression and back-trajectory concentrations[J]. Atmos Environ, 2010,44(25):3015-3023.DOI: 10.1016/j.atmosenv.2010.05.009. 本文引用 [1]

[11]

沈剑波, 雷相东, 李玉堂 , 等. 基于BP神经网络的长白落叶松人工林林分平均高预测[J]. 南京林业大学学报(自然科学版), 2018,42(2):147-154.

SHEN J

, LEI X

, LI Y

, et al. Prediction mean height for Larix olgensis plantation based on Bayesian-regularization BP neural network[J]. J Nanjing For Univ (Nat Sci Ed), 2018,42(2):147-154.DOI: 10.3969/j.issn.1000-2006.201706012.

本文引用 [1]

[12]

王嫣然, 张学霞, 赵静瑶 , 等. 北京地区不同季节PM_2.5和PM₁₀浓度对地面气象因素的响应[J]. 中国环境监测, 2017,33(2):34-41.

WANG Y

, ZHANG X

, ZHAO J

, et al. Study on the response of PM_2.5 and PM₁₀ concentrations to the ground meteorological conditions in different seasons in Beijing[J]. Environ Monit China, 2017,33(2):34-41. DOI: 10.19316/j.issn.1002-6002.2017.02.06.

本文引用 [1]

[13]

姚达文, 刘永红, 丁卉 , 等. 气象参数对基于BP神经网络的PM_2.5日均值预报模型的影响[J]. 安全与环境学报, 2015,15(6):324-328.

YAO D

, LIU Y

, DING

, et al. Effect of meteorological parameters on the PM_2.5 daily concentration forecasting model based on the BP neural network[J]. J Saf Environ, 2015,15(6):324-328.DOI: 10.13637/j.issn.1009-6094.2015.06.067.

本文引用 [1]

[14]

杨笑笑, 汤莉莉, 张运江 , 等. 南京夏季市区VOCs特征及O₃生成潜势的相关性分析[J]. 环境科学, 2016,37(2):443-451.

YANG X

, TANG L

, ZHANG Y

, et al. Correlation analysis between characteristics of VOCs and ozone formation potential in summer in Nanjing urban district[J]. Environ Sci, 2016,37(2):443-451.DOI: 10.13227/j.hjkx.2016.02.006.

本文引用 [1]

[15]

陈刚, 刘佳媛, 皇甫延琦 , 等. 合肥城区PM₁₀及PM_2.5季节污染特征及来源解析[J]. 中国环境科学, 2016,36(7):1938-1946.

CHEN

, LIU J

, HUANGPU Y

, et al. Seasonal variations and source apportionment of ambient PM₁₀ and PM_2.5 at urban area of Hefei,China[J]. China Environ Sci, 2016,36(7):1938-1946.DOI: 10.3969/j.issn.1000-6923.2016.07.003

本文引用 [1]

[16]	ZHOU F N, PARK J H, LIU Y J . Differential feature based hierarchical PCA fault detection method for dynamic fault[J]. Neurocomputing, 2016,202:27-35.DOI: 10.1016/j.neucom.2016.03.007. 本文引用 [1]

[17]	REN T, LIU S, MU H P , et al. Temperature prediction of the molten salt collector tube using BP neural network[J]. IET Renew Power Gener, 2016,10(2):212-220.DOI: 10.1049/iet-rpg.2015.0065. 本文引用 [1]

[18]	肖小兵, 刘宏立, 马子骥 . 基于奇异谱分析的经验模态分解去噪方法[J]. 计算机工程与科学, 2017,39(5):919-924. XIAO X B, LIU H L, MA Z J . An empirical mode decomposition de-noising method based on singular spectrum analysis[J]. Comput Eng Sci, 2017,39(5):919-924.DOI: 10.3969/j.issn.1007-130X.2017.05.015. 本文引用 [1]

[19]

XIEX, LAM K

. Gabor-based kernel PCA with doubly nonlinear mapping for face recognition with a single face image[J]. IEEE Trans Image Process, 2006,15(9):2481-2492. DOI: 10.1109/TIP.2006.877435.

https://doi.org/10.1109/tip.2006.877435

https://www.ncbi.nlm.nih.gov/pubmed/16948295

本文引用 [1] 摘要

In this paper, a novel Gabor-based kernel principal component analysis (PCA) with doubly nonlinear mapping is proposed for human face recognition. In our approach, the Gabor wavelets are used to extract facial features, then a doubly nonlinear mapping kernel PCA (DKPCA) is proposed to perform feature transformation and face recognition. The conventional kernel PCA nonlinearly maps an input image into a high-dimensional feature space in order to make the mapped features linearly separable. However, this method does not consider the structural characteristics of the face images, and it is difficult to determine which nonlinear mapping is more effective for face recognition. In this paper, a new method of nonlinear mapping, which is performed in the original feature space, is defined. The proposed nonlinear mapping not only considers the statistical property of the input features, but also adopts an eigenmask to emphasize those important facial feature points. Therefore, after this mapping, the transformed features have a higher discriminating power, and the relative importance of the features adapts to the spatial importance of the face images. This new nonlinear mapping is combined with the conventional kernel PCA to be called

[20]

关蓓蓓, 郑思俊, 崔心红 . 城市人工林空气负离子变化特征及其主要影响因子[J]. 南京林业大学学报(自然科学版), 2016,40(1):73-79.

GUAN B

, ZHENG S

, CUI X

. The variation and main influencing factors of negative air ions in urban plantation[J]. J Nanjing For Univ (Nat Sci Ed), 2016,40(1):73-79.DOI: 10.3969/j.issn.1000-2006.2016.01.012.

本文引用 [1]