Open Access Paper
11 September 2023 Research on short-term power load forecasting and analysis based on EEMD-LSSVM- integrated BPNN
Hui Liu, Xiaomin Xie, Shanshan Tang
Author Affiliations +
Proceedings Volume 12779, Seventh International Conference on Mechatronics and Intelligent Robotics (ICMIR 2023); 127790Q (2023) https://doi.org/10.1117/12.2688712
Event: Seventh International Conference on Mechatronics and Intelligent Robotics (ICMIR 2023), 2023, Kunming, China
Abstract
In view of the stability and reliability of power system operation, it is necessary to adopt accurate short-time load forecasting. In order to improve the forecasting accuracy, a very short time power load forecasting method based on EEMD-LSSVM- integrated BPNN is proposed. Firstly, EEMD (Ensemble Empirical Mode Decomposition) is used to decompose the initial power sequence into a plurality of components with different frequencies, calculate the SE (Sample Entropy) of each component, and recombine the similar components with entropy values. LSSVM (Least Squares Support Vector Machine) is used to predict the low-frequency trend components, and integrated BPNN (Back Propagation Neural Network) based on Bagging integrated learning algorithm is used to predict the remaining components. Finally, the distributed power series are comprehensively predicted. The example shows that this method is a feasible short-term power load forecasting method and can effectively improve the forecasting accuracy and stability.

1.

INTRODUCTION

According to the expected location of various state events in the life cycle profile and the characteristics of the state itself, combined with the mechanism of environmental stress, the environmental events that may be experienced by the in-vehicle environmental control device during the whole life cycle are analyzed. A day or a week in advance based on the historical load change law, combined with meteorological information, human activities and other external factors, and is an important part of smart grid development planning 1. Power system planning and operation2. Due to the increasing complexity and uncertainty of modern power grid, the fluctuation behavior of load has become more and more complicated. As the device is required to meet the use environment of the whole land, in addition to the above storage environment factors, it is also necessary to consider the influence of salt fog, mold and other environments on the device 3 and machine learning method represented by SVM (Support Vector Machine] 4. There are many factors that affect the power load. The power load itself is a random non-stationary sequence, and the statistical modeling method generally does not consider meteorological information, holiday information and other influencing factors, which leads to the failure to reflect the nonlinear characteristics that affect the load fluctuation. SVR(Support Vector Regression) method combines meteorological factors to realize dynamic modeling on SVM5. Literature 6-7 has constructed several different forecasting models, mainly including BPNN, GRNN (General Regression Neural Network) and so on, which can consider various external factors at the same time. And the fluctuation behavior of power load becomes more and more complicated. It is gradually difficult for a single forecasting model to obtain good forecasting accuracy. In order to make the model obtain better prediction performance, the combined prediction model constructed by introducing data decomposition algorithm came into being. References 8 and 9 discussed two data decomposition methods, EMD (Empirical Mode Decomposition) and EEMD. The defect of EMD is that modal aliasing is easy to occur when decomposing the original load series, and EEMD can improve this phenomenon by adding Gaussian white noise. It should be pointed out that the existing combined forecasting model usually uses a single model to predict multiple components after obtaining multiple components by using data decomposition algorithm, which cannot guarantee that the model has adaptability to multiple frequency components. Therefore, this paper proposes a short-term power load forecasting method based on EEMD-LSSVM- integrated BPNN. Firstly, the original load is decomposed by EEMD, and multiple components with different frequencies and characteristics are obtained. At the same time, in order to reduce the modeling task, the components with similar entropy values are calculated, the low-frequency trend components are predicted by LSSVM, and the remaining components are predicted by integrated BPNN based on Bagging integrated learning algorithm. The new model is verified in an actual example, and the prediction accuracy and stability are greatly improved.

2.

EEMD ALGORITHM AND SAMPLE ENTROPY

2.1

EEMD algorithm principle

The characteristic of EMD algorithm is that it decomposes the original signal adaptively with time scale characteristics without setting the basis function, which is very suitable for dealing with non-stationary and strongly random time series, and its main drawback is the phenomenon of modal aliasing 10-11.

By adding Gaussian white noise, a continuous signal to be decomposed is obtained, and then EMD11 is performed. The EEMD with uniformly distributed characteristic noise can promote the separation of input data at different scales, without actually affecting the IMF, which can effectively extract signals from the data and suppress modal aliasing.

The steps of EEMD algorithm are as follows 12:

  • (1) White noise is added to the original power load data, and 20% of the standard deviation of the signal is set as the standard deviation of the white noise, which can be adjusted.

  • (2) The signal is residual component decomposed by EMD, as shown in the following equation, and the signal to be decomposed is located on the left:

    00066_PSISDG12779_127790Q_page_2_1.jpg

    Where: is the decomposed IMF; Is the residual component after decomposition.

  • (3) Repeat the above steps (1) and (2) and add mutually different white noise sequences at the same time. The number of repetitions is determined according to the effect to be achieved by decomposition.

  • (4) After reaching the maximum number, get the average value of the corresponding IMF as the final result.

2.2

Sample entropy (SE)

The greater the complexity of time series in SE method, the greater the SE value, and the calculation has good consistency, independent of data length. Measuring the complexity of a sequence is mainly based on the probability of generating a new pattern in the measured signal 13. At present, SE is widely used in biomedicine, brain wave recognition and other fields. In this paper, SE is used to recombine the components obtained after EEMD decomposition, which can not only improve the calculation efficiency, but also reduce the modeling task.

{x(i)} = {x(1), x(2),⋯, x(N)} is time series data and consists of N samples. The steps of SE algorithm are as follows:

  • (1) Set the embedding dimension m in advance, and form a group of m-dimensional vectors in sequence:

    00066_PSISDG12779_127790Q_page_3_1.jpg

  • (2) The absolute value of the maximum difference between the orientations Xm (i) and Xm (j) of the distance d[Xm (i), Xm (j)], namely:

    00066_PSISDG12779_127790Q_page_3_2.jpg

  • (3) r (r > 0) is the set similarity tolerance, given that Xm (i), 00066_PSISDG12779_127790Q_page_3_3.jpg and Bm (r) are the given number ratio d[Xm (i), Xm (j)] ≤ r of Xm (i), the formula is as follows:

    00066_PSISDG12779_127790Q_page_3_4.jpg
    00066_PSISDG12779_127790Q_page_3_5.jpg

  • (4) Add 1 to the vector dimension, and repeat the above three steps to get the Bm +1 (r) mean of 00066_PSISDG12779_127790Q_page_3_6.jpg :

    00066_PSISDG12779_127790Q_page_3_7.jpg

  • (5) Calculate the SE of time series. When N is a finite value, the calculation formula is as follows:

    00066_PSISDG12779_127790Q_page_3_8.jpg

3.

SHORT-TERM LOAD FORECASTING BASED ON EEMD-LSSVM-BAGGING-BPNN

3.1

EEMD algorithm principle

The standard BPNN feed forward neural network is shown in Figure 1 below:

Figure 1.

BPNN structure diagram

00066_PSISDG12779_127790Q_page_4_1.jpg

Taking the prediction accuracy and time as the comprehensive measure, through many experiments, it is determined that 20 rounds of self-help random sampling with playback are carried out in proportion, and BPNN is used as a weak learner, and 20 prediction models are trained and predicted on the test set, and a strong learner is obtained after simple equal weight average. The implementation mechanism of the algorithm is shown in Figure 2:

Figure 2.

Realization mechanism of Bagging-BPNN algorithm

00066_PSISDG12779_127790Q_page_4_2.jpg

Many external factors, such as meteorological conditions, will affect the power load to varying degrees, which leads to the randomness of the power load, while the time factor makes the fluctuation of the power load show periodic changes 14. EEMD-LSSVM-Bagging-BPNN model, and the steps are as follows:

  • (1) Input historical load data and preprocess abnormal or incomplete data.

  • (2) The prototype should be in the form of a highly compact all-in-one machine, with light weight, integration, high reliability and environmental adaptability as the main technical characteristics, and pay attention to component scheme optimization and performance matching, so as to effectively improve the actual cooling and heating adjustment effect.

  • (3) Analyze the characteristics of the recombined components, and select the characteristic input variables with high correlation. The characteristic pool includes meteorological, time information and historical load.

  • (4) Normalizing the data, using LSSVM model to predict the low-frequency trend components, using Bagging-BPNN to predict the remaining components, as shown in Figure 3 below.

Figure 3.

EEMD-LSSVM- integrated BPNN model prediction flow chart

00066_PSISDG12779_127790Q_page_5_1.jpg

3.2

Load decomposition and reorganization results

In this paper, the load of Denmark, a central and northern European country, from April 1, 2015 to September 19, 2015 is selected as the research object, and the data sampling frequency is 1h. After EEMD, 11 components are obtained, and the selected parts are shown in Figure 4.

Figure 4.

EEMD decomposes load sequence

00066_PSISDG12779_127790Q_page_5_2.jpg

If these 11 components are modeled separately, it will increase the computational complexity of overall model training and prediction. In order to reduce the modeling task and improve the calculation efficiency, a new sequence is obtained after using SE, in which the embedding dimension m = 2 and similarity tolerance r = 0.2 ×std(IMFi) are included. As can be seen from fig. 5, the SE value decreases with the decrease of the frequency of each component, and some adjacent components have similar SE values. Among them, the SE value of IMF1 is the highest, representing the highest sequence complexity, and IMF1 is taken as a random component alone; Reorganize IMF2, 3 and 4 into detail components; The rest are reorganized into trend components.

Figure 5.

Entropy value of each component sample

00066_PSISDG12779_127790Q_page_6_1.jpg

As can be seen from Figure 6, the random component has strong randomness and high frequency, which is mainly influenced by external factors. The detail component changes periodically on a daily basis, which has certain regularity. The trend component is the most gentle, changes periodically in weeks, and its load value is the largest. It can be known that the trend component is the base load part of the original sequence load and has the strongest correlation with the historical load. Under this condition, the device and the parts with ventilation or leakage will be affected by dust, especially the condenser and condensing fan. The heat exchange fins of the condenser are easily blocked by dust, and the fan blades are easily damaged by dust.

Figure 6.

Sequence merging result

00066_PSISDG12779_127790Q_page_6_2.jpg

The feature pool considered in this paper includes meteorology, time information and historical load. Historical data is select as that data of the first 24 time points of the forecast time point, so as to forecast the load value of the next time point; Meteorological information includes wind speed and temperature, and considering the lag effect of temperature on load, the temperature of the first five time points of the forecast time point is also taken as the characteristic input variable; Time information includes week type, time point type and holiday type, and the time information is coded by unique heat coding 15. In order to model the recombined components, the three components are sorted by mutual information correlation analysis 16, and the low-frequency trend component, as the base load part, has the least correlation with meteorological information, so meteorological information is not considered when modeling the trend component, and there are 56-dimensional characteristic input variables, and the rest components have 63-dimensional characteristic input variables.

3.3

Predictive evaluation index

In this paper, MAPE and RMSE are used to quantify the error and evaluate the effectiveness of the prediction model. The calculation formula is as follows:

00066_PSISDG12779_127790Q_page_7_1.jpg
00066_PSISDG12779_127790Q_page_7_2.jpg

The smaller the calculation results of these two indicators, the better the prediction performance; Yi is the actual value of the i sample, and 00066_PSISDG12779_127790Q_page_7_3.jpg is the predicted value; N is the total number of predicted outputs.

4.

EXAMPLE ANALYSIS

4.1

Overview of calculation examples

In order to verify established separately the validity of the forecasting method proposed in this paper, the load and meteorological data of Denmark, a central and northern European country, from April 1, 2015 to September 18, 2015 are selected as established separately the research object. In order to facilitate the comparative analysis, the EEMD-LSSVM-Bagging-BPNN model proposed in this paper and six forecasting models, namely LSSVM, BPNN, Bagging-BPNN, EEMD-LSSVM and EEMD-Bagging-BPNN, are established to compare the forecasting performance. Among them, the penalty coefficient of LSSVM and the parameters of radial basis function are selected by grid search optimization method, and the parameter range is set to [0,10] and the iteration step is 1. When forecasting the original load series, the BPNN model selects a single hidden layer structure, the number of hidden layer neurons is set to 15, the transfer function is tansig, the learning rate is 0.01, the learning target is 0.001, and the number of iterations is 100. Levenberg-Marquardt optimization is adopted, and the prediction results are averaged several times. When EEMD and SE methods are used to get the trend, details and random components after recombination, and the model is established separately, the structural parameters of BPNN in EEMD-Bagging-BPNN model are 56-12-1, 63-15-1 and 63-11-1 respectively, and the other settings are consistent with the above.

4.2

Prediction result analysis

Firstly, BPNN, Bagging-BPNN and EEMD-Bagging-BPNN models are established to verify the effectiveness of Bagging ensemble learning algorithm in the destination city, and the prediction results are compared and analyzed. The load and meteorological data of Denmark from April 1, 2015 to September 11, 2015 are trained to predict the big load data of 24 hours on September 12, 2015, and the two results are very shown in Figure 7. It can be seen from Figure 7 that structural parameters compared with BPNN, the predicted results of EEMD-Bagging-BPNN and Bagging-BPNN can better fit the real value. Table 1 shows the MAPE values and RMSE values of the three models. It can be seen that the MAPE values and RMSE values of Bagging-BPNN model are small reduced by 27.33% and 19.54% respectively compared with the BPNN model, which proves that the introduction of Bagging ensemble learning algorithm can effectively optimize the prediction performance of BPNN. At the same time, after EEMD decomposition, each component is modeled separately to predict, and the MAPE values and RMSE values are also reduced by 9.17 compared with the Bagging-BPNN model.

Figure 7.

Comparison of Bagging-BPNN and BPNN Prediction Results Curve on September 12th

00066_PSISDG12779_127790Q_page_8_1.jpg

Table 1.

Comparison of evaluation indexes of September 12 forecast results

modelMAPE/%RMSE/MW
BPNN1.5062.02
Bagging-BPNN1.0949.93
EEMD-Bagging-BPNN0.9942.12

However, Bagging-BPNN still has an over-fitting phenomenon on the decomposed low-frequency trend components. The prediction result of low-frequency trend components on September 12, 2015 is shown in Figure 8, from which it can be seen that the prediction result of LSSVM model is closer to the real value. Thus, the EEMD-LSSVM-Bagging-BPNN prediction model is established, the LSSVM model is used to model the low-frequency trend component, and the Bagging-BPNN model is used to model the detailed and random components.

Figure 8.

Comparison of prediction results of trend components

00066_PSISDG12779_127790Q_page_8_2.jpg

Fig. 9 shows the forecast results of six forecasting models, namely LSSVM, BPNN, Bagging-BPNN, EEMD-LSSVM, EEMD-Bagging-BPNN and EEMD-LSSVM-Bagging-BPNN, on September 12th. It can be seen from the local enlargement that the EEMD-LSSVM-Bagging-BPNN forecasting.

Figure 9.

Comparison of prediction results curves of all models on September 12th.

00066_PSISDG12779_127790Q_page_9_1.jpg

In order to verify the applicability of the model, the load from September 12 to September 18 is further predicted, and the prediction evaluation indicators are shown in Table 2. Based on the analysis of the average of this week’s forecast results, an integrated BPNN model is constructed by introducing Bagging ensemble learning algorithm, and its forecast results MAPE 18.52% and 17.02% lower than that of a single BPNN model, which further verifies that Bagging ensemble learning algorithm can effectively improve the forecasting performance of BPNN. The original sequence is decomposed by EEMD, the low-frequency trend component is modeled separately by introducing LSSVM, and the detailed and random components are modeled by introducing Bagging-BPNN, that is, after the EEMD-LSSVM-Bagging-BPNN model is constructed, the prediction performance of the model is improved to some extent compared with EEMD-Bagging-BPNN model and EEMD-LSSVM model. MAPE value decreased by 10.92% and 27.40% respectively, and RMSE value decreased by 12.27% and 23.93% respectively. It is verified that LSSVM model is more suitable for predicting low-frequency trend components after EEMD sequence decomposition, and combining Bagging-BPNN to predict other components can effectively improve the prediction accuracy. The MAPE value of the predicted results of this model decreased by 28.86%, 34.57% and 19.70% respectively, and the RMSE value also decreased by 24.70%, 31.96% and 18.01% respectively. EEMD-LSSVM-Bagging-BPNN model is a feasible prediction model.

Table 2.

Comparison of evaluation indexes of prediction results of various models from September 12th to September 18th.

dateevaluating indicatorLSSVMEEMD-LSSVMBPNNBagging-BPNNEEMD-Bagging-BPNNThis paper model
September 12(th)MAPE/%1.101.161.501.090.990.79
RMSE/MW42.0147.4062.0249.9342.1234.72
September 13(th)MAPE/%1.402.011.241.221.331.09
RMSE/MW52.8480.6553.9153.5258.3350.20
September 14(th)MAPE/%1.621.421.901.461.181.15
RMSE/MW75.4771.8592.8372.3561.9659.76
September 15(th)MAPE/%1.581.461.931.721.661.64
RMSE/MW78.2370.31100.1089.1588.6680.53
September 16(th)MAPE/%1.001.151.641.331.030.83
RMSE/MW57.6955.1277.4764.8254.6442.35
September 17(th)MAPE/%2.361.551.471.401.030.99
RMSE/MW112.6879.1476.6769.9056.9753.14
September 18(th)MAPE/%1.401.461.691.011.120.92
RMSE/MW68.2777.8476.2447.7855.4946.16
average valueMAPE/%1.491.461.621.321.191.06
RMSE/MW69.6068.9077.0363.9259.7452.41

5.

CONCLUSION

The integrated BPNN model based on Bagging integration algorithm can effectively overcome the problems of poor generalization ability and low prediction accuracy of single BPNN. The deficiency of integrated BPNN in predicting low-frequency trend components is effectively compensated by LSSVM. Finally, each component is recombined to get the final prediction. The research on related technologies and product application at home and abroad was carried out, and the equipment status, development trend and technical development and application in the field of equipment environmental control system were analyzed in detail. Based on the basic principles of advanced performance and mature technology, the technical route selection and overall scheme design optimization were carried out. At the same time, combined with the requirements of new equipment and the specific functional performance indicators of the tender, the integrated environmental detection technology of the passenger cabin, cabin ambient intelligence control technology, integrated miniaturization technology, nuclear and biochemical detection and protection technology, temperature and humidity adjustment technology, oxygen generation and cabin air purification were studied, and the overall performance of the system was matched and key technologies were verified by means of analysis, calculation, simulation and test, forming an integrated and efficient indoor environmental control scheme.

FUND PROJECT

Natural Science Fund Project of Universities in Anhui Province (KJ2020A1035);

Major projects of natural scientific research in colleges and universities in Anhui Province (2022AH040281).

REFERENCES

[1] 

Liang, Z., Sun, G. Q., Li, H. C., et al., “Short-term load forecasting based on VMD and PSO optimized deep belief network[J],” Power System Technology, 42 (2), 598 –606 (2018). Google Scholar

[2] 

Li, W. Q., Chang, L., “A combination model with variable weight optimization for short-term electrical load forecasting[J],” Energy, 164 575 –593 (2018). https://doi.org/10.1016/j.energy.2018.09.027 Google Scholar

[3] 

Lee, C. M., Ko, C. N., “Short-term load forecasting using lifting scheme and ARIMA models[J]Expert Systems with Applications,” 38 (5), 5902 –5911 (2011). Google Scholar

[4] 

Zhao, F. Z., Hao, S., Zhang, Y., et al., “Short-term load forecasting for distribution transformer based on VMD-BA-LSSVM algorithm[J],” Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 35 (14), 190 –197 (2019). Google Scholar

[5] 

Cai, S. P., Yan, J., Liu, G. H., et al., “A dynamic modeling methodology based on fisher information and on-line SVR for smart grids weather sensitive load forecasting[J],” in Proceedings of the CSEE, 3441 –3452 (2020). Google Scholar

[6] 

Zhu, X. C., “Fuzzy Research on short-term power load forecasting method based on IFOA-GRNN[J],” Power System Protection and Control, 48 (9), 121 –127 (2020). Google Scholar

[7] 

Peng, W., Wang, J. R., Yin, S. Q., “Short-term load forecasting model based on Attention-LSTM in electricity market[J],” Power System Technology, 43 (5), 1745 –1751 (2019). Google Scholar

[8] 

Liang, Y., Niu, D., Hong, W. C., “Short term load forecasting based on feature extraction and improved general regression neural network model[J],” Energy, 166 653 –663 (2019). https://doi.org/10.1016/j.energy.2018.10.119 Google Scholar

[9] 

Wang, X., Meng, L. L., “Ultra-short-term load forecasting based on EEMD-LSSVM[J],” Power System Protection and Control, 43 (1), 61 –66 (2015). Google Scholar

[10] 

Yeh, J. R., Shieh, J. S., Huang, N. E., “Complementary ensemble empirical mode decomposition: A novel noise enhanced data analysis method[J],” Advances in adaptive data analysis, 2 (2), 135 –156 (2010). https://doi.org/10.1142/S1793536910000422 Google Scholar

[11] 

Deng, D. Y., Li, J., Zhang, Z. Y., et al., “Short-term Electric Load Forecasting Based on EEMD-GRU-MLR[J],” Power System Technology, 44 (2), 593 –602 (2020). Google Scholar

[12] 

Zhu, N. H., Bai, X. M., Dong, W. J., “Harmonic Detection Method Based on EEMD[J],” in Proceedings of the CSEE, 92 –98+14 (2013). Google Scholar

[13] 

Jiang, F., Qiao, Y. Q., “PM2.5 concentration prediction based on sample entropy and improved extreme learning machine[J],” Statistics & Decision, 37 (3), 166 –171 (2021). Google Scholar

[14] 

Chang, X. X., “Research on short-term Load Forecasting of power System based on Integrated Elman Neural Network[D],” Qingdao University, Qingdao (2019). Google Scholar

[15] 

Liang, J., Chen, J. H., Zhang, X. Q., et al., “One-hot encoding and convolutional neural network based anomaly detection[J],” Journal of Tsinghua University(Science and Technology), 59 (7), 523 –529 (2019). Google Scholar

[16] 

Kong, X. Y., Li, C., Zheng, F., et al., “Short-term load forecasting method based on empirical mode decomposition and feature correlation analysis[J],” Automation of Electric Power Systems, 43 (5), 46 –56 (2019). Google Scholar
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hui Liu, Xiaomin Xie, and Shanshan Tang "Research on short-term power load forecasting and analysis based on EEMD-LSSVM- integrated BPNN", Proc. SPIE 12779, Seventh International Conference on Mechatronics and Intelligent Robotics (ICMIR 2023), 127790Q (11 September 2023); https://doi.org/10.1117/12.2688712
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Performance modeling

Data modeling

Machine learning

Meteorology

Atmospheric modeling

Analytical research

Modal decomposition

Back to Top