Professional Documents
Culture Documents
art ic l e i nf o
a b s t r a c t
Article history:
Received 12 September 2014
Received in revised form
18 February 2015
Accepted 26 February 2015
Available online 19 March 2015
To protect running trains against the strong crosswind along Chinese QinghaiTibet railway, a strong
wind warning system is developed. As one of the most important technologies of the developed system,
a new short-term wind speed forecasting method is proposed by adopting the Empirical Mode
Decomposition (EMD) and the improved Recursive Autoregressive Integrated Moving Average (RARIMA)
model. The proposed forecasting method consists of three computational steps as: (a) use the EMD
method to decompose the original wind speed data into a group of wind speed sub-layers; (b) build the
forecasting models for all the decomposed sub-layers by utilizing the RARIMA algorithm; (c) employ the
built RARIMA models to predict the wind speed in the sub-layers; and (d) summarize the predicted
results of the wind speed sub-layers to get the nal forecasting results for the original wind speed. Since
the wind speed forecasting method is proposed for the real-time warning system, the forecasting
accuracy and the time performance of the forecasting computation are both considered. Two experiments show that: (a) the proposed method has better forecasting performance than the traditional
Autoregressive Integrated Moving Average (ARIMA) model, the Persistent Random Walk Model (PRWM)
and the Back Propagation (BP) neural networks; and (b) the proposed method has satisfactory
performance in both of the accuracy and the time performance.
& 2015 Elsevier Ltd. All rights reserved.
Keywords:
QinghaiTibet railway
Strong wind
Wind speed forecasting
Wind speed prediction
Warning system
Empirical Mode Decomposition
Recursive ARIMA
Neural networks
1. Introduction
In recent decade, there are several derailment accidents happened in the world caused by strong crosswind. A train consisting of
six vehicles was derailed between Akita station and Niigata station in
Uetsu railway line on December 25, 2005 in Japan (Train death toll
rises in Japan, 2005). The investigators from the Department of
Trafc Safety of East Japan Railway Company conrmed that the train
derailment was caused by the strong wind along the Uetsu line.
Eleven vehicles of a train were blown over in Xinjiang railway on
February 28, 2007 in China (Chen et al., 2010; Strong wind topples
Chinese train, 2007). The reason of the derailment proposed by the
Ministry of Railway of China was also the strong crosswind. Unfortunately, similar train derailment accidents also happened in Canada
(Freight train derails in southern Saskatchewan, 2014) and USA
(Shust et al., 2010).
n
Corresponding author at: Key Laboratory of Trafc Safety on Track of Ministry of
Education, School of Trafc and Transportation Engineering, Central South
University, Changsha 410075, Hunan, China.
Tel.: 86 73182655294; fax: 86 73182656374.
E-mail address: csuliuhui@csu.edu.cn (H. Liu).
http://dx.doi.org/10.1016/j.jweia.2015.02.004
0167-6105/& 2015 Elsevier Ltd. All rights reserved.
28
Drivers of Running
Trains
Wind Database
Train Information
Fibre
network
Network
Router
Track Information
Wind speed
station
Network
Hub
Wireless
network
Client Computers
Wind speed
station
Fig. 1. Architecture of the strong wind warning system for the QinghaiTibet railway.
29
Fig. 3. Mileage positions of the 52 wind speed stations along the QinghaiTibet
railway. (For interpretation of the references to color in this gure, the reader is
referred to the web version of this article.)
Fig. 2. Monitoring GUI of the strong wind warning system for the QinghaiTibet railway.
30
L1
t1
L2
t2
t4
t3
Fig. 5. Concept of the application of the wind speed predictions in the warning
system.
Start:
Original Railway
Wind Speed Data
Empirical Mode
Decomposition
(EMD)
Summarizing
Calculation
End:
Final Wind Speed
Predictions
31
n
X
C i t Rt
i1
where C i t
; i 1; 2; ; n is the IMFs in different decompositions, Rt is the residue and n is the number of the IMFs.
The computational steps of the EMD are given as follows
(Huang et al., 1998):
Fig. 7. Original wind speed time series fX 1t g.
Step 1: Identify all the local extrema of series Xt , including
local maxima and local minima.
Step 2: Connect all the local maxima
by a cubic spline line to
generate its upper
envelop X up t . Similarly the lower
envelop X low t is made with all the
local
minima.
Step 3: Compute the mean envelop Mt from the upper and
lower envelops as follows:
Mt X up t X low t =2
2
Step 4: Extract the details as follows:
Zt Xt Mt
wind speed data into a number of wind speed sub-layers; (b) build
the recursive ARIMA models for all the sub-layers and adopt
the built models to forecast the multi-step predictions; and
(c) summarize the wind speed multi-step predictions of the sublayers to get the nal predictions for the original wind speed.
3.2. Original wind speed data
Two groups of real wind speed series measured by two
different wind speed stations (i.e., the 30th station and the 32nd
station) along the QinghaiTibet railway are adopted to demonstrate the forecasting performance of the proposed EMD-RARIMA
method, as shown in Figs. 7 and 8. The time interval of two
consecutive data points is one minute. The mileage distance
between the two wind speed stations is 41 km. They are named
fX 1t g series and fX 2t g series, respectively. The 1st300th ones of
them are used to establish the forecasting models and the 301st
400th ones are utilized to check the built models.
3.3. Wind speed decomposition by EMD
As explained in Section 3.1, the Empirical Mode Decomposition
(EMD) is adopted to decompose the originally non-stationary
wind speed samples. The EMD is a mathematical time domain
decomposing method presented by scientists N. E. Huang in 1998
(Huang et al., 1998), which can convert a group of time series into
locally narrow band components, named Intrinsic Mode Functions
(IMFs). The purpose of executing the EMD decomposition in the
study is to convert the original non-stationary wind speed series
into a number of relatively stable wind speed sub-layers, which
will decrease the difculty to realize the high-precision predictions for the original wind speed. Different to the wavelet decomposition and the wavelet packet decomposition (Liu et al., 2013a,
2013b) which both need to select the decomposing parameters
Step 5: Check whether Zt is an IMF: (a) if Zt is an IMF
then set Ct Zt and meantime
Xt with the
replace
residual Rt Xt Ct ; (b) if Zt is not an IMF, replace
Xt with Zt then repeat Steps 24 until the termination
criterion is satised. The following equation can be regarded as
the termination condition of this iterative calculation:
2
m
X
Z j 1 t Z j t
r j 1; 2; ; t 1; 2; ; m
4
2
Z j 1 t
t1
where m is the number of the wind speed data points, is the
terminated parameter, and j denotes the times of iterative
calculation. In this study, the is equal to 0.2.
Step 6: The procedure of Steps 15 is repeated until all the IMFs
are found.
The EMD decomposed results of the original wind speed series
fX 1t g are given in Fig. 9. From Fig. 9, it can be seen that the fX 1t g
series have been converted into a group of wind speed sub-layers
successfully. A number of RARIMA models will be built in these
decomposed wind speed sub-layers to complete the multi-step
forecasting computation. It is obvious that it is easier for the built
RARIMA models to obtain the accurate forecasting results by using
the decomposed sub-layers than adopting the original wind
speed data.
In this study, all of the decomposed wind speed sub-layers will
be checked their stability by the Run Sequence Method (RSM)
(Hentati-Kaffel and de Peretti, 2015). The RSM is a non-parametric
statistical test to check the stability of a group of time series data.
The contents of the RSM algorithm are given as follows:
Step 1: Calculate the average value X of an original series fX t g.
Step 2: Convert the original series fX t g to a symbol series fSt g
using the criterion as below: set X t i is the ith wind speed
sample in the series fX t g; if X t i Z X, the symbol will be
adopted to replace the position of this sample; Oppositely, if
X t i o X, the symbol will be used to replace the position of
this sample.
Step 3: Calculate the number of the positive runs N and the
number of the negative runs N using the standard as below:
32
Fig. 9. Decomposed results of the original wind speed series fX 1t g. (a) fX D1t g series, (b) fX D2t g series, (c) fX D3t g series, (d) fX D4t g series, (e) fX D5t g series, (f) fX D6t g series,
(g) fX D7t g series, (h) fX D8t g series, (i) fX Rt g series.
in the converted symbol series fSt g, a sequence of the consecutive symbols is named a positive run, and a sequence of
the consecutive symbols is named a negative run. For
example, the N is equal to three and the N is equal to two
in a symbol series like .
Step 4: The RSM considers that if the time series fX t g is stable,
its corresponding symbol series fSt g should meet the normal
distribution. So the following statistical equations can be used
to check whether the series fX t g is stable as:
Z =
where:
2N 1 N 2 2N 1 N 2 N 1=2
N2 N 1
2N1 N 2
1
N
N N1 N2
Then, if jZ jr 1:96, the series fX t g is stable under the signicant level parameter 0:05. Otherwise, the series fX t g is
unstable.
3.4. Wind speed forecasting by RARIMA
Time series method is a statistical algorithm which can build an
explicit equation to describe the potential changing law of a
section of time series data (Wei, 1994). It has ve standard models,
including the Auto-Regressive (AR), the Moving Average (MA), the
Auto-Regressive Moving Average (ARMA) and the Auto-Regressive
Integrated Moving Average (ARIMA). The ARIMA models can
describe all the other ones.
3.4.1. Autoregressive Integrated Moving Average (ARIMA) model
The equations of the ARIMA (p, d, q) are dened as follows:
B UK d UXt B Uat
where:
8
B Xt 1=Xt
>
>
>
>
< K 1 B
B 1 1 B 2 B2 3 B3 p 1 Bp 1 p Bp
>
>
>
>
: B 1 B B2 B3
Bq 1 Bq
1
q1
10
where Xt is the wind speed time series, at is the random error
series assumed to be a white noise with a mean of zero and equal
variance, B is the backward shift operator, K is the difference
33
p
parameter of the auto-regressive part, and q is the parameter of
the moving average part.
As shown in Eq. (9), the ARIMA (p, d, q) model is actually a
combination of the AR(p) model, the MA(q) model and the
component of d times Difference Computation (DC). To demonstrate the derivation of the ARIMA (p, d, q) model, an ARIMA(2, 2,
2) model is taken as an example as follows:
1 1 B 2 B2 1 B2
2 B2 U at
U DC U Xt 1 1 BMA2
AR2
) 1 1 B 2 B2 U 1 B2 2B U Xt 1 1 B 2 B2 U at
) 1 2 1 B 1 21 2 B2 1 22 B3 2 B4 UXt
1 1 B 2 B2 U at
) Xt 2 1 Xt 1 1 21 2 Xt 2 1 22
Xt 3 2 Xt 4 at 1 at 1 2 at 2
) Xt 2 1 Xt 1 1 21 2 Xt 2 1 22
11
Xt 3 2 Xt 4 at 1 at 1 2 at 2
3.4.2. BoxJenkins methodology
The BoxJenkins methodology, named after statisticians George
Box and Gwilym Jenkins, is presented to establish the suitable
time series models. The reason to choose the BoxJenkins methodology in the study is that it has good real-time performance due
to its simple computational equations.
The BoxJenkins methodology consists of three modeling contents:
the model identication, the selection of model order and the
parameter estimation. The details of the BoxJenkins methodology
can be explained as: (a) in the step of model identication, the
Autocorrelation function (ACF) and partial autocorrelation function
(PACF) are always utilized to decide whether an Auto-Regressive (AR)
component or a Moving Average (MA) one should be included in the
ARIMA models. If the ACF coefcients show smearing effect, the AR
component will be included. If the PACF coefcients indicate censoring
effect, the MA component will be included. Additionally, the Run
Sequence Method (RSM) is employed to identify the stability of the
EMD decomposed sub-layers. If a sub-layer has been found that it is
unstable, the Difference Computation (DC) will be executed on this
sub-layer; (b) in the step of selection of model order, some criterions
(e.g., Final Prediction Error, Akaike Information Criterion, Bayes Information Criterion, etc) will be adopted to select the best orders of the
identied ARIMA models; and (c) in the step of parameter estimation,
some methods (e.g., Least Square Estimation, YuleWalker Estimation,
Greatly Relieved Estimate, etc) will be utilized to calculate the optimal
Fig. 10. Calculated results of the ACF/PACF coefcients for the fX D1t g series. (a) ACF results, (b) PACF results.
34
k
R
>
N
:
t1
R
R
6 27 6 1
0
p2 7
7
13
6
76
7 6
6 7 6
7 6
7
4
5 4
5 4 5
R^ p
R^
R^
R^
^
p
p1
p2
One-step forecasting:
X^ D1t 301 0:1879X D1t 300 0:1910X D1t 299
0:1988X D1t 298 0:2706X D1t 297
0:1377X D1t 296 0:3088X D1t 295
0:0873X D1t 294
15
Two-step forecasting:
X^ D1t 302 0:1879X^ D1t 301 0:1910X D1t 300 0:1988X D1t 299
16
Three-step forecasting:
X^ D1t 303 0:1879X^ D1t 302 0:1910X^ D1t 301 0:1988X D1t 300
0:2706X D1t 299 0:1377X D1t 298 0:3088X D1t 297
17
One-step forecasting:
14
X^ D1t 301 0:1879X D1t 300 0:1910X D1t 299 0:1988X D1t 298
0:2706X D1t 297 0:1377X D1t 296 0:3088X D1t 295
18
Two-step forecasting:
X^ D1t 302 0:1327X^ D1t 301 0:1253X D1t 300 0:1680X D1t 299
0:2837X D1t 298 0:1102X D1t 297 0:3235X D1t 296
0:1071X D1t 295
19
35
(c) Since the RARIMA models can adopt the latest forecasted results
to update the parameters of the forecasting equations before starting a
new computational process so that the RARIMA models can have
better forecasting accuracy than the traditional ARIMA models (i.e.,
non-recursive ARIMA models).
Three-step forecasting:
X^ D1t 303 0:1993X^ D1t 302 0:0945X^ D1t 301
0:1895X D1t 300 0:2710X D1t 299
0:0771X D1t 298 0:2953X D1t 297
0:0623X D1t 296
Fig. 12. Results of the one-step predictions for the original wind speed series fX 1t g by the EMD-RARIMA model, the BP neural networks, the ARIMA and the PRWM.
Fig. 13. Results of the three-step predictions for the original wind speed series fX 1t g by the EMD-RARIMA model, the BP neural networks, the ARIMA and the PRWM.
Fig. 14. Results of the ve-step predictions for the original wind speed series fX 1t g by the EMD-RARIMA model, the BP neural networks, the ARIMA and the PRWM.
36
including the Mean Absolute Error (MAE), the Mean Absolute Percentage Error (MAPE) and the Standard Deviation (SD).
M
1 X
^
j Xi Xij
Mi1
21
M
^
1 X
Xi Xi
j
j
^
Mi1
Xi
22
Standard Deviation:
v
u
M
u 1 X
^ 2
Xi Xi
SD t
M 1 i 1
23
n
o
^
where Xi is the measured wind speed time series, Xi
is
the forecasted wind
time series and M is the number of
speed
the terms in the Xi series.
Table 2
Analysis of the time performance of the predictions given in Figs. 1214.
EMD-RARIMA
1-step
Time (s)
Indexes
Time (s)
EMD-RARIMA
Indexes
MAE (%)
MAPE (%)
SD (%)
BP neural network
3-step
5-step
1-step
3-step
5-step
1.2243
1.8413
ARIMA model
1-step
3-step
0.0631
0.0752
1.9743
2.8940
3.1223
PRWM model
1-step
3-step
0.0165
3.4475
5-step
0.0937
5-step
3-step
5-step
63.52
63.11
62.14
75.03
75.24
70.01
82.75
83.24
76.05
Table 4
Improved accuracy percentages of the ARIMA model by the proposed hybrid EMDRARIMA model.
Indexes
MAE (%)
MAPE (%)
SD (%)
Table 1
Analysis of the accuracies of the predictions given in Figs. 1214.
3-step
5-step
50.40
50.00
52.22
64.15
63.81
63.32
75.41
75.76
69.30
BP neural network
1-step
3-step
5-step
1-step
3-step
5-step
MAE (m/s)
MAPE (%)
SD (m/s)
0.1607
0.76
0.2343
0.2763
1.31
0.4166
0.3248
1.51
0.5522
0.4405
2.06
0.6188
1.1067
5.29
1.3889
1.8826
9.01
2.3054
Indexes
ARIMA model
MAE (m/s)
MAPE (%)
SD (m/s)
Indexes
Table 3
Improved accuracy percentages of the BP neural network by the proposed hybrid
EMD-RARIMA model.
Indexes
Table 5
Improved accuracy percentages of the PRWM model by the proposed hybrid EMDRARIMA model.
Indexes
PRWM model
1-step
3-step
5-step
1-step
3-step
5-step
0.3240
1.52
0.4904
0.7708
3.62
1.1357
1.3210
6.23
1.7988
0.5610
2.64
0.7886
MAE (%)
MAPE (%)
SD (%)
3-step
5-step
71.35
71.21
70.29
37
(d) All the built forecasting models can meet the elapsed time
requirements of the warning system.
4.2. Case Two
Figs. 1517 show the forecasting results of the 301st400th original
wind speed series fX 2t g by the different forecasting methods. The
results of the accuracy and the time performance estimation for these
predictions are given in Tables 6 and 7, respectively.
Fig. 15. Results of the one-step predictions for the original wind speed series fX 2t g by the EMD-RARIMA model, the BP neural networks, the ARIMA and the PRWM.
Fig. 16. Results of the three-step predictions for the original wind speed series fX 2t g by the EMD-RARIMA model, the BP neural networks, the ARIMA and the PRWM.
Fig. 17. Results of the ve-step predictions for the original wind speed series fX 2t g by the EMD-RARIMA model, the BP neural networks, the ARIMA and the PRWM.
38
Table 6
Analysis of the accuracies of the predictions given in Figs. 1517.
Indexes
EMD-RARIMA
BP neural network
1-step
3-step
5-step
1-step
3-step
5-step
MAE
MAPE
SD
0.2103
1.53
0.2747
0.3408
2.54
0.4924
0.6620
4.84
1.0625
0.7582
5.03
1.1185
1.6859
11.12
2.1842
2.4703
16.13
3.2208
Indexes
ARIMA model
MAE
MAPE
SD
PRWM model
1-step
3-step
5-step
1-step
3-step
5-step
0.4884
3.51
0.7016
1.2543
9.32
1.8286
1.9998
14.88
2.7480
0.7606
5.75
1.0161
Table 7
Analysis of the time performance of the predictions given in Figs. 1517.
Indexes
EMD-RARIMA
BP neural network
1-step
3-step
5-step
1-step
3-step
5-step
Time (s)
1.3289
1.9023
2.0189
2.8528
3.2109
3.5723
Indexes
ARIMA model
Time (s)
PRWM model
1-step
3-step
5-step
1-step
3-step
5-step
0.0708
0.0832
0.0998
0.0185
References
Andersson, E., Hggstrm, J., Sima, M., Stichel, S., 2004. Assessment of trainoverturning risk due to strong cross-winds. Proc. Inst. Mech. Eng. F: J. Rail Rapid
Transit 218, 213223. http://dx.doi.org/10.1243/0954409042389382.
Baker, C., 2013. A framework for the consideration of the effects of crosswinds on
trains. J. Wind Eng. Ind. Aerodyn. A 123, 130142. http://dx.doi.org/10.1016/j.
jweia.2013.09.015.
Burlando, M., Freda, A., Ratto, C.F., Solari, G., 2010. A pilot study of the wind speed
along the RomeNaples HS/HC railway line. Part 1numerical modelling and
wind simulations. J. Wind Eng. Ind. Aerodyn. 98, 392403. http://dx.doi.org/
10.1016/j.jweia.2009.12.006.
Cheli, F., Ripamonti, F., Rocchi, D., Tomasini, G., 2010. Aerodynamic behaviour
investigation of the new EMUV250 train to cross wind. J. Wind Eng. Ind.
Aerodyn. 98, 189201. http://dx.doi.org/10.1016/j.jweia.2009.10.015.
Chen, R., Zeng, Q., Huang, Y., Xiang, J., Wen, Y., Guo, X., Yin, C., Dong, H., Zhao, G.,
2010. Analysis theory of random energy of train derailment in wind. Sci. China
Phys. Mech. Astron. 53, 751757. http://dx.doi.org/10.1007/s11433-010-0158-2.
Cui, T., Zhang, W., Sun, B., 2014. Investigation of train safety domain in cross wind in
respect of attitude change. J. Wind Eng. Ind. Aerodyn. 130, 7587. http://dx.doi.
org/10.1016/j.jweia.2014.04.006.
Deodatis, G., 1996. Simulation of ergodic multivariate stochastic processes. J. Eng. Mech.
122, 778787. http://dx.doi.org/10.1061/(ASCE)0733-9399(1996)122:8(778).
Freda, A., Solari, G., 2010. A pilot study of the wind speed along the RomeNaples
HS/HC railway line.: Part 2probabilistic analyses and methodology assessment. J. Wind Eng. Ind. Aerodyn. 98, 404416. http://dx.doi.org/10.1016/j.
jweia.2009.12.005.
Freight train derails in southern Saskatchewan [WWW Document], 2014. Prince
George Citiz. URL http://www.princegeorgecitizen.com/news/national/freighttrain-derails-in-southern-saskatchewan-cpr-says-plow-winds-were-cause-1.
1334141 (accessed 9.03.14).
Hentati-Kaffel, R., de Peretti, P., 2015. Generalized runs tests to detect randomness
in hedge funds returns. J. Bank. Financ. 50, 608615. http://dx.doi.org/10.1016/j.
jbankn.2014.07.011.
Hoppmann, U., Koenig, S., Tielkes, T., Matschke, G., 2002. A short-term strong wind
prediction model for railway application: design and verication. In: Proceedings
of the 3rd European-African Conference on Wind Engineering, J. Wind Eng. Ind.
Aerodyn. 90, 11271134. http://dx.doi.org/10.1016/S0167-6105(02)00226-X.
Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., Yen, N.-C., Tung, C.
C., Liu, H.H., 1998. The empirical mode decomposition and the Hilbert spectrum
for nonlinear and non-stationary time series analysis. Proc. R. Soc. Lond. Ser.
Math. Phys. Eng. Sci. 454, 903995. http://dx.doi.org/10.1098/rspa.1998.0193.
Kobayashi, N., Shimamura, M., 2003. Study of a strong warning system. JR East Tech.
Rev., 6165.
Liu, H., Tian, H., Chen, C., Li, Y., 2013a. An experimental investigation of two
Wavelet-MLP hybrid frameworks for wind speed prediction using GA and PSO
optimization. Int. J. Electr. Power Energy Syst. 52, 161173. http://dx.doi.org/
10.1016/j.ijepes.2013.03.034.
Liu, H., Tian, H., Li, Y., 2012. Comparison of two new ARIMA-ANN and ARIMAKalman hybrid methods for wind speed prediction. Appl. Energy 98, 415424.
http://dx.doi.org/10.1016/j.apenergy.2012.04.001.
Liu, H., Tian, H., Li, Y., 2015. Comparison of new hybrid FEEMD-MLP, FEEMD-ANFIS,
Wavelet Packet-MLP and Wavelet Packet-ANFIS for wind speed predictions. Energy
Convers. Manag. 89, 111. http://dx.doi.org/10.1016/j.enconman.2014.09.060.
Liu, H., Tian, H., Pan, D., Li, Y., 2013b. Forecasting models for wind speed using
wavelet, wavelet packet, time series and articial neural networks. Appl.
Energy 107, 191208. http://dx.doi.org/10.1016/j.apenergy.2013.02.002.
Liu, H., Tian, H.Q., Li, Y.-F., 2009. Short-term forecasting optimization algorithms for
wind speed along QinghaiTibet railway based on different intelligent modeling theories. J. Cent. South Univ. Technol. Engl. 16, 690696. http://dx.doi.org/
10.1007/s11771-009-0114-3.
Pan, D., Liu, H., Li, Y., Liang, H., 2008. Short-term forecast method for wind speed
along GolmudLhasa section of QinghaiTibet railway. China Railw. Sci. 29,
129133.
Pankratz, A., 1983. Forecasting with Univariate BoxJenkins Models: Concepts and
Cases. John Wiley & Sons.
Shust, W., Pasta, C., Milhon, L., Rosenberger, W., Dumont, B.J., Miller, J.L., 2010. An
approach to wind blow-over risk reduction at BNSF Railway. In: Proceedings of
the ASME Joint Rail Conference 2010, JRC2010, 5768. http://dx.doi.org/10.1115/
JRC2010-36019.
Strong wind topples Chinese train, 2007. BBC.
Train death toll rises in Japan, 2005. BBC.
Wei, W.W.-S., 1994. Time Series Analysis. Addison-Wesley publ.