Professional Documents
Culture Documents
By
Marius Mihai & Chaoyun Liu
University of New Orleans
December 2013
Acknowledgments
This report was prepared by Marius M. Mihai and Chaoyun Liu, Graduate Research Assistants at the
University of New Orleans (UNO), as a final project for the Time Series Analysis class. Thanks and recognition is
given to Dr. Tumulesh Solanky, for his valuable lessons which allowed us to complete this analysis.
Introduction
Total non-farm payroll employment in the United States (US) is one of the most important leading economic
indicators reported by the Bureau of Labor Statistics (BLS) on a monthly basis. Monitoring this time series is
imperative when assessing the strength of the national economy. Forecasting such a series is also crucial,
because it provides a good overview of how the job market will perform in future time periods. This
information is very important not only to the general public, but also to all decision makers in a business
environment.
For simplicity, throughout the report we will use US non-farm payroll employment and US employment
interchangeably. The data selected was seasonally adjusted and it goes back to 1980. The analysis and forecast
will be done on a quarterly basis. A total of 135 data points were collected.
A. Preliminary Analysis
1. Time Series Plot
140000
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
01JAN2015
2000
1000
-1000
-2000
-3000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
Date
01JAN2005
01JAN2010
01JAN2015
B. Model Identification
140000
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
Date
01JAN2005
01JAN2010
01JAN2015
Parameter
MU
AR1,1
AR1,2
AR1,3
Estimate
216.85
1.12847
-0.26920
-0.05109
Pr > |t|
0.1032
<0.0001
0.0034
0.3617
Lag
0
1
2
11
AIC=1,691.964
SBC=1,703.014
Std. Deviation (Error)=332.32
The first model identified was an autoregressive
model that has the present value based on its
previous value, previous second value, and
previous eleventh value. The time series plot at the
bottom shows a good fit of the forecasted values.
Although the predicted values are a little lower
when compared to the actuals, this model
nonetheless is a strong candidate in our selection
process.
Other statistics are also confirming the validity of
this model. Out of all parameters, two are very
significant- the one at lag 1 and the one at lag 2.
The residuals are not showing any time series
pattern.
To Lag
6
12
18
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
PLOT
US_employment
01JAN2015
Parameter
MU
AR1,1
AR1,2
Estimate
165.299
0.989
-0.16310
Pr > |t|
0.2232
<0.0001
0.0036
Lag
0
1
4
AIC=1,690.613
SBC=1,698.975
Std. Deviation (Error)=665.271
The second model identified was an autoregressive
model that has the current value based on its
previous one, and previous fourth value. Again, the
time series plot at the bottom shows a relatively
good fit of the forecasted values. However, towards
the end of the forecast, the predicted values are
leveling off while the actual values continue on an
upward trend. Compared to the previous model, this
one has a slightly lower AIC and SBC, and both of the
autoregressive terms are significant. However, the
standard deviation of the errors is a lot higher when
compared to the previous model. Thus, the
predicted values for this second time series structure
are not as accurate. Same as above, the residuals are
not showing any time series pattern.
To Lag
6
12
18
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
PLOT
01JAN2015
Parameter
MU
AR1,1
AR1,2
Estimate
218.67
0.90776
-0.09385
Pr > |t|
0.1077
<0.0001
0.1007
Lag
0
1
7
AIC=1,697.128
SBC=1,705.49
Std. Deviation (Error)=181.33
The third model attempted was an autoregressive
model that has the current value based on its
previous one, and previous seventh value. Looking
at the time series plot on the bottom of the page,
the forecasted vales are following the actual
numbers very accurately. As a result, the standard
deviation of the errors is significantly lower than in
the previous models. Although AIC and SBC are
slightly higher, this AR model represents the best
fit so far. Both terms can be considered significant
at 90% confidence (the AR term of lag 7 is
marginally significant). The autocorrelation check
for residuals indicates that there might be some
significant correlations up to lag 6, however after
that the residuals are not showing any time series
pattern.
To Lag
6
12
18
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
PLOT
US_employment
01JAN2015
Estimate
188.78246
1.12788
-0.25276
-0.04967
Pr > |t|
0.1807
<.0001
0.0075
0.3824
Lag
0
1
2
7
AIC=1,691.721
SBC=1,702.871
Std. Deviation (Error)=274.107
This model is an autoregressive model with the
present value based on the previous value, the
previous second value, and previous seventh value.
The forecast made by this model fits the data very
good at first, but then the forecast line goes below
the actual plot as is shown in the graph - the model
predicts a slower increase in employment than it
actually is. The conditional least squares estimation
shows that the p-values of the first and second
parameter are significant. The AIC and SBC of this
model are 1,692 and 1,703, and the standard
deviation of the error is 274.
To Lag
6
12
18
140000
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
PLOT
US_employment
01JAN2015
Parameter
MU
AR1,1
AR1,2
AR1,3
Estimate
205.89901
0.93087
-0.08619
-0.03007
Pr > |t|
0.1332
<.0001
0.2718
0.7029
Lag
0
1
5
7
AIC=1,697.878
SBC=1709.028
Std. Deviation (Error)=281.2292
This model is an autoregressive model with the
present value based on the previous value, the
previous fifth value, and previous seventh value.
The forecast made by this model provides a good
fit, same as in previous models. However, the
forecast line goes slightly higher over the actuals,
and then it moves back down. The conditional least
squares estimation shows that the p-value of the
first parameter is significant. The AIC and SBC of
this model are 1,698 and 1,709, and the standard
deviation of the error is 281.
To Lag
6
12
18
140000
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
PLOT
US_employment
01JAN2015
6. AR non-zero parameters at 1, 3, 7, 11
Parameter
MU
AR1,1
AR1,2
AR1,3
AR1,4
Estimate
220.25644
1.01007
-0.16063
-0.02278
-0.03926
Pr > |t|
0.0805
<.0001
0.0196
0.7302
0.5255
Lag
0
1
3
7
11
AIC=1,695.23
SBC=1709.168
Std. Deviation (Error)=277.0325
This model is an autoregressive model with the
present value based on the previous value, the
previous third value, the previous seventh value,
and the previous eleventh value. The forecast
made by this model fits the data very well. The
conditional least squares estimation shows that the
p-values of the first and second parameter are
significant. The AIC and SBC of this model are 1,695
and 1,709, and the standard deviation of the error
is 277.
To Lag
6
12
18
24
130000
120000
110000
100000
90000
80000
01JAN1980
01JAN1985
01JAN1990
01JAN1995
01JAN2000
01JAN2005
01JAN2010
Date
PLOT
US_employment
01JAN2015
C. Model Selection
AIC, SBC and St. Deviation were plotted
and compared in order to make the best
model selection.
As it can be seen, AICs and SBCs were
not very different - that made it hard to
make a decision based solely on those
two criteria. Thus, more weight was
given to the standard deviation value.
AR 1, 7 was selected over the other
models because it had the minimum
deviation from the actual values.
AR 1, 7 was applied to all data points
and the following results were obtained:
Parameter
MU
AR1,1
AR1,2
Estimate
213.22
0.89
-0.07
AIC
800
St. Dev.
600
400
332
274
281
277
AR 1, 2, 7
AR 1, 5, 7
AR 1, 3, 7,
11
181
200
0
AR 1,2,11
AR 1,4
AR 1,7
To Lag
6
12
18
SBC
665
Pr > |t|
0.1049
<.0001
0.1211
Lag
0
1
7
140000
130000
120000
110000
100000
90000
80000
01JAN1980 01JAN1985 01JAN1990 01JAN1995 01JAN2000 01JAN2005 01JAN2010 01JAN2015 01JAN2020
Date
PLOT
US_employment
Upper 95% Confidence Limit
Above, there is the forecast plot for the next 8 quarters along with its 95% confidence limits. According to our
model, US non-farm employment will continue to increase over the next two years. In addition, the 95%
confidence limits were assumed to represent various scenarios in the US economy. Thus, the upper 95% limit
represents the number of non-farm jobs amid a potential boom in the national economy, while the lower 95%
limit shows the number of jobs in case another recession happens. Otherwise, with everything else assumed
to be equal, in the next two years US non-farm employment should follow the optimal forecast represented
by the red line in our plot above. The table below presents the actual forecast, upper 95, and lower 95 values
generated by our model.
Date
2013 Q4
2014 Q1
2014 Q2
2014 Q3
2014 Q4
2015 Q1
2015 Q2
2015 Q3
Forecast(million)
Optimal conditions
136.6
136.9
137.3
137.6
137.9
138.1
138.3
138.5
Lower 95(million)
Economic Recession
136.0
135.7
135.4
134.9
134.3
133.8
133.1
132.5
Upper 95(million)
Economic Boom
137.1
138.1
139.2
140.3
141.4
142.4
143.5
144.5
E. Data Sources
Our primary data source was the Bureau of Labor Statistics. Below, we provided the link and the Series Id
where this data can be downloaded from.
http://data.bls.gov/cgi-bin/srgate
Series ID: CES0000000001