Using Normalized Bayesian Information Criterion (Bic) To Improve Box - Jenkins Model Building
Using Normalized Bayesian Information Criterion (Bic) To Improve Box - Jenkins Model Building
DOI: 10.5923/j.ajms.20140405.02
Department of Mathematics and Statistics University of Uyo, Akwa Ibom State, Nigeria
Abstract A Statistical Time Series model is fitted to the Chemical Viscosity Reading data. Comparison with the original
models fitted to the same data set by Box and Jenkins is made using the Normalized Bayesian Information Criterion (BIC)
and analysis and evaluation are presented. The analysis proved that the proposed model is superior to the Box and Jenkins
models.
Keywords ARIMA, Bayesian Information Criterion (BIC), Box-Jenkins Approach, Ljung-Box Statistic, Time Series
Analysis
3.1. Model Identification That is, the AR coefficient ∅1 was estimated to be 0.814
with standard error of 0.045 and a t-ratio of 18.024 while the
The graphical plot of the original series of the chemical
MA coefficient θ was estimated to be 0.972, with standard
process viscosity Reading: (Every Hour) is given in figure 1.
error of 0.020 and a t-ratio of 49.007.
It is observed that, the series exhibits non-stationary
For this model Q = 9.746. The 10% and 5% points of
behaviour indicated by its growth.
chi-square with 16 degree of freedom are 23.50 and 26.30
The sample autocorrelations of the original series in figure
2 failed to die out quickly at high lags, confirming the respectively. Therefore, since Q is not unduly large and the
non-stationarity behaviour of the series, which equally evidence does not contradict the hypothesis of White Noise
suggests that transformation is required to attain stationarity. behaviour in the residuals, the model is very adequate and
Consequently, the difference method of transformation was significantly appropriate.
adopted and the first difference ( d = 1 ) of the series was
3.3. Model Diagnostic Check
made. The plot of the stationary equivalent is given in figure
3 while the plots of the autocorrelation and partial It is concerned with testing the goodness of fit of the
autocorrelation functions of the differenced series are given model. From plots of the residual acf and pacf, it can be seen
in figure 4 and figure 5 respectively. that all points are randomly distributed and it can be
The autocorrelation and partial autocorrelation functions concluded that there is an irregular pattern which means that
of the differenced series indicated no need for further the model is adequate. Also, the individual residual
differencing as they tend to be tailing off rapidly. They also autocorrelations are very small and are generally within
indicated no sign of seasonality since they do not repeat ±2√𝑛𝑛 significance bounds. Also the statistical significance
themselves at lags that are multiples of the number of periods of the model was checked. Five criteria: The Normalized
per season. Bayesian information criterion (BIC), the R – square, Root
Using figure 4 and figure 5, the differenced series will be Mean Square Error (RMSE), the Mean Absolute Percentage
denoted by 𝜔𝜔𝑡𝑡 for 𝑡𝑡 = 1,2, … ,309 where 𝜔𝜔𝑡𝑡 = ∇𝑧𝑧𝑡𝑡 . It is Error (MAPE) and the Ljung – Box Q statistic were used to
observed that both the autocorrelation and partial test for the adequacy and statistical appropriateness of the
autocorrelation functions of 𝜔𝜔𝑡𝑡 are characterized by model.
correlations that alternate in sign and which tend to damp out First, the Ljung – Box (Q) Statistic test was performed
with increasing lag. Consequently, a mix autoregressive using SPSS 17 Expert Modeler (see table 1 and 2), the Ljung
moving average of order (1, 1, 1) was proposed since both – Box Statistic of the model is not significantly different
the autocorrelation and partial autocorrelation functions of from zero, with a value of 9.746 for 16 d.f and associated
the 𝑤𝑤𝑡𝑡 seem to be tailing off. p-value of 0.880, thus failing to reject the null hypothesis of
Thus, using equation (1), the proposed model is an white noise. This indicates that the model has adequately
ARIMA (1,1,1). captured the correlation in the time series.
𝜙𝜙(𝐵𝐵)∇𝑧𝑧𝑡𝑡 = 𝜃𝜃(𝐵𝐵)𝑎𝑎𝑡𝑡 (12) Table 1. Model Parameters
(1 − 𝜙𝜙1 𝐵𝐵)𝜔𝜔𝑡𝑡 = (1 − 𝜃𝜃1 𝐵𝐵)𝑎𝑎𝑡𝑡 (13) Estimates S.E t-radio Sig.
AR Lag1 0.814 0.045 18.024 0.000
(1 − 𝜙𝜙1 𝐵𝐵)(𝑧𝑧𝑡𝑡 − 𝑧𝑧𝑡𝑡−1 ) = (1 − 𝜃𝜃1 𝐵𝐵)𝑎𝑎𝑡𝑡 (14)
Coefficients Difference 1 - - -
The plot of the autocorrelation and partial autocorrelation MA Lag1 0.972 0.020 49.007 0.000
functions of the residuals from the tentatively identified
ARIMA (1, 1, 1) model are given in figure 6. Moreover, the low value of RMSE indicates a good fit for
the model. Also, the high value of the R-Square and MAPE
3.2. Estimation of Parameters indicate a perfect prediction over the mean.
Having tentatively identified what appears to be a suitable Again, the model is adequate in the sense that the plots of
model, the next step is to obtain the least squares estimates of the residual acf and pacf in figure 6 show a random variation,
the parameters of the model. The SPSS 17 Expert Modeler thus, from the origin zero (0), the points below and above are
was used to fit the model to the data. The coefficient of both all uneven, hence the model fitted is adequate. The adequacy
the AR and the MA were not significantly different from and significant appropriateness of the model was confirmed
zero with values of 0.814 and 0.972 respectively. This model by exploring the Normalized Bayesian Information Criterion
enables us to write the model equation as: (BIC). In a class of statistically significant ARIMA (p,d,q)
models fitted to the series, the ARIMA (1,1,1) model had the
𝑧𝑧𝑡𝑡 = 0.814𝑧𝑧𝑡𝑡−1 + 0.972𝑎𝑎𝑡𝑡−1 + 𝑎𝑎𝑡𝑡 (15) least BIC value of −2.366.
Table 2. Model Statistics