Time Series Notes (4) - Model specification
Model-building strategy
Box-Jenkins method for model-building strategy
graph TD
A([Start with a time series realization])
--> B[Identify a preliminary time series model]
--> C[Estimation of the model parameters]
--> D{Is the model adequate?}
D --> |No| B
D --> |Yes| E([Stop])
Three steps of building models
- Model specification. The model is tentative at this moment and it is from the observed part of the time series. The model also should be chosen under the principle of parsimony.
- Model fitting.
- Model diagnostics. If inadequacies found, another model ought be considered until an acceptable model is found.
Autocorrelation function (ACF)
For a
For a
Sample autocorrelation function
For a sequence of observations
where
Bartlett’s approximation
The 95% approximate confidence interval of
The above confidence interval acts as the accepting region for the test with
In real applications, the original condition of the Bartlett’s approximation is usually ignored. The squared part are usually ignored. And the value of 1.96, the corresponding 95% percentile of the standard normality, is replaced by 2.0 in R.
Partial autocorrelation function (PACF)
Definition of partial autocorrelation
The partial autocorrelation (PACF) at lag
Or, in other words, the equations are equivalent:
where
In particular:
: : we have
Thus,
The PACF is to measure the direct impact from
For
- Notice that the partial autocorrelations of an
model are never zero. Except the special case of . They decay exponentially to zero, rather like the autocorrelation for an series. - The behaviors of PACF for ARMA processes is quite similar to that for MA processes.
Sample PACF
For an observed time series, an obvious method is to estimate the
Inverse autocorrelation function (IACF) and the sample IACF
- The sample IACF plays much the same role in ARIMA modeling as the sample PACF, but it generally indicates subset and seasonal autoregressive models better than the sample PACF.
- Additionally, the sample IACF can be useful for detecting over-differencing.
- The IACF is defined in the frequency domain. It is not often discussed in textbooks.
Extended autocorrelation function (EACF) and the sample EACF
- It can be used to identify the values of
and for an . - It is not a very popular method.
- There are other similar methods in the literature such as the corner method, the smallest canonical correlation (SCAN) method, etc..
Nonstationarity
How to determine the value of in models
- To determine the order of
is equivalent to determine whether or not a sequence of values are from a stationary time series. - The definition of the sample ACF implicitly assumes stationarity.
- However, for a nonstationary series, the sample ACF typically decays slowly as the lags increase.
- If the sample ACF decays slowly, we may say the sequence is nonstationary. Otherwise, it is stationary.
Overdifferencing
If a time series
To avoid overdifferencing, we should look carefully at each difference in succession and always keep in mind the principle of parsimony – models should be as simple as possible.
Other specification methods
Akaike information criterion (AIC)
This criterion is to select the model that minimizes
where
Bayesian information criterion (BIC)
This criterion is to select the model that minimizes
The AIC is more popular in real applications although the BIC has a well justified theoretical basis.