1.9.3 Applications of the Kalman Filter

The Kalman filtering technique can be used conveniently in the analysis of certain time series, once we write the time series model in a state-space form. In the following we will mention a few applications of the Kalman filter and illustrate some of them by examples. For a detailed treatment of the Kalman filter see, for example, Harvey (1989), Chapter 3.
The simple local level model (9.5) is in fact already in the state-space form, with X_t = μ_t, G_t = 1, F_t = 1, c_t = 0, and d_t = 0. It is easy to see that the local linear trend model (9.4) can be written in the state-space form as

$$ X_t = \begin{pmatrix} \mu_t \\ \beta_t \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} X_{t-1} + \begin{pmatrix} \epsilon_t \\ \zeta_t \end{pmatrix}, \qquad Y_t = \begin{pmatrix} 1 & 0 \end{pmatrix} X_t + \eta_t. $$

Similarly, the basic structural model ((9.1) to (9.3)) when s = 4 is equivalent to

$$ X_t = \begin{pmatrix} \mu_t \\ \beta_t \\ \gamma_t \\ \gamma_{t-1} \\ \gamma_{t-2} \end{pmatrix} = \begin{pmatrix} 1 & 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & -1 & -1 & -1 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \end{pmatrix} X_{t-1} + \begin{pmatrix} \epsilon_t \\ \zeta_t \\ \omega_t \\ 0 \\ 0 \end{pmatrix}, \qquad Y_t = \begin{pmatrix} 1 & 0 & 1 & 0 & 0 \end{pmatrix} X_t + \eta_t. $$

ARMA models can also be cast into a state-space form, which in general is nonunique. For example, one particular representation of the AR(2) model Y_t = φ_1 Y_{t-1} + φ_2 Y_{t-2} + ε_t in state-space form is (for the state-space form of an ARMA(p, q) model see Harvey (1989), Chapter 3)

$$ X_t = \begin{pmatrix} Y_t \\ \phi_2 Y_{t-1} \end{pmatrix} = \begin{pmatrix} \phi_1 & 1 \\ \phi_2 & 0 \end{pmatrix} X_{t-1} + \begin{pmatrix} \epsilon_t \\ 0 \end{pmatrix}, \qquad Y_t = \begin{pmatrix} 1 & 0 \end{pmatrix} X_t. $$

An alternative representation uses the state vector X_t = (Y_t, Y_{t-1})', with transition matrix F = ((φ_1, φ_2), (1, 0)) and the same observation equation.

Initializing the Kalman Filter

The initial values X̂_{1|0} and P_{1|0} have to be given in order to start the Kalman filter. If the coefficients in (9.7) are all time independent and the eigenvalues of the transition matrix F are inside the unit circle, the state vector is stationary, and the initial value can be solved from X̂_{1|0} = E(X_t) = F E(X_t) + c, while P_{1|0} is obtained from

$$ P = F P F' + Q. $$

The solution to the above equation is

$$ \mathrm{vec}(P) = (I - F \otimes F)^{-1}\, \mathrm{vec}(Q), \qquad (9.14) $$

where vec(P) is the vector obtained by stacking the columns of P one below the other.
For example, the AR(2) model with φ_1 = 0.9, φ_2 = -0.5, and σ² = 1 is stationary, and the initial values are X̂_{1|0} = 0 and P_{1|0} = P, where P can be obtained using (9.14). The following Mathematica program solves for P. getP solves for the initial value P_{1|0} for a stationary state equation. Note that Q is always symmetric. This gives P_{1|0}, which is used to start the Kalman filter recursion.
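The same calculation is easy to reproduce outside Mathematica. The following Python fragment (an illustrative translation, not the book's getP program) solves the stationary covariance equation via (9.14) for this AR(2) model:

```python
import numpy as np

# Sketch: solve P = F P F' + Q through vec(P) = (I - F kron F)^{-1} vec(Q),
# for the AR(2) model with phi1 = 0.9, phi2 = -0.5, and noise variance 1.
phi1, phi2 = 0.9, -0.5
F = np.array([[phi1, 1.0], [phi2, 0.0]])   # one AR(2) state-space representation
Q = np.array([[1.0, 0.0], [0.0, 0.0]])     # noise enters the first component only

m = F.shape[0]
vecQ = Q.flatten(order="F")                # stack the columns of Q
vecP = np.linalg.solve(np.eye(m * m) - np.kron(F, F), vecQ)
P = vecP.reshape((m, m), order="F")        # this is the initial P(1|0)
```

Since the first state component is Y_t itself, P[0, 0] agrees with the stationary variance of the AR(2) process.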
Consider the case when the state equation is not stationary and we do not have any prior information about the distribution of the state vector. In general, if the dimension of X is m, we can use the first m values of Y (assuming that Y is a scalar) to calculate X̂_{m+1|m} and P_{m+1|m} and start the recursion from there. To obtain X̂_{m+1|m} and P_{m+1|m}, we can use the Kalman filter ((9.8) to (9.11)) with E(X_1 | I_0) = E(X_1) = 0 and the so-called diffuse prior P_{1|0} = kI, where I is an identity matrix and k is eventually set to infinity. For example, the local level model is not stationary. In the absence of any prior information, we use X̂_{1|0} = 0 and P_{1|0} = k, and X̂_{2|1} and P_{2|1} can be obtained as follows. This gives {X̂_{2|1}, P_{2|1}}. Note that the result depends on the first data point y1.
Now we take the limit k→∞. Note that we get a finite result, and the first two elements will be used as the initial values to start the recursion.
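The symbolic limit is easy to check by hand or with a computer algebra system. Here is a sketch of the same calculation in Python with SymPy (illustrative, not the book's code): one filter step for the local level model starting from the diffuse prior P(1|0) = k, followed by the limit k→∞.

```python
import sympy as sp

# Diffuse-prior limit for the local level model (F = G = 1):
# one update-predict cycle carried out symbolically in k, then k -> infinity.
k, y1, R, Q = sp.symbols("k y1 R Q", positive=True)

x10, P10 = 0, k                       # diffuse prior X(1|0) = 0, P(1|0) = k
K = P10 / (P10 + R)                   # Kalman gain for G = 1
x11 = x10 + K * (y1 - x10)            # filtered state X(1|1)
P11 = P10 - K * P10                   # filtered variance P(1|1)
x21, P21 = x11, P11 + Q               # one-step prediction X(2|1), P(2|1)

x_limit = sp.limit(x21, k, sp.oo)     # finite: the first data point y1
P_limit = sp.simplify(sp.limit(P21, k, sp.oo))   # finite: Q + R
```

The finite limit {y1, Q + R} is exactly the pair used to restart the recursion from the second data point.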
For the local linear trend model, X is a two-dimensional vector (m = 2), and the calculation of the initial values using a diffuse prior, in general, needs the first two data points y1 and y2. This gives the result of the first two filtering steps. Note that P_{2|1} is infinite, whereas P_{3|2} is finite.
Here {X̂_{3|2}, P_{3|2}} is extracted, and it can be used as the initial value to start the Kalman filter recursion.
Although in principle KalmanFilter can be used to calculate {X̂_{m+1|m}, P_{m+1|m}} using the first m data points as illustrated above, this is infeasible when m is large because of the symbolic parameter k involved in the calculation. Alternatively, we can obtain the initial values by writing X_m in terms of the first m data points of Y and solving for {X̂_{m+1|m}, P_{m+1|m}}. The following is a program that does this. Note that the program is only for the case where Y is a scalar and the coefficient matrices are independent of time. For other ways of calculating starting values, see Harvey (1989), Chapter 3. This yields {X̂_{m+1|m}, P_{m+1|m}} given the known matrices and vectors of a state model and the first m values of the data. Note that the first argument contains the first m values of the data, F and Q are m×m matrices, and G is a 1×m matrix. When m = 1, every input argument is a scalar. For example, this gives the same initial values for the local linear trend model as above but is much faster.
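When neither symbolic manipulation nor the algebraic solution is available, a purely numerical alternative (an approximation, not the book's getInit method) is to run the ordinary Kalman filter with a large but finite k in place of the diffuse prior. The Python sketch below does this for the local linear trend model; the data values y1 = 1.2, y2 = 0.7 are made up for illustration.

```python
import numpy as np

# Sketch: approximate the diffuse prior with k = 1e8 instead of carrying a
# symbolic k through the recursion. After m = 2 observations the result is
# numerically close to the exact diffuse-prior X(3|2), P(3|2).
def kalman_step(x, P, y, F, G, Q, R):
    """One update-predict cycle of the standard Kalman filter."""
    S = G @ P @ G.T + R                     # innovation variance
    K = P @ G.T @ np.linalg.inv(S)          # Kalman gain
    x = x + K @ (y - G @ x)                 # filtered state
    P = P - K @ G @ P                       # filtered covariance
    return F @ x, F @ P @ F.T + Q           # one-step prediction

F = np.array([[1.0, 1.0], [0.0, 1.0]])      # local linear trend transition
G = np.array([[1.0, 0.0]])
Q = np.eye(2)
R = np.array([[0.5]])

x = np.zeros((2, 1))
P = 1e8 * np.eye(2)                         # "diffuse" prior with large k
for y in [np.array([[1.2]]), np.array([[0.7]])]:   # first m = 2 data points
    x, P = kalman_step(x, P, y, F, G, Q, R)
```

In the exact k→∞ limit the predicted level is 2*y2 - y1 and the predicted slope is y2 - y1, which the large-k run reproduces to high accuracy.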
Kalman Filtering and Kalman Smoothing

Often, the state variable X represents some quantity we are interested in knowing. The Kalman filter enables us to estimate the state variable X from the observations of Y. For example, in the local level model, μ_t is the trend, and the estimate of μ_t given y1, y2, ..., yt is given by the Kalman filter (9.8) to (9.11). Example 9.1 To illustrate the use of the Kalman filter, we generate a series according to the local level model (9.5) with R = Eη_t² = 0.5 and Q = Eε_t² = 1. This generates a time series of length 100 according to the local level model (9.5).
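The simulation step is straightforward; a minimal Python sketch (illustrative, not the book's Mathematica code) of generating such a series is:

```python
import numpy as np

# Simulate the local level model: mu_t = mu_{t-1} + eps_t, y_t = mu_t + eta_t,
# with state noise variance Q = 1 and observation noise variance R = 0.5.
rng = np.random.default_rng(0)
n = 100
mu = np.cumsum(rng.normal(0.0, 1.0, n))          # random-walk level, Q = 1
y = mu + rng.normal(0.0, np.sqrt(0.5), n)        # noisy observations, R = 0.5
```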
As illustrated earlier, these are X̂_{2|1} and P_{2|1}, and they are used as the starting values for KalmanFilter.
Using getInit we get the same result.
kf contains the result of the Kalman filtering. (Note that we have suppressed the output.) Note that y1 is used to obtain the initial values, so the data now starts from y2. From the Kalman filter result kf, we extract X̂_{t|t}, the estimated values of μ_t. Here the estimate of μ_t given {y1, y2, ..., yt} is plotted as a function of t.
X̂_{t|t} is the estimate of X_t based on the information up to time t. However, if we know Y up to t = T, we can use the information up to T to improve our estimate of X_t. This is called Kalman smoothing. This gives the Kalman smoothing result. The smoothed values are plotted.
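For the scalar local level model, the filtering and smoothing passes can be sketched together in a few lines of Python (an illustrative fixed-interval Rauch-Tung-Striebel smoother, standing in for the package's smoothing function, under the assumption F = G = 1):

```python
import numpy as np

# Forward Kalman filter storing x(t|t), P(t|t), then the backward
# fixed-interval smoothing recursion producing x(t|T), P(t|T).
def local_level_smooth(y, Q, R, x0, P0):
    n = len(y)
    xf = np.empty(n); Pf = np.empty(n)          # filtered moments x(t|t), P(t|t)
    x, P = x0, P0                               # starting values x(1|0), P(1|0)
    for t in range(n):
        K = P / (P + R)                         # Kalman gain
        xf[t] = x + K * (y[t] - x)
        Pf[t] = P - K * P
        x, P = xf[t], Pf[t] + Q                 # one-step prediction (F = 1)
    xs, Ps = xf.copy(), Pf.copy()               # x(T|T) starts the backward pass
    for t in range(n - 2, -1, -1):
        J = Pf[t] / (Pf[t] + Q)                 # smoother gain; P(t+1|t) = Pf[t] + Q
        xs[t] = xf[t] + J * (xs[t + 1] - xf[t])
        Ps[t] = Pf[t] + J * J * (Ps[t + 1] - (Pf[t] + Q))
    return xf, Pf, xs, Ps

# usage: simulate a local level series, start from the diffuse-limit values
rng = np.random.default_rng(0)
mu = np.cumsum(rng.normal(0.0, 1.0, 100))
y = mu + rng.normal(0.0, np.sqrt(0.5), 100)
xf, Pf, xs, Ps = local_level_smooth(y[1:], Q=1.0, R=0.5, x0=y[0], P0=1.5)
```

The smoothed mean square errors never exceed the filtered ones, since each smoothed estimate uses the whole sample.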
Example 9.2 As a second example, we use the Kalman filter to estimate the trend and the seasonal component in a particular case of the basic structural model given in (9.1) to (9.3). This generates a time series of length 100 from the basic structural model. The series is plotted here.
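A Python sketch of the simulation (illustrative only: the noise variances below are made-up choices, not the book's values) uses the m = 5 state-space form of the s = 4 basic structural model:

```python
import numpy as np

# Simulate the basic structural model in state-space form: the state holds
# (trend, slope, seasonal, and two lagged seasonal components).
rng = np.random.default_rng(1)
F = np.block([
    [np.array([[1.0, 1.0], [0.0, 1.0]]), np.zeros((2, 3))],
    [np.zeros((3, 2)), np.array([[-1.0, -1.0, -1.0],
                                 [ 1.0,  0.0,  0.0],
                                 [ 0.0,  1.0,  0.0]])],
])
G = np.array([1.0, 0.0, 1.0, 0.0, 0.0])
q = np.array([1.0, 0.1, 0.5, 0.0, 0.0])   # diagonal state noise variances (illustrative)
R = 0.5                                    # observation noise variance (illustrative)

n = 100
x = np.zeros(5)
y = np.empty(n)
for t in range(n):
    x = F @ x + rng.normal(0.0, np.sqrt(q))   # state equation
    y[t] = G @ x + rng.normal(0.0, np.sqrt(R))  # observation equation
```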
Various matrices that are needed are defined. Since m=5, the first five points of data y are needed to calculate the initial values.
This gives the Kalman filtering result. Note that the numerical value of init is used to speed up the calculation. The first component of X̂_{t|t} is the estimated trend μ̂_t. The trend is extracted and plotted.
Similarly, the third component of X̂_{t|t} is the estimated seasonal component γ̂_t. Its plot is shown here.
This gives the smoothed estimates of the state variables and their mean square errors. Here is the plot of the smoothed trend.
This is the plot of the smoothed seasonal component.
This gives the predicted values of the state variable for the next 15 points. We plot the smoothed trend together with the next 15 predicted values of the trend.
Here the smoothed seasonal component is plotted together with the next 15 predicted values.
Parameter Estimation and Prediction

The likelihood function of a state-space time series can be calculated easily using the Kalman filter technique. The joint density of {Y1, Y2, ..., YT} is

$$ L = \prod_{t=1}^{T} p(Y_t \mid I_{t-1}), $$

where the conditional distribution of Y_t given I_{t-1} is Gaussian with mean Ŷ_{t|t-1} = G_t X̂_{t|t-1} + d_t and variance S_t = G_t P_{t|t-1} G_t' + R_t. The log likelihood is given by

$$ \log L = -\frac{1}{2} \sum_{t=1}^{T} \left( \log 2\pi + \log \det S_t + e_t' S_t^{-1} e_t \right), $$

where e_t = Y_t - Ŷ_{t|t-1} is the prediction error. Note that if the first m values of data are used to calculate the starting values, the lower limit of the summations is t = m+1 and the corresponding log L is the conditional log likelihood, conditional on y1, y2, ..., ym being fixed.
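This prediction error decomposition is easy to implement alongside the filter itself. The following Python sketch (illustrative, not the package's LogLikelihood) accumulates the log likelihood for a scalar observation series:

```python
import numpy as np

# Sketch: Kalman-filter log likelihood via the prediction error decomposition.
# y: scalar observations; F, Q: m x m; G: 1 x m; R: scalar;
# x: (m, 1) initial X(1|0); P: (m, m) initial P(1|0).
def log_likelihood(y, F, G, Q, R, x, P):
    ll = 0.0
    for yt in y:
        e = yt - (G @ x).item()            # innovation e_t
        S = (G @ P @ G.T).item() + R       # innovation variance S_t
        ll -= 0.5 * (np.log(2 * np.pi * S) + e * e / S)
        K = (P @ G.T) / S                  # Kalman gain
        x = F @ (x + K * e)                # update, then predict
        P = F @ (P - K @ (G @ P)) @ F.T + Q
    return ll

# usage: local level model (F = G = Q = 1, R = 0.5) on a few points
ll = log_likelihood([0.4, -0.1, 0.3], np.eye(1), np.eye(1), np.eye(1),
                    0.5, np.zeros((1, 1)), np.eye(1))
```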
If you wish to get the logarithm of the likelihood function of a state-space time series, you can use the function LogLikelihood[data, init, F, G, Q, R, c, d]. Note that it has the same input arguments as those of KalmanFilter. When the first m points of the series are used to calculate the initial values init, data should start from y_{m+1}; when any one of F, G, Q, R, c, d is time dependent, the above arguments F, G, Q, R, c, d should be replaced by {F_{m+2}, F_{m+3}, ..., F_{T+1}}, {G_{m+1}, G_{m+2}, ..., G_T}, etc. Again, if c=0 and d=0, the last two arguments can be omitted. Example 9.3 We again look at the local level model (see Example 9.1). This time we try to estimate the variance parameters of the model from the given series. We generate a time series of length 300 according to the local level model (9.5). This gives the initial values. Note that the parameters to be estimated should be in symbolic form.
To get the maximum likelihood estimate of the parameters, we need to maximize the likelihood function. This is done using the built-in Mathematica function FindMinimum. Note that the function to be minimized is the negative of the log likelihood function. Also, we define a special function that evaluates only for numerical input, to prevent FindMinimum from attempting symbolic preprocessing that could take a long time.
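The same optimization can be sketched in Python with a numerical minimizer playing the role of FindMinimum (illustrative only; the crude large-P start and the log-scale parameterization are assumptions made here, not the book's setup):

```python
import numpy as np
from scipy.optimize import minimize

# Sketch: estimate the local level variances (Q, R) by minimizing the
# negative Kalman-filter log likelihood; parameters live on the log scale
# to keep them positive.
def neg_loglik(theta, y):
    Q, R = np.exp(theta)
    ll, x, P = 0.0, 0.0, 1e6            # crude near-diffuse start
    for yt in y:
        S = P + R                        # innovation variance (F = G = 1)
        e = yt - x                       # innovation
        ll -= 0.5 * (np.log(2 * np.pi * S) + e * e / S)
        K = P / S
        x, P = x + K * e, P - K * P + Q  # update, then predict
    return -ll

rng = np.random.default_rng(2)
mu = np.cumsum(rng.normal(0.0, 1.0, 300))   # true Q = 1
y = mu + rng.normal(0.0, np.sqrt(0.5), 300)  # true R = 0.5
res = minimize(neg_loglik, x0=np.log([0.5, 0.5]), args=(y,), method="Nelder-Mead")
Q_hat, R_hat = np.exp(res.x)
```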
Example 9.4 Here we give another example using the local linear trend model. We generate a series from the local linear trend model. The initial values depend on the variance parameters of the model (the two state noise variances and the observation noise variance), which are to be estimated.
This gives the estimate of the parameters.
Having obtained the estimated parameters, we can now use them to predict the future trend by using KalmanPredictor. First, we calculate {X̂_{t+1|t}, P_{t+1|t}} from KalmanFilter. We substitute the estimated parameters.
These are the values of {X̂_{t+1|t}, P_{t+1|t}}.
This gives the next 20 predicted values of X and their mean square errors. The predicted trend is given by the first component of X.
This shows the predicted trend along with the given series.
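The prediction recursion itself is simply the state equation iterated forward with no new data. The Python sketch below (standing in for KalmanPredictor, with made-up starting values) shows the idea for the local linear trend model:

```python
import numpy as np

# Sketch of h-step-ahead prediction: X(t+h|t) = F X(t+h-1|t) and
# P(t+h|t) = F P(t+h-1|t) F' + Q, starting from the last filtered pair.
def predict(x, P, F, Q, h):
    """Return lists of the next h predicted states and their MSE matrices."""
    xs, Ps = [], []
    for _ in range(h):
        x = F @ x
        P = F @ P @ F.T + Q
        xs.append(x)
        Ps.append(P)
    return xs, Ps

# local linear trend transition; starting values below are illustrative
F = np.array([[1.0, 1.0], [0.0, 1.0]])
Q = np.diag([0.8, 0.1])
xs, Ps = predict(np.array([2.0, 0.3]), np.eye(2), F, Q, 20)
```

For this model the predicted level grows linearly with the predicted slope, while the mean square error grows with the horizon.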
Example 9.5 As a final example, we look at the AR(2) model and show that the Kalman filter approach gives the same estimate of the parameters as the methods introduced in the previous sections. This generates an AR(2) series of length 50. This gives the initial P_{1|0}.
FindMinimum finds the parameter values that maximize the likelihood function.
We can get the same result using MLEstimate. Note that for the one-dimensional ARMA model, the concentrated likelihood is calculated. Therefore, the variance q should not be included as a search parameter.
Note that the value of the log likelihood is different from that obtained using the Kalman filter. This is because the former calculates the concentrated likelihood whereas the latter calculates the full likelihood (apart from a constant).
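The relation between the two likelihoods can be made explicit by concentrating out the innovation variance. Writing S_t = σ² s_t, a standard derivation (cf. Harvey 1989) gives:

```latex
\log L(\phi,\sigma^2) = -\frac{T}{2}\log 2\pi\sigma^2
  - \frac{1}{2}\sum_{t=1}^{T}\log s_t
  - \frac{1}{2\sigma^2}\sum_{t=1}^{T}\frac{e_t^2}{s_t},
\qquad
\frac{\partial \log L}{\partial \sigma^2} = 0
\;\Rightarrow\;
\hat\sigma^2 = \frac{1}{T}\sum_{t=1}^{T}\frac{e_t^2}{s_t},
\qquad
\log L_c(\phi) = -\frac{T}{2}\bigl(\log 2\pi\hat\sigma^2 + 1\bigr)
  - \frac{1}{2}\sum_{t=1}^{T}\log s_t .
```

So the concentrated log likelihood L_c depends only on the ARMA coefficients, which is why the variance need not be included as a search parameter, and the two reported likelihood values differ only by terms that do not affect the location of the maximum.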
On the other hand, had we used the first two data points to calculate the initial values, we would have effectively obtained the conditional maximum likelihood estimate, conditional on the first two data points being fixed. The initial values obtained this way correspond to using a diffuse prior.
This gives the estimated parameters.
We can get the same result much faster using ConditionalMLEstimate.