# 51. Eliminating Cross Products¶

## 51.1. Overview¶

This lecture describes formulas for eliminating

cross products between states and control in linear-quadratic dynamic programming problems

covariances between state and measurement noises in Kalman filtering problems

For a linear-quadratic dynamic programming problem, the idea involves these steps

transform states and controls in a way that leads to an equivalent problem with no cross-products between transformed states and controls

solve the transformed problem using standard formulas for problems with no cross-products between states and controls presented in this lecture Linear Control: Foundations

transform the optimal decision rule for the altered problem into the optimal decision rule for the original problem with cross-products between states and controls

## 51.2. Undiscounted Dynamic Programming Problem¶

Here is a nonstochastic undiscounted LQ dynamic programming with cross products between states and controls in the objective function.

The problem is defined by the 5-tuple of matrices \((A, B, R, Q, H)\) where \(R\) and \(Q\) are positive definite symmetric matrices and \(A \sim m \times m, B \sim m \times k, Q \sim k \times k, R \sim m \times m\) and \(H \sim k \times m\).

The problem is to choose \(\{x_{t+1}, u_t\}_{t=0}^\infty\) to maximize

subject to the linear constraints

where \(x_0\) is a given initial condition.

The solution to this undiscounted infinite-horizon problem is a time-invariant feedback rule

where

and \(P \sim m \times m \) is a positive definite solution of the algebraic matrix Riccati equation

It can be verified that an **equivalent** problem without cross-products between states and controls
is defined by a 4-tuple of matrices : \((A^*, B, R^*, Q) \).

That the omitted matrix \(H=0\) indicates that there are no cross products between states and controls in the equivalent problem.

The matrices \((A^*, B, R^*, Q) \) defining the equivalent problem and the value function, policy function matrices \(P, F^*\) that solve it are related to the matrices \((A, B, R, Q, H)\) defining the original problem and the value function, policy function matrices \(P, F\) that solve the original problem by

## 51.3. Kalman Filter¶

The **duality** that prevails between a linear-quadratic optimal control and a Kalman filtering problem means that there is an analogous transformation that allows us to transform a Kalman filtering problem
with non-zero covariance matrix between between shocks to states and shocks to measurements to an equivalent Kalman filtering problem with zero covariance between shocks to states and measurments.

Let’s look at the appropriate transformations.

First, let’s recall the Kalman filter with covariance between noises to states and measurements.

The hidden Markov model is

where \(A \sim m \times m, B \sim m \times p \) and \(D \sim k \times m, F \sim k \times p \), and \(w_{t+1}\) is the time \(t+1\) component of a sequence of i.i.d. \(p \times 1\) normally distibuted random vectors with mean vector zero and covariance matrix equal to a \(p \times p\) identity matrix.

Thus, \(x_t\) is \(m \times 1\) and \(z_t\) is \(k \times 1\).

The Kalman filtering formulas are

Define tranformed matrices

### 51.3.1. Algorithm¶

A consequence of formulas {eq}`eq:Kalman102} is that we can use the following algorithm to solve Kalman filtering problems that involve non zero covariances between state and signal noises.

First, compute \(\Sigma, K^*\) using the ordinary Kalman filtering formula with \(BF' = 0\), i.e., with zero covariance matrix between random shocks to states and random shocks to measurements.

That is, compute \(K^*\) and \(\Sigma\) that satisfy

The Kalman gain for the original problem **with non-zero covariance** between shocks to states and measurements is then

The state reconstruction covariance matrix \(\Sigma\) for the original problem equals the state reconstrution covariance matrix for the transformed problem.

## 51.4. Duality table¶

Here is a handy table to remember how the Kalman filter and dynamic program are related.

Dynamic Program |
Kalman Filter |
---|---|

\(A\) |
\(A'\) |

\(B\) |
\(D'\) |

\(H\) |
\(FB'\) |

\(Q\) |
\(FF'\) |

\(R\) |
\(BB'\) |

\(F\) |
\(K'\) |

\(P\) |
\(\Sigma\) |