59. Eliminating Cross Products#
59.1. Overview#
This lecture describes formulas for eliminating
cross products between states and control in linear-quadratic dynamic programming problems
covariances between state and measurement noises in Kalman filtering problems
For a linear-quadratic dynamic programming problem, the idea involves these steps
transform states and controls in a way that leads to an equivalent problem with no cross-products between transformed states and controls
solve the transformed problem using standard formulas for problems with no cross-products between states and controls presented in this lecture Linear Control: Foundations
transform the optimal decision rule for the altered problem into the optimal decision rule for the original problem with cross-products between states and controls
59.2. Undiscounted Dynamic Programming Problem#
Here is a nonstochastic undiscounted LQ dynamic programming with cross products between states and controls in the objective function.
The problem is defined by the 5-tuple of matrices \((A, B, R, Q, H)\) where \(R\) and \(Q\) are positive definite symmetric matrices and \(A \sim m \times m, B \sim m \times k, Q \sim k \times k, R \sim m \times m\) and \(H \sim k \times m\).
The problem is to choose \(\{x_{t+1}, u_t\}_{t=0}^\infty\) to maximize
subject to the linear constraints
where \(x_0\) is a given initial condition.
The solution to this undiscounted infinite-horizon problem is a time-invariant feedback rule
where
and \(P \sim m \times m \) is a positive definite solution of the algebraic matrix Riccati equation
It can be verified that an equivalent problem without cross-products between states and controls is defined by a 4-tuple of matrices : \((A^*, B, R^*, Q) \).
That the omitted matrix \(H=0\) indicates that there are no cross products between states and controls in the equivalent problem.
The matrices \((A^*, B, R^*, Q) \) defining the equivalent problem and the value function, policy function matrices \(P, F^*\) that solve it are related to the matrices \((A, B, R, Q, H)\) defining the original problem and the value function, policy function matrices \(P, F\) that solve the original problem by
59.3. Kalman Filter#
The duality that prevails between a linear-quadratic optimal control and a Kalman filtering problem means that there is an analogous transformation that allows us to transform a Kalman filtering problem with non-zero covariance matrix between between shocks to states and shocks to measurements to an equivalent Kalman filtering problem with zero covariance between shocks to states and measurments.
Let’s look at the appropriate transformations.
First, let’s recall the Kalman filter with covariance between noises to states and measurements.
The hidden Markov model is
where \(A \sim m \times m, B \sim m \times p \) and \(D \sim k \times m, F \sim k \times p \), and \(w_{t+1}\) is the time \(t+1\) component of a sequence of i.i.d. \(p \times 1\) normally distibuted random vectors with mean vector zero and covariance matrix equal to a \(p \times p\) identity matrix.
Thus, \(x_t\) is \(m \times 1\) and \(z_t\) is \(k \times 1\).
The Kalman filtering formulas are
Define tranformed matrices
59.3.1. Algorithm#
A consequence of formulas {eq}`eq:Kalman102} is that we can use the following algorithm to solve Kalman filtering problems that involve non zero covariances between state and signal noises.
First, compute \(\Sigma, K^*\) using the ordinary Kalman filtering formula with \(BF' = 0\), i.e., with zero covariance matrix between random shocks to states and random shocks to measurements.
That is, compute \(K^*\) and \(\Sigma\) that satisfy
The Kalman gain for the original problem with non-zero covariance between shocks to states and measurements is then
The state reconstruction covariance matrix \(\Sigma\) for the original problem equals the state reconstrution covariance matrix for the transformed problem.
59.4. Duality table#
Here is a handy table to remember how the Kalman filter and dynamic program are related.
Dynamic Program |
Kalman Filter |
---|---|
\(A\) |
\(A'\) |
\(B\) |
\(D'\) |
\(H\) |
\(FB'\) |
\(Q\) |
\(FF'\) |
\(R\) |
\(BB'\) |
\(F\) |
\(K'\) |
\(P\) |
\(\Sigma\) |