Matrix Stability
The main stability matrix is A_eta = D_mu(I - P_pi + eta ee^T). The stability region for A_eta is always a finite union of open intervals. For L = D_mu(I - P_pi), nonzero eigenvalues have strictly positive real parts. A_eta is treated as a…
1 sources - 6 claims
The main stability matrix is A_eta = D_mu(I - P_pi + eta ee^T). The stability region for A_eta is always a finite union of open intervals. For L = D_mu(I - P_pi), nonzero eigenvalues have strictly positive real parts. A_eta is treated as a rank-one perturbation of a singular M-matrix. Discounted TD has an M-matrix structure that preserves positive stability under positive diagonal multiplication. Differential TD lacks a general M-matrix guarantee for global-clock stability.