Eigenvalue Perturbation Theorem: Hoffman-Wielandt Inequality

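Suppose $A$ is a symmetric matrix and $A+E$ is a perturbed version of it. How far can the eigenvalues of $A+E$ be from those of $A$?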

The Hoffman-Wielandt inequality answers this question. Roughly, it tells us that the eigenvalues of a symmetric (more generally, normal) matrix are strongly stable under small perturbations of its entries.

Hoffman-Wielandt Inequality

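Let $A,E$ be $n\times n$ symmetric matrices. Let $\lambda_1,\cdots,\lambda_n$ be the eigenvalues of $A$ and $\hat \lambda_1,\cdots,\hat\lambda_n$ be the eigenvalues of $A+E$. Then there exists a permutation $\sigma(\cdot)$ of $1,\cdots,n$ such that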

\sum_{i = 1}^{n} | {\hat{λ}}_{σ (i)} - λ_{i} |^{2} \leq ∥ E ∥_{2}^{2}

where $\| E \|_2^2$ denotes the squared Frobenius norm of $E$, that is, the sum of the squares of its entries.
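The statement can be checked numerically on a small example. The sketch below is only an illustration, not part of the original argument: the helper name `min_matching_cost`, the seed, the size `n = 4`, and the perturbation scale `0.1` are all arbitrary assumptions. It searches every permutation by brute force, so it is only practical for small `n`.

```python
import itertools

import numpy as np


def min_matching_cost(lam, lam_hat):
    """Smallest sum_i (lam_hat[sigma(i)] - lam[i])**2 over all permutations sigma."""
    n = len(lam)
    return min(
        sum((lam_hat[sigma[i]] - lam[i]) ** 2 for i in range(n))
        for sigma in itertools.permutations(range(n))
    )


rng = np.random.default_rng(0)  # fixed seed, arbitrary choice
n = 4                           # kept small: brute force visits n! permutations

# Random symmetric A and a small symmetric perturbation E.
A = rng.standard_normal((n, n))
A = (A + A.T) / 2
E = 0.1 * rng.standard_normal((n, n))
E = (E + E.T) / 2

lam = np.linalg.eigvalsh(A)          # eigenvalues of A
lam_hat = np.linalg.eigvalsh(A + E)  # eigenvalues of A + E

lhs = min_matching_cost(lam, lam_hat)  # best left-hand side over all sigma
rhs = np.linalg.norm(E, "fro") ** 2    # squared Frobenius norm of E
print(f"best matching cost = {lhs:.6f}, squared Frobenius norm of E = {rhs:.6f}")
```

If the inequality holds, the first printed number should not exceed the second.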

Proof

Let $\Lambda = \text{diag} (\lambda_1,\cdots,\lambda_n)$ and $\hat\Lambda = \text{diag} (\hat \lambda_1,\cdots,\hat\lambda_n)$. Let $V,W$ be $n\times n$ orthogonal matrices such that the eigendecompositions of $A$ and $A+E$ are

A = V Λ V^{⊤}, A + E = W \hat{Λ} W^{⊤}

Because the Frobenius norm is invariant under multiplication by an orthogonal matrix, we calculate

\begin{aligned} \| E \|_{2}^{2} & = \| (A + E) - A \|_{2}^{2} \\ & = \| W \hat{\Lambda} W^{\top} - V \Lambda V^{\top} \|_{2}^{2} \\ & = \| W^{\top} (W \hat{\Lambda} W^{\top} - V \Lambda V^{\top}) V \|_{2}^{2} \\ & = \| \hat{\Lambda} W^{\top} V - W^{\top} V \Lambda \|_{2}^{2} \\ & = \| \hat{\Lambda} U - U \Lambda \|_{2}^{2} \end{aligned}

where $U \equiv W^\top V$. Denote the entries of $U$ by $u_{ij}$.

Since $\hat\Lambda,\Lambda$ are diagonal matrices, we have

\begin{aligned} \hat{\Lambda} U & = \begin{bmatrix} \hat{\lambda}_{1} & & \\ & \ddots & \\ & & \hat{\lambda}_{n} \end{bmatrix} \begin{bmatrix} u_{11} & u_{12} & \dots & u_{1 n} \\ u_{21} & u_{22} & \dots & u_{2 n} \\ \vdots & \vdots & \ddots & \vdots \\ u_{n 1} & u_{n 2} & \dots & u_{n n} \end{bmatrix} = \begin{bmatrix} \hat{\lambda}_{1} u_{11} & \hat{\lambda}_{1} u_{12} & \dots & \hat{\lambda}_{1} u_{1 n} \\ \hat{\lambda}_{2} u_{21} & \hat{\lambda}_{2} u_{22} & \dots & \hat{\lambda}_{2} u_{2 n} \\ \vdots & \vdots & \ddots & \vdots \\ \hat{\lambda}_{n} u_{n 1} & \hat{\lambda}_{n} u_{n 2} & \dots & \hat{\lambda}_{n} u_{n n} \end{bmatrix} \\ U \Lambda & = \begin{bmatrix} u_{11} & u_{12} & \dots & u_{1 n} \\ u_{21} & u_{22} & \dots & u_{2 n} \\ \vdots & \vdots & \ddots & \vdots \\ u_{n 1} & u_{n 2} & \dots & u_{n n} \end{bmatrix} \begin{bmatrix} \lambda_{1} & & \\ & \ddots & \\ & & \lambda_{n} \end{bmatrix} = \begin{bmatrix} \lambda_{1} u_{11} & \lambda_{2} u_{12} & \dots & \lambda_{n} u_{1 n} \\ \lambda_{1} u_{21} & \lambda_{2} u_{22} & \dots & \lambda_{n} u_{2 n} \\ \vdots & \vdots & \ddots & \vdots \\ \lambda_{1} u_{n 1} & \lambda_{2} u_{n 2} & \dots & \lambda_{n} u_{n n} \end{bmatrix} \end{aligned}

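The $(i,j)$ entry of their difference is $(\hat\lambda_i - \lambda_j) u_{ij}$, so the squared Frobenius norm of the difference is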

\| E \|_{2}^{2} = \| \hat{\Lambda} U - U \Lambda \|_{2}^{2} = \sum_{i, j = 1}^{n} (\hat{\lambda}_{i} - \lambda_{j})^{2} u_{i j}^{2}
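This identity is easy to confirm numerically. The following is a minimal sketch under assumed test data (random symmetric matrices, seed and size chosen arbitrarily); the variable names mirror the symbols of the proof but are otherwise illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)  # fixed seed, arbitrary choice
n = 5

A = rng.standard_normal((n, n))
A = (A + A.T) / 2
E = 0.1 * rng.standard_normal((n, n))
E = (E + E.T) / 2

# Eigendecompositions A = V diag(lam) V^T and A + E = W diag(lam_hat) W^T.
lam, V = np.linalg.eigh(A)
lam_hat, W = np.linalg.eigh(A + E)

U = W.T @ V  # orthogonal, as a product of two orthogonal matrices

# diff[i, j] = lam_hat[i] - lam[j], so the double sum is an elementwise product.
diff = lam_hat[:, None] - lam[None, :]
double_sum = np.sum(diff**2 * U**2)

frob_sq = np.linalg.norm(E, "fro") ** 2  # squared Frobenius norm of E
print(double_sum, frob_sq)
```

The two printed values should agree up to floating-point rounding.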

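Because $U$ is orthogonal, the squared entries $u_{ij}^2$ are nonnegative and sum to one along every row and every column. Hence $\| E \|_2^2$ must be at least as large as the optimal value of the following problem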

\begin{aligned} \min \quad & \sum_{i, j = 1}^{n} (\hat{\lambda}_{i} - \lambda_{j})^{2} u_{i j}^{2} \\ \text{subject to} \quad & \sum_{i = 1}^{n} u_{i j}^{2} = \sum_{j = 1}^{n} u_{i j}^{2} = 1 \end{aligned}

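This is a linear program in the variables $u_{ij}^2$, whose feasible set is the set of doubly stochastic matrices. By Birkhoff's theorem its extreme points are the permutation matrices, so the minimum is attained when the $u_{ij}^2$ constitute a permutation matrix, and the sum can be written as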

\min \sum_{i, j = 1}^{n} (\hat{\lambda}_{i} - \lambda_{j})^{2} u_{i j}^{2} = \sum_{i = 1}^{n} (\hat{\lambda}_{\sigma (i)} - \lambda_{i})^{2}

where $\sigma(\cdot)$ is a permutation of $1,\cdots ,n$. This completes the proof.
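The minimization step of the proof can also be checked numerically. The sketch below is an illustration only (the names `S`, `C`, `best_perm_cost`, the seed, and the size are assumptions). It verifies that the matrix of squared entries $u_{ij}^2$ has unit row and column sums, and that its cost is never below the best cost over permutation matrices, found here by brute force.

```python
import itertools

import numpy as np

rng = np.random.default_rng(2)  # fixed seed, arbitrary choice
n = 4                           # small, because all n! permutations are enumerated

A = rng.standard_normal((n, n))
A = (A + A.T) / 2
E = 0.1 * rng.standard_normal((n, n))
E = (E + E.T) / 2

lam, V = np.linalg.eigh(A)
lam_hat, W = np.linalg.eigh(A + E)

S = (W.T @ V) ** 2                          # S[i, j] = u_ij^2
C = (lam_hat[:, None] - lam[None, :]) ** 2  # C[i, j] = (lam_hat_i - lam_j)^2

row_sums = S.sum(axis=1)
col_sums = S.sum(axis=0)

cost_at_S = np.sum(C * S)  # the double sum from the proof

# Cost of each permutation matrix: row sigma[j] is matched with column j.
best_perm_cost = min(
    sum(C[sigma[j], j] for j in range(n))
    for sigma in itertools.permutations(range(n))
)
print(row_sums, col_sums)
print(best_perm_cost, cost_at_S)
```

Both sum vectors should be all ones, and the best permutation cost should not exceed the cost at `S`.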

Corollary

For symmetric matrices, $\sigma(\cdot)$ can be taken to be the natural permutation in which both sets of eigenvalues are arranged in descending order. Stated more formally, we have

\sum_{i = 1}^{n} | {\hat{λ}}_{i} - λ_{i} |^{2} \leq ∥ E ∥_{2}^{2}

where $\hat\lambda_1, \cdots, \hat\lambda_n$ and $\lambda_1, \cdots, \lambda_n$ are both arranged in descending order.
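In practice the corollary is the convenient form, since it needs no search over permutations. A minimal sketch, assuming randomly generated symmetric test matrices; the helper name `sorted_gap_and_bound`, the seed, and the number of trials are illustrative choices.

```python
import numpy as np


def sorted_gap_and_bound(A, E):
    """Return (sum_i (lam_hat_i - lam_i)^2, squared Frobenius norm of E),
    with both sets of eigenvalues sorted in descending order."""
    lam = np.sort(np.linalg.eigvalsh(A))[::-1]
    lam_hat = np.sort(np.linalg.eigvalsh(A + E))[::-1]
    gap = np.sum((lam_hat - lam) ** 2)
    bound = np.linalg.norm(E, "fro") ** 2
    return gap, bound


rng = np.random.default_rng(3)  # fixed seed, arbitrary choice
results = []
for _ in range(100):
    n = int(rng.integers(2, 8))  # matrix size between 2 and 7
    A = rng.standard_normal((n, n))
    A = (A + A.T) / 2
    E = rng.standard_normal((n, n))
    E = (E + E.T) / 2
    results.append(sorted_gap_and_bound(A, E))

# Small tolerance to absorb floating-point rounding.
all_hold = all(gap <= bound + 1e-12 for gap, bound in results)
print(all_hold)
```

Note that the perturbation here is not small: the bound holds for any symmetric $E$, not only for small ones.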

Reference

Chapter 6.3 of this book:

Horn, R. A., & Johnson, C. R. (2012). Matrix Analysis. Cambridge University Press.