Eigenvalue Perturbation Theorem: Hoffman-Wielandt Inequality

We are interested in the question: say matrix A is perturbed by a small matrix E, how much do the eigenvalues of the perturbed matrix A+E deviate from the eigenvalues of A?

The Hoffman-Wielandt inequality answers this question. Roughly, it tells us that the eigenvalues of a symmetric (actually only need to be normal) matrix enjoy a strong stability under small perturbations in its entries.

Hoffman-Wielandt Inequality

Let A,E be n×n real and symmetric square matrices. Let λ1,,λn be the eigenvalues of A in some given order; and let λ^1,,λ^n be the eigenvalues of A+E in some given order. There is a permutation σ() of the indexes 1,,n such that

i=1n|λ^σ(i)λi|2E22

Here E22 denotes the Frobenius norm of matrix E.

Proof

Let Λ=diag(λ1,,λn) and Λ^=diag(λ^1,,λ^n). Let V,W be n×n orthonormal matrices such that the diagonalization of A and A+E are

A=VΛV,A+E=WΛ^W

Because the Frobenius norm is invariant when multiplying an orthogonal matrix, we calculate

E22=(A+E)A=WΛ^WVΛV22=W(WΛ^WVΛV)V22=Λ^WVWVΛ22=Λ^UUΛ22

where we denote UWV. The entris of U are denoted as uij

Since Λ^,Λ are diagonal matrices, we have

Λ^U=[λ^1λ^n][u11u12u1nu21u22u2nun1un2unn]=[λ^1u11λ^1u12λ^1u1nλ^2u21λ^2u22λ^2u2nλ^nun1λ^nun2λ^nunn]UΛ=[u11u12u1nu21u22u2nun1un2unn][λ1λn]=[λ1u11λ2u12λnu1nλ1u21λ2u22λnu2nλ1un1λ2un2λnunn]

Thus their difference is

E22=Λ^UUΛ22=i,j=1n(λ^iλj)2uij2

Because U is the multiplication of two orthonormal matrices, the matrix whose entries are uij2 is doubly stochastic. A doubly stochastic matrix is a square matrix of nonnegative entries, each of whose rows and columns sums to 1. Therefore, E22 must be larger than the following quantity

mini,j=1n(λ^iλj)2uij2subject toi=1nuij2=j=1nuij2=1

We can ponder about it for a while or just take Birkhoff's theorem off the shelf, which states this minimum is attained when uij2 constitute a permutation matrix. A permutation matrix refers to a matrix where uij2 is either 0 or 1 and satisfies the above equality, i.e., rows and columns sum to 1. When uij2 constitute a permutation matrix, the sum can be written as

mini,j=1n(λ^iλj)2uij2=i=1n(λ^σ(i)λi)2

where σ() is a permutation of 1,,n. This completes the proof.

Corollary

Actually, for a real and symmetric matrix, the permutation σ() is the natural permutaion where we arrange the eigenvalues by descending order. Stating more formally, we have

i=1n|λ^iλi|2E22

if λ^1,,λ^n are arranged by descending order and so are λ1,,λn.

Reference

Chapter 6.3 of this book:

Horn, R. A., & Johnson, C. R. (2012). Matrix analysis. Cambridge university press.

© Yedi Zhang | Last updated: September 2024