Skip to content
homeaboutworkprojectsthesiswritingresume
Loading
~/blog/compact-operators0%dark
  1. home/
  2. writing/
  3. Compact Operators and the Spectral Theorem

16 May 2026 · 7 min read · updated 13 June 2026

Compact Operators and the Spectral Theorem

A compact self-adjoint operator behaves like a symmetric matrix of countable size. We define compact operators, show finite-rank operators are compact and the compact operators are closed under the operator norm, prove the operator norm of a compact self-adjoint operator is attained at an eigenvector, and iterate this to the spectral theorem, that such an operator is a norm-convergent sum of rank-one projections onto an orthonormal sequence of eigenvectors whose eigenvalues tend to zero. This decomposition underlies the Karhunen-Loeve expansion and Mercer's theorem.

  • 4 equations
  • 10 results
  • 13 connections
  • functional-analysis
  • hilbert-space
  • spectral-theory
On this page▾
  • Compact operators
  • The norm is an eigenvalue
  • The spectral theorem
  • Positive operators and the square root

7 min left

  • Compact operators2m
  • The norm is an eigenvalue1m
  • The spectral theorem3m
  • Positive operators and the square root1m

A symmetric matrix is diagonalised by an orthonormal basis of eigenvectors with real eigenvalues. The infinite-dimensional statement fails for general operators, which need have no eigenvectors at all, but it survives intact for one class, the compact self-adjoint operators, where the unit ball is squeezed tightly enough that an extremal eigenvector must exist. This post proves the spectral theorem for that class, the engine behind every eigenfunction expansion. The proof combines the compactness of metric spaces with the quadratic-form bound for self-adjoint operators [@rudinFA1991; @reedSimon1980]. Throughout HHH is a real Hilbert space.

#Compact operators

Definition1

An operator K:H→HK:H\to HK:H→H is compact when every bounded sequence (xn)(x_n)(xn​) has a subsequence for which (Kxn)(Kx_n)(Kxn​) converges in HHH.

Equivalently, KKK carries the unit ball to a set of compact closure. In finite dimensions every operator is compact, by the Bolzano-Weierstrass theorem, but in infinite dimensions the identity is not, since an orthonormal sequence is bounded yet has no convergent subsequence, its terms staying distance 2\sqrt 22​ apart. Compactness is the property that recovers the finite-dimensional behaviour, and it is built from the simplest operators by completion.

Proposition2

Every operator of finite rank is compact, and a limit in operator norm of compact operators is compact. Consequently every norm limit of finite-rank operators is compact.

Proof

Let FFF have finite-dimensional range and (xn)(x_n)(xn​) be bounded, ∥xn∥≤C\norm{x_n}\le C∥xn​∥≤C. Then (Fxn)(Fx_n)(Fxn​) is bounded in the finite-dimensional space ran⁡F\operatorname{ran}FranF, which is complete and closed in HHH, so the Bolzano-Weierstrass limit lies in ran⁡F⊂H\operatorname{ran}F\subset HranF⊂H and (Fxn)(Fx_n)(Fxn​) converges in HHH; thus FFF is compact. Now let Km→KK_m\to KKm​→K in operator norm with each KmK_mKm​ compact, and let (xn)(x_n)(xn​) be bounded by CCC. Since K1K_1K1​ is compact, extract a subsequence indexed by S1S_1S1​ on which (K1xn)(K_1 x_n)(K1​xn​) converges; since K2K_2K2​ is compact, extract S2⊂S1S_2\subset S_1S2​⊂S1​ on which (K2xn)(K_2 x_n)(K2​xn​) converges, and inductively Sm+1⊂SmS_{m+1}\subset S_mSm+1​⊂Sm​ on which (Km+1xn)(K_{m+1} x_n)(Km+1​xn​) converges. The compactness of each KmK_mKm​, not merely of the norm-limit KKK, supplies each SmS_mSm​. The diagonal sequence whose kkk-th index is the kkk-th element of SkS_kSk​ is eventually a subsequence of every SmS_mSm​, so (Kmxn)(K_m x_n)(Km​xn​) converges for every fixed mmm; call this diagonal sequence (xn)(x_n)(xn​). For this subsequence,

∥Kxn−Kxn′∥≤∥(K−Km)xn∥+∥Kmxn−Kmxn′∥+∥(Km−K)xn′∥≤2C∥K−Km∥+∥Kmxn−Kmxn′∥.(1)\norm{Kx_n-Kx_{n'}}\le\norm{(K-K_m)x_n}+\norm{K_m x_n-K_m x_{n'}}+\norm{(K_m-K)x_{n'}} \le 2C\norm{K-K_m}+\norm{K_m x_n-K_m x_{n'}}. \tag{1}∥Kxn​−Kxn′​∥≤∥(K−Km​)xn​∥+∥Km​xn​−Km​xn′​∥+∥(Km​−K)xn′​∥≤2C∥K−Km​∥+∥Km​xn​−Km​xn′​∥.(1)

Given ε>0\varepsilon>0ε>0, fix mmm with ∥K−Km∥<ε/(4C)\norm{K-K_m}<\varepsilon/(4C)∥K−Km​∥<ε/(4C), then choose n,n′n,n'n,n′ large so the middle term is below ε/2\varepsilon/2ε/2. Hence (Kxn)(Kx_n)(Kxn​) is Cauchy, and converges by completeness, so KKK is compact.

The proposition identifies the compact operators with the closure of the finite-rank operators. The spectral theorem will exhibit a compact self-adjoint operator as exactly such a limit, with the approximating ranks spanned by eigenvectors.

#The norm is an eigenvalue

For a general operator the supremum defining the norm need not be attained. Compactness forces attainment, and self-adjointness makes the maximiser an eigenvector.

Lemma3

Let TTT be compact and self-adjoint with T≠0T\neq 0T=0. There is a unit vector eee and a scalar λ\lambdaλ with ∣λ∣=∥T∥\abs\lambda=\norm T∣λ∣=∥T∥ and Te=λeTe=\lambda eTe=λe.

Proof

By the quadratic-form identity, ∥T∥=sup⁡∥x∥=1∣⟨Tx,x⟩∣\norm T=\sup_{\norm x=1} \abs{\ip{Tx}{x}}∥T∥=sup∥x∥=1​∣⟨Tx,x⟩∣, so there are unit vectors xnx_nxn​ with ∣⟨Txn,xn⟩∣→∥T∥\abs{\ip{Tx_n}{x_n}}\to\norm T∣⟨Txn​,xn​⟩∣→∥T∥. Since HHH is real, ⟨Tx,x⟩∈R\ip{Tx}{x}\in\R⟨Tx,x⟩∈R for every xxx, so each ⟨Txn,xn⟩\ip{Tx_n}{x_n}⟨Txn​,xn​⟩ is real, and along a subsequence ⟨Txn,xn⟩→±∥T∥=:λ\ip{Tx_n}{x_n}\to\pm\norm T=:\lambda⟨Txn​,xn​⟩→±∥T∥=:λ with λ∈R\lambda\in\Rλ∈R and ∣λ∣=∥T∥\abs\lambda=\norm T∣λ∣=∥T∥. The vectors are approximate eigenvectors, and by symmetry of the real inner product ⟨λxn,Txn⟩+⟨Txn,λxn⟩=2λ⟨Txn,xn⟩\ip{\lambda x_n}{Tx_n}+\ip{Tx_n}{\lambda x_n}=2\lambda\ip{Tx_n}{x_n}⟨λxn​,Txn​⟩+⟨Txn​,λxn​⟩=2λ⟨Txn​,xn​⟩, so

∥Txn−λxn∥2=∥Txn∥2−2λ⟨Txn,xn⟩+λ2≤2λ2−2λ⟨Txn,xn⟩→0,(2)\norm{Tx_n-\lambda x_n}^2=\norm{Tx_n}^2-2\lambda\ip{Tx_n}{x_n}+\lambda^2\le 2\lambda^2-2\lambda \ip{Tx_n}{x_n}\to 0, \tag{2}∥Txn​−λxn​∥2=∥Txn​∥2−2λ⟨Txn​,xn​⟩+λ2≤2λ2−2λ⟨Txn​,xn​⟩→0,(2)

using ∥Txn∥≤∥T∥=∣λ∣\norm{Tx_n}\le\norm T=\abs\lambda∥Txn​∥≤∥T∥=∣λ∣ and ⟨Txn,xn⟩→λ\ip{Tx_n}{x_n}\to\lambda⟨Txn​,xn​⟩→λ. By compactness a subsequence has Txn→zTx_n\to zTxn​→z, and then λxn=Txn−(Txn−λxn)→z\lambda x_n=Tx_n-(Tx_n-\lambda x_n)\to zλxn​=Txn​−(Txn​−λxn​)→z. Since λ≠0\lambda\neq 0λ=0, xn→e:=z/λx_n\to e:=z/ \lambdaxn​→e:=z/λ, a unit vector as the limit of unit vectors. Continuity of TTT gives Te=lim⁡Txn=z=λeTe=\lim Tx_n=z=\lambda eTe=limTxn​=z=λe.

The lemma already contains the induction step of the theorem. It produces the largest eigenvalue and its eigenvector, and the rest of the spectrum is found by removing that direction and repeating.

#The spectral theorem

Theorem4

Let TTT be a compact self-adjoint operator on HHH. There is a finite or countably infinite orthonormal system of eigenvectors (en)(e_n)(en​) with real eigenvalues λn\lambda_nλn​ satisfying ∣λ1∣≥∣λ2∣≥⋯\abs{\lambda_1}\ge \abs{\lambda_2}\ge\cdots∣λ1​∣≥∣λ2​∣≥⋯, tending to 000 if infinite, such that

Tx=∑nλn⟨x,en⟩enfor every x∈H,(3)Tx=\sum_n\lambda_n\ip{x}{e_n}e_n\qquad\text{for every }x\in H, \tag{3}Tx=n∑​λn​⟨x,en​⟩en​for every x∈H,(3)

the series converging in HHH.

Proof

Build the system by exhaustion, as an induction on nnn. Set H0=HH_0=HH0​=H and T0=TT_0=TT0​=T; the base case is that H0H_0H0​ is TTT-invariant. Suppose H0,…,Hn−1H_0,\dots,H_{n-1}H0​,…,Hn−1​ are TTT-invariant and e1,…,en−1e_1,\dots,e_{n-1}e1​,…,en−1​ are eigenvectors of TTT. Given the compact self-adjoint Tn−1=T∣Hn−1T_{n-1}=T|_{H_{n-1}}Tn−1​=T∣Hn−1​​ on the closed subspace Hn−1H_{n-1}Hn−1​, if Tn−1=0T_{n-1}=0Tn−1​=0 stop. Otherwise Lemma 3 gives a unit eigenvector en∈Hn−1e_n\in H_{n-1}en​∈Hn−1​ with Tn−1en=λnenT_{n-1}e_n=\lambda_n e_nTn−1​en​=λn​en​ and ∣λn∣=∥Tn−1∥\abs{\lambda_n}=\norm{T_{n-1}}∣λn​∣=∥Tn−1​∥; since Hn−1H_{n-1}Hn−1​ is TTT-invariant, Tn−1en=TenT_{n-1}e_n=Te_nTn−1​en​=Ten​, so ene_nen​ is a genuine eigenvector of TTT. Set Hn={x∈Hn−1:x⊥en}={e1,…,en}⊥H_n=\{x\in H_{n-1}:x\perp e_n\}=\{e_1,\dots,e_n\}^\perpHn​={x∈Hn−1​:x⊥en​}={e1​,…,en​}⊥. This HnH_nHn​ is TTT-invariant, because for x∈Hnx\in H_nx∈Hn​ and every k≤nk\le nk≤n self-adjointness gives ⟨Tx,ek⟩=⟨x,Tek⟩=λk⟨x,ek⟩=0\ip{Tx}{e_k}=\ip{x}{Te_k}=\lambda_k\ip{x}{e_k}=0⟨Tx,ek​⟩=⟨x,Tek​⟩=λk​⟨x,ek​⟩=0, so Tx⊥e1,…,enTx\perp e_1,\dots,e_nTx⊥e1​,…,en​, that is Tx∈HnTx\in H_nTx∈Hn​, completing the induction. The restriction Tn=T∣HnT_n=T|_{H_n}Tn​=T∣Hn​​ is again compact and self-adjoint with ∥Tn∥≤∥Tn−1∥\norm{T_n}\le\norm{T_{n-1}}∥Tn​∥≤∥Tn−1​∥, whence ∣λn+1∣≤∣λn∣\abs{\lambda_{n+1}}\le\abs{\lambda_n}∣λn+1​∣≤∣λn​∣.

If the process stops at stage NNN, then TTT vanishes on HN={e1,…,eN}⊥H_N=\{e_1,\dots,e_N\}^\perpHN​={e1​,…,eN​}⊥, and writing x=∑n≤N⟨x,en⟩en+rx=\sum_{n\le N}\ip{x}{e_n}e_n+rx=∑n≤N​⟨x,en​⟩en​+r with r∈HNr\in H_Nr∈HN​ gives Tx=∑n≤Nλn⟨x,en⟩enTx=\sum_{n\le N}\lambda_n\ip{x}{e_n}e_nTx=∑n≤N​λn​⟨x,en​⟩en​ since Tr=0Tr=0Tr=0, which is Equation (3) with a finite sum.

If it never stops, the eigenvalues tend to 000. Were ∣λn∣≥δ>0\abs{\lambda_n}\ge\delta>0∣λn​∣≥δ>0 for all nnn, the bounded sequence (en)(e_n)(en​) would have ∥Ten−Tem∥2=∥λnen−λmem∥2=λn2+λm2≥2δ2\norm{Te_n-Te_m}^2=\norm{\lambda_n e_n-\lambda_m e_m}^2=\lambda_n^2+ \lambda_m^2\ge 2\delta^2∥Ten​−Tem​∥2=∥λn​en​−λm​em​∥2=λn2​+λm2​≥2δ2 for n≠mn\neq mn=m by orthonormality, so (Ten)(Te_n)(Ten​) would have no convergent subsequence, contradicting compactness. Hence λn→0\lambda_n\to 0λn​→0. For the expansion, fix xxx and set rN=x−∑n≤N⟨x,en⟩enr_N=x-\sum_{n\le N}\ip{x}{e_n}e_nrN​=x−∑n≤N​⟨x,en​⟩en​, which lies in HNH_NHN​ with ∥rN∥≤∥x∥\norm{r_N}\le\norm x∥rN​∥≤∥x∥ by Bessel's inequality. Then

∥Tx−∑n≤Nλn⟨x,en⟩en∥=∥TrN∥=∥TNrN∥≤∥TN∥∥rN∥≤∣λN+1∣∥x∥→0,(4)\Big\|Tx-\sum_{n\le N}\lambda_n\ip{x}{e_n}e_n\Big\|=\norm{Tr_N}=\norm{T_N r_N}\le\norm{T_N}\norm{r_N} \le\abs{\lambda_{N+1}}\norm x\to 0, \tag{4}​Tx−n≤N∑​λn​⟨x,en​⟩en​​=∥TrN​∥=∥TN​rN​∥≤∥TN​∥∥rN​∥≤∣λN+1​∣∥x∥→0,(4)

using Ten=λnenTe_n=\lambda_n e_nTen​=λn​en​ to pull the finite sum out of TxTxTx and ∥TN∥=∣λN+1∣\norm{T_N}=\abs{\lambda_{N+1}}∥TN​∥=∣λN+1​∣. The partial sums converge to TxTxTx, which is Equation (3).

The eigenvectors with nonzero eigenvalue span the closure of the range of TTT, and adjoining an orthonormal basis of the kernel ker⁡T={en}⊥\ker T=\{e_n\}^\perpkerT={en​}⊥ completes them to an orthonormal basis of HHH when HHH is separable, the basis in which TTT is the diagonal operator en↦λnene_n\mapsto\lambda_n e_nen​↦λn​en​. The decay λn→0\lambda_n\to 0λn​→0 is the residue of compactness. It is why the operator is approximated in norm by its finite-rank truncations ∑n≤Nλn⟨⋅,en⟩en\sum_{n\le N}\lambda_n \ip{\cdot}{e_n}e_n∑n≤N​λn​⟨⋅,en​⟩en​, with error ∣λN+1∣\abs{\lambda_{N+1}}∣λN+1​∣.

#Positive operators and the square root

The operators arising as covariances are not merely self-adjoint but positive, and positivity pins the sign of the spectrum.

Definition5

A self-adjoint operator TTT is positive, written T≥0T\ge 0T≥0, when ⟨Tx,x⟩≥0\ip{Tx}{x}\ge 0⟨Tx,x⟩≥0 for all xxx.

Corollary6

A compact positive operator has λn≥0\lambda_n\ge 0λn​≥0 for every nnn, and the expansion Equation (3) has nonnegative coefficients. Its eigenvalues are summable against the squared coordinates of any vector, ∑nλn⟨x,en⟩2=⟨Tx,x⟩≥0\sum_n\lambda_n\ip{x}{e_n}^2=\ip{Tx}{x}\ge 0∑n​λn​⟨x,en​⟩2=⟨Tx,x⟩≥0.

Proof

For an eigenvector, λn=⟨Ten,en⟩≥0\lambda_n=\ip{Te_n}{e_n}\ge 0λn​=⟨Ten​,en​⟩≥0 by positivity. Pairing Equation (3) with xxx and using continuity of the inner product gives ⟨Tx,x⟩=∑nλn⟨x,en⟩2\ip{Tx}{x}=\sum_n\lambda_n\ip{x}{e_n}^2⟨Tx,x⟩=∑n​λn​⟨x,en​⟩2, a sum of nonnegative terms.

This is the exact structure a covariance operator carries, and the spectral theorem applied to it is the Karhunen-Loeve expansion, whose eigenvalues are the variances of the uncorrelated coordinates and whose eigenvectors are the principal directions of the process. When the operator is the integral operator of a continuous kernel, the same decomposition with its eigenfunctions made continuous is Mercer's theorem, the bridge from the abstract spectral theorem to the explicit series a covariance function admits. Among infinite-dimensional operators, the compact self-adjoint ones are uniquely as transparent as a symmetric matrix, and the spectral theorem is why.

Part 6 of 7 in Hilbert Spaces and Operators

← previousBounded Operators and the Adjointnext →Mercer's Theorem and Reproducing Kernels

Explore connections

see in the atlas →

related

  • L-squared and Completeness
  • Projection and Riesz Representation
  • Orthonormal Bases

referenced by (6)

  • Bounded Operators and the Adjoint
  • Eigenvalues and the Spectral Theorem
  • Gaussian Vectors and Processes
  • Mercer's Theorem and Reproducing Kernels
  • Orthonormal Bases
  • Second-Order Processes and Mean-Square Calculus
cite
@misc{compact-operators,
  author = {Zac Kienzle},
  title  = {Compact Operators and the Spectral Theorem},
  year   = {2026},
  month  = {05},
  url    = {https://zackienzle.com/blog/compact-operators}
}