Karhunen-Loeve Transform

Intutively, we want to transform the samples into a different coordinate system where the new axes are uncorrelated, and most of the information is concentrated in just a few axes.

Let’s say you have a random vector $x \in R^{D}$ . The covariance matrix of the data is:

Ψ = E [(x - μ) (x - μ)^{T}] = E [x x^{T}]

The covariance matrix reflects how spread out the data is, and how correlated one dimension with another.

We have to perform eigen-decomposition $Ψ$ next.

Ψ u_{i} = λ_{i} u_{i}

where $u_{i}$ is the eigenvector and $λ_{i}$ is the eigenvalue. Next, we project the original data $x$ to new axes:

c_{i} = u_{i}^{T} x

Following Feature Extraction, we should only keep the first $d$ coefficient with the largest eigenvalues, to get a compressed representation:

\tilde{x} = i = 1 \sum d c_{i} u_{i}

Messy Notes

Explorer

Karhunen-Loeve Transform

Graph View

Backlinks