Eigenvalues and Eigenvectors — Explained with Applications
Learn eigenvalues and eigenvectors — what they mean geometrically, how to compute them, and why they underpin PCA, Google PageRank, vibration analysis, and quantum mechanics.
- Eigenvector of a matrix A: a nonzero vector v such that Av = λv (only scaled, not rotated)
- Eigenvalue λ: the scalar factor by which the eigenvector is stretched or shrunk
- Compute: solve det(A - λI) = 0 for λ, then (A - λI)v = 0 for v
- Real symmetric matrices: always have real eigenvalues and orthogonal eigenvectors — use numpy.linalg.eigh
- Power iteration: repeated Av / ‖Av‖ converges to dominant eigenvector — original PageRank algorithm
- Production trap: near-zero eigenvalues cause division‑by‑zero in PCA whitening and numerical instability in any downstream inversion
An eigenvector of a matrix is a direction that the matrix does not rotate — it only stretches or shrinks. The eigenvalue is how much it stretches. If you multiply a transformation matrix by its eigenvector, you get the same vector scaled by the eigenvalue. This 'invariant direction' concept appears everywhere: principal components in PCA, dominant webpage in PageRank, stable modes in structural engineering, and quantum states in physics.
Eigenvalues and eigenvectors are perhaps the most fundamental concept in applied linear algebra. Av = λv — a matrix A, when applied to its eigenvector v, produces the same vector scaled by eigenvalue λ. Finding the directions that a transformation preserves (up to scaling) reveals the essential structure of the transformation.
Google's original PageRank was the dominant eigenvector of the web's link matrix. PCA computes the eigenvectors of a covariance matrix to find principal components. Quantum states are eigenvectors of Hamiltonian operators. Structural resonant frequencies are eigenvalues of stiffness matrices. Understanding eigenvectors is understanding how to extract the 'essential directions' from a linear system.
Computing Eigenvalues and Eigenvectors
Eigenvalues: solve det(A - λI) = 0 (characteristic polynomial). Eigenvectors: for each λ, solve (A - λI)v = 0.
The characteristic polynomial of an n×n matrix is degree n. For large n, numerical methods (QR algorithm, power iteration) are used. Python's numpy.linalg.eig uses the LAPACK library, which applies the QR algorithm with shifts.
Power Iteration — Finding the Dominant Eigenvector
Power iteration finds the largest eigenvalue by repeatedly multiplying by A and normalising. This is the algorithm behind original PageRank.
Algorithm: initialize random v, then loop v = A v / ‖A v‖. Convergence rate depends on the ratio |λ₂/λ₁| — closer to 1 means slower convergence. Rayleigh quotient λ = vᵀAv / vᵀv gives eigenvalue estimate.
Geometric Intuition: What Does Av = λv Mean?
An eigenvector points in a direction that is invariant under the transformation A — it only gets stretched (or shrunk, or reversed) by λ. This is the 'natural axis' of the matrix.
Example: A = [[2,0],[0,3]]. The eigenvectors are [1,0]ᵀ (λ=2) and [0,1]ᵀ (λ=3). Multiply any point: x-coordinate doubles, y-coordinate triples — the axes are scaled independently. If you pick any other direction, the point rotates AND scales.
For a symmetric matrix, eigenvectors are orthogonal — they form a perfect coordinate system. For a non‑symmetric matrix, eigenvectors are not orthogonal, and the transformation can shear.
Real‑World Applications: Where Eigenvalues Matter in Production
Eigenvalues and eigenvectors are not just theoretical. They power critical algorithms:
- PCA (Principal Component Analysis): eigenvectors of the covariance matrix = principal components; eigenvalues = variance explained. Used for dimensionality reduction, noise filtering, and anomaly detection.
- PageRank: the dominant eigenvector of the web's transition matrix gives page importance. Google solved this for billions of pages using iterative methods.
- Structural Engineering: eigenvalues of the stiffness matrix = natural frequencies. If a bridge's frequency matches wind gust frequency, resonance can collapse it (Tacoma Narrows).
- Quantum Mechanics: eigenstates of the Hamiltonian operator are the possible energy levels. The Schrödinger equation is an eigenvalue problem.
- Graph Theory: eigenvalues of the adjacency matrix indicate community structure, connectivity, and node centrality (spectral clustering).
- Eigenvalues are the roots of the characteristic polynomial — they capture the 'stretch factors' along invariant directions.
- The spectral radius (largest |λ|) determines stability of dynamical systems: if > 1, the system diverges.
- The condition number (max λ / min λ) tells you how close the matrix is to singular.
- For symmetric matrices, eigenvalues are real and the eigenvectors form an orthonormal basis — the matrix can be 'diagonalized' perfectly.
Numerical Stability and Production Gotchas
Eigendecomposition is numerically tricky. Common pitfalls:
- Near‑zero eigenvalues cause division by zero in whitening or matrix functions.
- Round‑off asymmetry: a matrix that is theoretically symmetric may have tiny asymmetry due to floating‑point ops, leading to complex eigenvalues.
- Defective matrices: not all matrices have a full set of eigenvectors — numpy.linalg.eig still returns a result, but the eigenvectors may be linearly dependent or wildly inaccurate.
- Condition number: if the condition number (max|λ|/min|λ|) is huge (e.g., > 10¹²), small changes in A cause large changes in eigenvectors. The problem is ill‑posed.
- Always check residual ‖Av
- λv‖.
- Use numpy.linalg.eigh when symmetry is guaranteed.
- For ill‑conditioned cases, switch to SVD (singular value decomposition), which is more stable for any matrix.
- When computing matrix functions (sqrt, inverse, exp) via eigendecomposition, use scipy.linalg.sqrtm, inv, expm which handle stability internally.
Near‑Zero Eigenvalues Silently Break PCA Whitening in Production
- Never compute the inverse of a matrix based on eigendecomposition without checking the condition number.
- Always handle near‑zero eigenvalues by clamping or using SVD.
- Monitor the condition number of your covariance matrix in production. A condition number > 1e6 is a red flag.
- Use numpy.linalg.pinv or scipy.linalg.pinv for robust generalized inverses.
Key takeaways
Common mistakes to avoid
4 patternsUsing eigendecomposition instead of SVD for rectangular matrices
Assuming all matrices have a full set of eigenvectors (diagonalizable)
Forgetting to handle negative eigenvalues in covariance matrix due to numeric noise
Using numpy.linalg.eig on a symmetric matrix with tiny asymmetry
Interview Questions on This Topic
What is an eigenvector geometrically?
Frequently Asked Questions
That's Linear Algebra. Mark it forged?
3 min read · try the examples if you haven't