Eigenvalues and eigenvectors


In linear algebra, an eigenvector or characteristic vector is a vector that has its direction unchanged by a given linear transformation. More precisely, an eigenvector of a linear transformation is scaled by a constant factor when the linear transformation is applied to it:. The corresponding eigenvalue, characteristic value, or characteristic root is the multiplying factor .
Geometrically, vectors are multi-dimensional quantities with magnitude and direction, often pictured as arrows. A linear transformation rotates, stretches, or shears the vectors upon which it acts. A linear transformation's eigenvectors are those vectors that are only stretched or shrunk, with neither rotation nor shear. The corresponding eigenvalue is the factor by which an eigenvector is stretched or shrunk. If the eigenvalue is negative, the eigenvector's direction is reversed.
The eigenvectors and eigenvalues of a linear transformation serve to characterize it, and so they play important roles in all areas where linear algebra is applied, from geology to quantum mechanics. In particular, it is often the case that a system is represented by a linear transformation whose outputs are fed as inputs to the same transformation. In such an application, the largest eigenvalue is of particular importance, because it governs the long-term behavior of the system after many applications of the linear transformation, and the associated eigenvector is the steady state of the system.

Matrices

For an matrix and a nonzero -vector, if multiplying by simply scales by a factor, where is a scalar, then is called an eigenvector of, and is the corresponding eigenvalue. This relationship can be expressed as:.
Given an n-dimensional vector space and a choice of basis, there is a direct correspondence between linear transformations from the vector space into itself and n-by-n square matrices. Hence, in a finite-dimensional vector space, it is equivalent to define eigenvalues and eigenvectors using either the language of linear transformations, or the language of matrices.

Overview

Eigenvalues and eigenvectors feature prominently in the analysis of linear transformations. The prefix eigen- is adopted from the German eigen for 'proper', 'characteristic', 'own'. Originally used to study principal axes of the rotational motion of rigid bodies, eigenvalues and eigenvectors have a wide range of applications, for example in stability analysis, vibration analysis, atomic orbitals, facial recognition, and matrix diagonalization.
In essence, an eigenvector of a linear transformation is a nonzero vector that, when is applied to it, does not change direction. Applying to the eigenvector only scales the eigenvector by the scalar value, called an eigenvalue. This condition can be written as the equation
referred to as the eigenvalue equation or eigenequation. In general, may be any scalar. For example, may be negative, in which case the eigenvector reverses direction as part of the scaling, or it may be zero or complex.
The example here, based on the Mona Lisa, provides a simple illustration. Each point on the painting can be represented as a vector pointing from the center of the painting to that point. The linear transformation in this example is called a shear mapping. Points in the top half are moved to the right, and points in the bottom half are moved to the left, proportional to how far they are from the horizontal axis that goes through the middle of the painting. The vectors pointing to each point in the original image are therefore tilted right or left, and made longer or shorter by the transformation. Points along the horizontal axis do not move at all when this transformation is applied. Therefore, any vector that points directly to the right or left with no vertical component is an eigenvector of this transformation, because the mapping does not change its direction. Moreover, these eigenvectors all have an eigenvalue equal to one, because the mapping does not change their length either.
Linear transformations can take many different forms, mapping vectors in a variety of vector spaces, so the eigenvectors can also take many forms. For example, the linear transformation could be a differential operator like in which case the eigenvectors are functions called eigenfunctions that are scaled by that differential operator, such as
Alternatively, the linear transformation could take the form of an matrix, in which case the eigenvectors are matrices. If the linear transformation is expressed in the form of an matrix, then the eigenvalue equation for a linear transformation above can be rewritten as the matrix multiplication
where the eigenvector is an matrix. For a matrix, eigenvalues and eigenvectors can be used to decompose the matrix—for example by diagonalizing it.
Eigenvalues and eigenvectors give rise to many closely related mathematical concepts, and the prefix eigen- is applied liberally when naming them:
  • The set of all eigenvectors of a linear transformation, each paired with its corresponding eigenvalue, is called the eigensystem of that transformation.
  • The set of all eigenvectors of corresponding to the same eigenvalue, together with the zero vector, is called an eigenspace, or the characteristic space of associated with that eigenvalue.
  • If a set of eigenvectors of forms a basis of the domain of, then this basis is called an eigenbasis.

    History

Eigenvalues are often introduced in the context of linear algebra or matrix theory. Historically, however, they arose in the study of quadratic forms and differential equations.
In the 18th century, Leonhard Euler studied the rotational motion of a rigid body, and discovered the importance of the principal axes. Joseph-Louis Lagrange realized that the principal axes are the eigenvectors of the inertia matrix.
In the early 19th century, Augustin-Louis Cauchy saw how their work could be used to classify the quadric surfaces, and generalized it to arbitrary dimensions. Cauchy also coined the term racine caractéristique, for what is now called eigenvalue; his term survives in characteristic equation.
Later, Joseph Fourier used the work of Lagrange and Pierre-Simon Laplace to solve the heat equation by separation of variables in his 1822 treatise The Analytic Theory of Heat . Charles-François Sturm elaborated on Fourier's ideas further, and brought them to the attention of Cauchy, who combined them with his own ideas and arrived at the fact that real symmetric matrices have real eigenvalues. This was extended by Charles Hermite in 1855 to what are now called Hermitian matrices.
Around the same time, Francesco Brioschi proved that the eigenvalues of orthogonal matrices lie on the unit circle, and Alfred Clebsch found the corresponding result for skew-symmetric matrices. Finally, Karl Weierstrass clarified an important aspect in the stability theory started by Laplace, by realizing that defective matrices can cause instability.
In the meantime, Joseph Liouville studied eigenvalue problems similar to those of Sturm; the discipline that grew out of their work is now called Sturm–Liouville theory. Schwarz studied the first eigenvalue of Laplace's equation on general domains towards the end of the 19th century, while Poincaré studied Poisson's equation a few years later.
At the start of the 20th century, David Hilbert studied the eigenvalues of integral operators by viewing the operators as infinite matrices. He was the first to use the German word eigen, which means "own", to denote eigenvalues and eigenvectors in 1904, though he may have been following a related usage by Hermann von Helmholtz. For some time, the standard term in English was "proper value", but the more distinctive term "eigenvalue" is the standard today.
The first numerical algorithm for computing eigenvalues and eigenvectors appeared in 1929, when Richard von Mises published the power method. One of the most popular methods today, the QR algorithm, was proposed independently by John G. F. Francis and Vera Kublanovskaya in 1961.

Eigenvalues and eigenvectors of matrices

Eigenvalues and eigenvectors are often introduced to students in the context of linear algebra courses focused on matrices.
Furthermore, linear transformations over a finite-dimensional vector space can be represented using matrices, which is especially common in numerical and computational applications.
Consider -dimensional vectors that are formed as a list of scalars, such as the three-dimensional vectors
These vectors are said to be scalar multiples of each other, or parallel or collinear, if there is a scalar such that
In this case,
Now consider the linear transformation of -dimensional vectors defined by an -by- matrix,
or
where, for each row,
If it occurs that and are scalar multiples, that is if
then is an eigenvector of the linear transformation and the scale factor is the eigenvalue corresponding to that eigenvector. Equation is the eigenvalue equation for the matrix.
Equation can be stated equivalently as
where is the -by- identity matrix and is the zero vector.

Eigenvalues and the characteristic polynomial

Equation has a nonzero solution if and only if the determinant of the matrix is zero. Therefore, the eigenvalues of are values of that satisfy the equation
Using the Leibniz formula for determinants, the left-hand side of equation is a polynomial function of the variable and the degree of this polynomial is, the order of the matrix. Its coefficients depend on the entries of, except that its term of degree is always. This polynomial is called the characteristic polynomial of. Equation is called the characteristic equation or the secular equation of.
The characteristic polynomial of an -by- matrix, being a polynomial of degree, has at most complex number roots, which can be found by factoring the characteristic polynomial, or numerically by root finding. The characteristic polynomial can be factored into the product of linear terms,
where the complex numbers,,...,, each of which is an eigenvalue, may not all be distinct.
As a brief example, which is described in more detail in the examples section later, consider the matrix
Taking the determinant of, the characteristic polynomial of is
Setting the characteristic polynomial equal to zero, it has roots at and, which are the two eigenvalues of. The eigenvectors corresponding to each eigenvalue can be found by solving for the components of in the equation. In this example, the eigenvectors are any nonzero scalar multiples of
If the entries of the matrix are all real numbers, then the coefficients of the characteristic polynomial will also be real numbers, but the eigenvalues may still have nonzero imaginary parts. The entries of the corresponding eigenvectors therefore may also have nonzero imaginary parts. Similarly, the eigenvalues may be irrational numbers even if all the entries of are rational numbers or even if they are all integers. However, if the entries of are all algebraic numbers, which include the rationals, the eigenvalues must also be algebraic numbers.
The non-real roots of a real polynomial with real coefficients can be grouped into pairs of complex conjugates, namely with the two members of each pair having imaginary parts that differ only in sign and the same real part. If the degree is odd, then by the intermediate value theorem at least one of the roots is real. Therefore, any real matrix with odd order has at least one real eigenvalue, whereas a real matrix with even order may not have any real eigenvalues. The eigenvectors associated with these complex eigenvalues are also complex and also appear in complex conjugate pairs.