A square root of an $n\times n$ matrix $A$ is any matrix $X$ such that $X^2 = A$.
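For a scalar $a$ ($n = 1$), there are two square roots (which are equal if $a = 0$), and they are real if and only if $a$ is real and nonnegative. For $n \ge 2$, depending on the matrix there can be no square roots, finitely many, or infinitely many. The matrix

$$A = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}$$

is easily seen to have no square roots. The matrix

$$D = \begin{bmatrix} 1 & 0 \\ 0 & 2 \end{bmatrix}$$

has four square roots, $\mathrm{diag}(\pm 1, \pm\sqrt{2})$. The identity matrix

$$I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$

has infinitely many square roots (namely the $2\times 2$ involutory matrices), including $\mathrm{diag}(\pm 1, \pm 1)$, the lower triangular matrix

$$\begin{bmatrix} 1 & 0 \\ 1 & -1 \end{bmatrix}$$

and any symmetric orthogonal matrix, such as

$$\begin{bmatrix} \cos\theta & \sin\theta \\ \sin\theta & -\cos\theta \end{bmatrix}$$

(which is a Householder matrix). Clearly, a square root of a diagonal matrix need not be diagonal.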
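The claims above are easy to check numerically. The following is a minimal NumPy sketch; the specific matrices (`D = diag(1, 2)`, the lower triangular involution, and the reflection with an arbitrary angle `theta`) are illustrative choices assumed for this example.

```python
import itertools
import numpy as np

# Illustrative matrices (assumed for this sketch).
D = np.diag([1.0, 2.0])
I = np.eye(2)

# The four diagonal square roots of D: diag(+-1, +-sqrt(2)).
roots_of_D = [np.diag([s1 * 1.0, s2 * np.sqrt(2.0)])
              for s1, s2 in itertools.product([1, -1], repeat=2)]
all_square_to_D = all(np.allclose(X @ X, D) for X in roots_of_D)

# Two non-diagonal square roots of the identity (involutory matrices).
L = np.array([[1.0, 0.0],
              [1.0, -1.0]])                       # lower triangular
theta = 0.7                                       # arbitrary angle
H = np.array([[np.cos(theta), np.sin(theta)],
              [np.sin(theta), -np.cos(theta)]])   # symmetric orthogonal

print(all_square_to_D)
print(np.allclose(L @ L, I), np.allclose(H @ H, I))
```

Both `L` and `H` square to the identity even though neither is diagonal, which is the point of the closing remark above.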
The matrix square root of most practical interest is the one whose eigenvalues lie in the right half-plane, which is called the principal square root, written $A^{1/2}$. If $A$ is nonsingular and has no eigenvalues on the negative real axis then $A$ has a unique principal square root. For the diagonal matrix $D = \mathrm{diag}(1, 2)$ above, $D^{1/2} = \mathrm{diag}(1, \sqrt{2})$.
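As a hedged illustration, SciPy's `scipy.linalg.sqrtm` can be used to compute a principal square root. The test matrix below is an assumption chosen for this sketch: it is nonsymmetric with eigenvalues 4 and 9, so it satisfies the uniqueness condition stated above.

```python
import numpy as np
from scipy.linalg import sqrtm

# Assumed example: upper triangular, eigenvalues 4 and 9
# (nonsingular, no eigenvalues on the negative real axis).
A = np.array([[4.0, 1.0],
              [0.0, 9.0]])

X = sqrtm(A)                          # principal square root
residual = np.linalg.norm(X @ X - A)  # should be near machine precision
eigs = np.linalg.eigvals(X)           # should lie in the right half-plane

print(X)
print(residual, eigs)
```

The eigenvalues of `X` are the positive square roots of the eigenvalues of `A`, which is what singles out the principal square root among the other candidates.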
A symmetric positive definite matrix has a unique symmetric positive definite square root. Indeed if $A$ is symmetric positive definite then it has a spectral decomposition $A = Q \Lambda Q^T$, where $Q$ is orthogonal and $\Lambda$ is diagonal with positive diagonal elements, and then $A^{1/2} = Q \Lambda^{1/2} Q^T$ is also symmetric positive definite.
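The spectral decomposition argument translates directly into code. The sketch below uses `numpy.linalg.eigh`; the helper name `spd_sqrt` and the test matrix are hypothetical, introduced only for illustration.

```python
import numpy as np

def spd_sqrt(A):
    """Square root of a symmetric positive definite matrix via its
    spectral decomposition (hypothetical helper for illustration)."""
    w, Q = np.linalg.eigh(A)          # A = Q diag(w) Q^T, w real
    if np.any(w <= 0):
        raise ValueError("matrix is not positive definite")
    # Scale column j of Q by sqrt(w[j]), i.e. form Q diag(sqrt(w)) Q^T.
    return (Q * np.sqrt(w)) @ Q.T

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])            # assumed example, eigenvalues 1 and 3
X = spd_sqrt(A)

print(np.allclose(X @ X, A))
print(np.allclose(X, X.T), np.linalg.eigvalsh(X))
```

The result is symmetric with positive eigenvalues, so it is itself symmetric positive definite.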
If $A$ is nonsingular then it has at least $2^s$ square roots, where $s$ is the number of distinct eigenvalues. The existence of a square root of a singular matrix depends on the Jordan structure of the zero eigenvalues.
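To make the count concrete, here is a sketch for the simplest case: a diagonalizable matrix with distinct eigenvalues (so $s = n$), where each square root is obtained by choosing a sign for the square root of each eigenvalue. The test matrix is an assumption for this example, and the approach does not cover defective or singular matrices.

```python
import itertools
import numpy as np

# Assumed example with distinct eigenvalues 4 and 9, so s = 2.
A = np.array([[4.0, 1.0],
              [0.0, 9.0]])

lam, V = np.linalg.eig(A)             # A = V diag(lam) V^{-1}
Vinv = np.linalg.inv(V)

# One square root per choice of sign for each eigenvalue: 2^s in total.
roots = []
for signs in itertools.product([1.0, -1.0], repeat=len(lam)):
    d = np.array(signs) * np.sqrt(lam)
    roots.append(V @ np.diag(d) @ Vinv)

print(len(roots))
print(all(np.allclose(X @ X, A) for X in roots))
```

The choice with all signs positive is the principal square root; the others have at least one eigenvalue in the left half-plane.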
In some contexts involving symmetric positive definite matrices $A$, such as Kalman filtering, a matrix $X$ such that $A = X^TX$ is called a square root, but this is not the standard meaning.
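One common way to obtain such a factor is the Cholesky factorization, which is used here as an assumed example of the alternative meaning. The sketch shows that the resulting factor satisfies $A = X^TX$ but is not a square root in the standard sense.

```python
import numpy as np

A = np.array([[4.0, 2.0],
              [2.0, 3.0]])            # assumed symmetric positive definite example

L = np.linalg.cholesky(A)             # lower triangular, A = L L^T
X = L.T                               # upper triangular, A = X^T X

print(np.allclose(X.T @ X, A))        # factor in the Kalman filtering sense
print(np.allclose(X @ X, A))          # but X @ X does not reproduce A
```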
When $A$ has structure one can ask whether a square root having the same structure, or some other related structure, exists. Results are known for (for example)
- stochastic matrices,
- skew-Hamiltonian matrices,
- centrosymmetric matrices, and
- matrices from an automorphism group.
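An important distinction is between square roots of $A$ that can be expressed as a polynomial in $A$ (primary square roots) and those that cannot. Square roots of the latter type arise when $A$ has repeated eigenvalues and two copies of an eigenvalue are mapped to different square roots. In some contexts, a nonprimary square root may be the natural choice. For example, consider the matrix

$$G(\theta) = \begin{bmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{bmatrix},$$

which represents a rotation through an angle $\theta$ radians clockwise. The natural square root of $G(\theta)$ is $G(\theta/2)$. For $\theta = \pi$, this gives the square root

$$G(\pi/2) = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix} \quad \text{of} \quad G(\pi) = \begin{bmatrix} -1 & 0 \\ 0 & -1 \end{bmatrix}.$$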
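The rotation example can be checked with a few lines of NumPy. In this sketch the function name `G` and the clockwise sign convention are assumptions matching the description above. Since every polynomial in $-I$ is a multiple of the identity, a square root of $-I$ with a nonzero off-diagonal entry cannot be primary.

```python
import numpy as np

def G(theta):
    """Clockwise rotation through theta radians (assumed convention)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, s],
                     [-s, c]])

A = G(np.pi)          # equals -I up to rounding
X = G(np.pi / 2)      # rotation through half the angle

print(np.allclose(A, -np.eye(2)))
print(np.allclose(X @ X, A))

# The repeated eigenvalue -1 of A is mapped to two different
# square roots, +i and -i, so X is a nonprimary square root.
print(np.linalg.eigvals(X))
```

The half-angle relation holds for any angle, not only $\theta = \pi$: `G(t / 2) @ G(t / 2)` agrees with `G(t)` up to rounding.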
The matrix square root arises in many applications, often in connection with other matrix problems such as the polar decomposition, matrix geometric means, Markov chains (roots of transition matrices), quadratic matrix equations, and generalized eigenvalue problems. Most often the matrix is symmetric positive definite, but square roots of nonsymmetric matrices are also needed. Among modern applications, the matrix square root can be found in recent papers on machine learning.
This is a minimal set of references, which contain further useful references within.
- Nicholas J. Higham, Functions of Matrices: Theory and Computation, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2008.
- Nicholas J. Higham and Lijing Lin, On $p$th roots of stochastic matrices, Linear Algebra Appl. 435, 448–463, 2011.
- Nicholas J. Higham, D. Steven Mackey, Niloufer Mackey and Françoise Tisseur, Functions preserving matrix groups and iterations for the matrix square root, SIAM J. Matrix Anal. Appl. 26(3), 849–877, 2005.
- Roger Horn and Charles Johnson, Topics in Matrix Analysis, Cambridge University Press, 1991.