A square root of an matrix
is any matrix
such that
.
For a scalar (
), there are two square roots (which are equal if
), and they are real if and only if
is real and nonnegative. For
, depending on the matrix there can be no square roots, finitely many, or infinitely many. The matrix
is easily seen to have no square roots. The matrix
has four square roots, . The identity matrix
has infinitely many square roots (namely the involutory matrices), including
, the lower triangular matrix
and any symmetric orthogonal matrix, such as
(which is a Householder matrix). Clearly, a square root of a diagonal matrix need not be diagonal.
The matrix square root of most practical interest is the one whose eigenvalues lie in the right half-plane, which is called the principal square root, written . If
is nonsingular and has no eigenvalues on the negative real axis then
has a unique principal square root. For the diagonal matrix
above,
.
A symmetric positive definite matrix has a unique symmetric positive definite square root. Indeed if is symmetric positive definite then it has a spectral decomposition
, where
is orthogonal and
is diagonal with positive diagonal elements, and then
is also symmetric positive definite.
If is nonsingular then it has at least
square roots, where
is the number of distinct eigenvalues. The existence of a square root of a singular matrix depends on the Jordan structure of the zero eigenvalues.
In some contexts involving symmetric positive definite matrices , such as Kalman filtering, a matrix
such that
is called a square root, but this is not the standard meaning.
When has structure one can ask whether a square root having the same structure, or some other related structure, exists. Results are known for (for example)
- stochastic matrices,
-matrices,
- skew-Hamiltonian matrices,
- centrosymmetric matrices, and
- matrices from an automorphism group.
An important distinction is between square roots of that can be expressed as a polynomial in
(primary square roots) and those that cannot. Square roots of the latter type arise when
has repeated eigenvalues and two copies of an eigenvalue are mapped to different square roots. In some contexts, a nonprimary square root may be the natural choice. For example, consider the matrix
which represents a rotation through an angle radians clockwise. The natural square root of
is
. For
, this gives the square root
of
The matrix square root arises in many applications, often in connection with other matrix problems such as the polar decomposition, matrix geometric means, Markov chains (roots of transition matrices), quadratic matrix equations, and generalized eigenvalue problems. Most often the matrix is symmetric positive definite, but square roots of nonsymmetric matrices are also needed. Among modern applications, the matrix square root can be found in recent papers on machine learning.
References
This is a minimal set of references, which contain further useful references within.
- Nicholas J. Higham, Functions of Matrices: Theory and Computation, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2008.
- Nicholas J. Higham and Lijing Lin, On
th roots of stochastic matrices, Linear Algebra Appl. 435, 448–463, 2011.
- Nicholas J. Higham, D. Steven Mackey, Niloufer Mackey and Françoise Tisseur, Functions preserving matrix groups and iterations for the matrix square root, SIAM J. Matrix Anal. Appl. 26(3), 849–877, 2005
- Roger Horn and Charles Johnson, Topics in Matrix Analysis, Cambridge University Press, 1991.
Related Blog Posts
This article is part of the “What Is” series, available from https://nhigham.com/category/what-is and in PDF form from the GitHub repository https://github.com/higham/what-is.
Hello,
thank you very much for the post! It was really useful and one of the clearest sources I have found.
I have one question regarding this statement: “A symmetric positive definite matrix has a unique symmetric positive definite square root.”
Can it be generalised to symmetric positive semi-definite matrices?
Thank you
Ferdinando
Yes – a symmetric positive semidefinite matrix has a unique symmetric positive semidefinite square root.
Thanks for your answer.