Before we will deal with normal forms for real matrices having a characteristic polynomial with non-real roots, we will discuss the extension of a real vector space towards a complex one and the question how the original real vector space can be found in the complex one.
Let be a real vector space. Take a look at the set
consisting of all vectors of the form , where and are vectors of , and the addition is formal (hence, can also be seen as the pair ). This is a complex vector space if it is provided with the following operations:
We will call this the extension of to a complex vector space.
The map defined by for in is called complex conjugate. This map is semi-linear, which means that, for all complex numbers and and all vectors and of we have
where indicates the complex conjugate of .
If is a basis for , then is also a basis for . In particular and have the same dimension.
If is a real -matrix, then can be interpreted as a linear map , where or . In the case the map is equal to the map of the second case. There is little confusion between the interpretation of as a linear map with a real vector space as domain and as a linear map on a complex vector space.
The fact that is semi-linear follows from the following two calculations, where we will write and for in :
With the term complex conjugate it's possible to get confused: complex conjugation of the elements of the matrix gives while conjugation over of results in a matrix of the form where is an invertible -matrix. The second operation is explicitly not named 'complex conjugate'. In the special case where is a permutation matrix corresponding to the permutation the two definitions are the same.
To view a complex vector space as a real space, no expansion is needed:
A complex vector space can be seen as a real vector space by limitation of scalars. This means we will only view the real numbers of as scalars. We will write to indicate this real vector space.
Let be a real vector space. After limitation of scalars for the semi-linear map complex conjugate becomes a linear map with the property that is the eigenspace of at eigenvalue and the eigenspace of at eigenvalue .
If is a basis for , then is a basis for . In particular, the dimension of is twice as large as the dimension of .
If we start with a real vector space , extend the scalars to obtain and next restrict the scalars, we get the real vector space of which is indeed a subspace, but of dimension twice the dimension of . The question how to recover from can be answered in multiple ways. The statement about gives one method: the minimal polynomial of is equal to , so
The other term of the direct sum is .
If , then determines a unique linear map . This map commutes with , so that is invariant under . Furthermore, the limitation of to is equal to . Since the characteristic polynomial of is a product of linear factors, we can find a basis for , in such a way that the matrix has a Jordan form. With help of the real and imaginary parts of this basis, we will find a basis of so that has a form closely resembling the Jordan normal form.
Take a look at the diagonal matrix where is a non-real complex number and the complex conjugate of . This matrix is conjugate with the real matrix This follow directly from the fact that the characteristic polynomials of and are both equal to , because two -matrices with two corresponding (but distinct) eigenvalues are always conjugate.
If we start with the matrix , we will find the diagonal form after determining the complex eigenvalues.
Vice versa, we can, starting with as above, find a real matrix conjugate to (such as ) in the following manner: we search for a 2-dimensional real subspace of which is kept invariant by . This boils down to determining an invertible -matrix such that is spanned by the columns vectors and . The matrix of relative to the basis for must have real elements. This matrix is , hence, the complex conjugate of this matrix must be equal to . Furthermore, the complex conjugate of is conjugated to through . We will use this to find a solution for :
With this we have found the conjugator that conjugates to .
As we will see later, it is possible to find a unique matrix in every conjugation class of real -matrices, starting with the complex Jordan normal form. However, with the theorem below, it is easy to find out when two real -matrices are conjugate, which is the case when they are conjugate as complex matrices.
If two real -matrices are conjugate over , then they are also conjugate over .
Assume that two -matrices and are conjugate over . Then there exists a complex invertible matrix so that . After multiplying from the left by on both sides we have Write , where and are the real and imaginary part of . We then have and thus
Hence, it is sufficient to derive the existence of a real number where is invertible. Therefore, consider the determinant as a function of . This is a polynomial function of degree in with real coefficients because and are real. Since is invertible, so the polynomial function is not the zero function. The polynomial function has no more than zeros, so that there must exist a real value of for which , so that is a real invertible matrix. This proves the fact.
In the complex case we already know that two -matrices are conjugate if and only if they have the same Jordan normal form. The theorem tells us that this is also sufficient to determine if two real -matrices are conjugate. The proof of the theorem shows us how, if this is the case, a real conjugating matrix can be found, starting with a complex conjugating matrix . Finding the Jordan form consists of finding a suitable basis for ; the complex conjugating matrix consists of the columns belonging to that basis.
In that way we can determine for each pair of -matrices if they are conjugate, and even indicate a conjugating matrix. The real Jordan form of such a matrix, which we will treat later on, is (apart from a permutation of the Jordan blocks) a unique matrix from the conjugation class of .
Consider the -matrix The minimal polynomial of is equal to the characteristic polynomial: The roots of this polynomial are and . The corresponding eigenvectors are and , respectively. The matrix is conjugate to the complex diagonal matrix The corresponding coordinate transformation is so .
The matrices and are not real. If we regard , and as linear transformations of , then the eigenvectors (the columns of ) each span a complex invariant subspace of of dimension . By taking the real and imaginary part of the first eigenvector , we find a basis for the original real vector space relative to which has a real matrix. Determine that matrix.
By restricting scalars, we can view the linear map as a linear map , where .
As a basis for we choose the real and imaginary part of the eigenvector with eigenvalue :
The corresponding transition matrix is . The matrix of relative to said basis is
If we work with the eigenvector of rather than the eigenvector of , we get the transition matrix . The matrix of with respect to the corresponding basis is
As the notation of the overline above suggests, this basis is the complex conjugate of the columns of .
The matrices and are conjugate by means of :
The matrices and are the real -matrices with respect to the basis that belong to complex multiplication by and , respectively.