To a linear map of an -dimensional vector space to itself belongs a matrix as soon as we have chosen a basis for . If is the matrix with respect to a second basis for , then and are conjugate, meaning: there exists an invertible -matrix with the property that . For example, the transition matrix has this property.
We keep in mind that a matrix is diagonizable if it is conjugate to a diagonal matrix, and not all matrices are diagonizable. Here we will discuss a normal form that can be found for all matrices. We will start with matrices whose characteristic polynomial only has one root.
Each -matrix with eigenvalue and characteristic polynomial can be transformed to Jordan normal form (in short: Jordan form); meaning: there is an invertible matrix such that
where the
-matrix
is equal to
and is called the
Jordan block of size
corresponding eigenvalue
.
The number of Jordan blocks of size satisfies
Assume that
is also an
-matrix with eigenvalue
whose characteristic polynomial is
. Then
and
are conjugate if and only if the number of Jordan blocks of size
match for
.
For a diagonalizable matrix all Jordan blocks have size .
Multiple Jordan blocks of the same dimension corresponding to eigenvalue are possible. The total number of Jordan blocks is equal to the dimension of the corresponding eigenspace , the null space .
From the Jordan form of we can tell right away what the characteristic polynomial of the matrix is: . A simple example is given by the matrices and , both having characteristic polynomial .
For determining the Jordan form in case of eigenvalues, we only have to determine the dimensions of the null spaces , where and runs from to , the multiplicity of in the minimal polynomial, equal to the lowest index for which is maximal.
First we determine the Jordan normal form for . It is sufficient to prove the statement for ; once this is proven, we apply the results to with eigenvalue . In order to find the Jordan form for , we will only need to add the scalar multiple by of the identity to the Jordan blocks for .
Let be the degree of the minimal polynomial of . Then is the smallest number such that . For every positive integer we choose a complement of in . This means we have the following direct sum decomposition:
We state that
is the direct sum of all subspaces
for
.
To see this, we first prove that
is equal to the sum
of the vector spaces
. To this end, we determine consecutively, for each
, the following claim:
The case
boils down to the inclusion
which follows directly from
. Suppose, therefore, that
and assume that the claim is proven for
.
Suppose . Then there exists a vector such that . From the assumption it follows that
from which we conclude that
. Thus we have proven the claim for
. Consequently, for each
,
so
This way we have determined that
is the sum of all subspaces
for
.
To show that this sum is a direct sum, we have to verify that the intersection of the summand with the sum of all other summands only consists of the zero vector. This follows directly from the following statement:
If , where and , we have for all .
After all, if, for an arbitrary nonzero vector , the image belongs to a sum of other summands than of , then a non-trivial linear combination as in this statement can be found, which leads to a contradiction with the assumption that is distinct from the zero vector.
Assume that this statement is not true. Then there are indices and such that . Since has intersection with , then we also have , and, since , also . There exists a maximum index such that, for a certain , we have .
Now for each and with . Therefore, the equality in the assumption of the statement can be rewritten as
Because
, we have
. Hence, the right-hand side of the equality above lies in
. For the left-hand side we have the same:
Write
and
, so
We find that
and
. In particular we have
. By reasoning for
in the same manner as we did above for
(with
instead of
), etc., we find
. But this contradicts our choice of
.
We have shown that is the direct sum of subspaces . For every we choose a basis of . We then get Jordan blocks with respect to the basis consisting of for . By joining these vectors for we get a basis of the whole vector space with respect to which the matrix of takes on the correct form.
For a proof of the formula, we write . A Jordan block of size is defined on a set of independent vectors in . In this last subspace each Jordan block of bigger length than leads to exactly one basis vector. This number is . Moreover, in exactly are contained in . Exactly basis vectors remain in , each belonging to a unique Jordan block of size . We conclude
This statement is true for all natural numbers
. In particular we have
Application of these formulas gives
Now we will verify why two -matrices and whose characteristic polynomial is for a certain scalar , are conjugate if and only if the number of Jordan blocks of size match for all . Because of the above, we know that each of the two matrices is conjugate to a Jordan form and that the list of sizes of the corresponding Jordan blocks are equal to each other, apart from the order. Two Jordan blocks in a matrix can be rearranged by conjugation with a permutation matrix, where the basis of one block is replaced by the basis of the other block; the corresponding permutation permutes the basis vectors of one Jordan block with those of the other block. If we continue like that with rearranging Jordan blocks, we can make sure that a Jordan form for by conjugation with a permutation matrix transforms into a Jordan form for . Thus, we have proven that and are conjugate if they have the same number of Jordan blocks of size for all .
Finally we note that and are not conjugate if a number of Jordan blocks of size differs for an . This follows from the fact that the numbers in the formulas above are expressed in terms of the dimension of the kernels of , numbers that do not change if is replaced by a conjugate.
Since the blocks belong to a direct sum decomposition, is the sum of the dimensions of the Jordan blocks. This means that
where
is the multiplicity of
in the minimal polynomial. On the one hand, this formula can be used as a verification of all found values of
. On the other hand, the formula is a key to the interpretation of the numbers
as a
partition of
, that is, a way to express
as a sum of natural numbers. Here,
indicates the number of terms in the sum equal to
. The order of these terms does not matter.
The partition
determines the Jordan normal form
at
with along the diagonal
Jordan blocks of size
, next
Jordan blocks of size
, and so on. Because of this, the Jordan normal form corresponding to
is determined uniquely (apart from the order of the Jordan blocks, hence also apart from conjugacy by a permutation matrix). Because two
-matrices
and
with the same characteristic polynomial
are conjugate if and only if the number
of Jordan blocks correspond for all
, the number of conjugacy classes is equal to the number of possible partitions of
.
We apply these results on the matrix of a linear map restricted to each of its generalized subspaces. The information on the Jordan blocks uniquely determines the conjugacy class of .
Assume that is a vector space of finite dimension , that is a linear map, and that the characteristic polynomial of is a product of linear factors:
where
are different from each other.
Then there are unique numbers
such that the matrix of the restriction of
to the generalized eigenspace corresponding to
with respect to a suitably chosen basis has a Jordan normal form with
blocks of size
.
A linear map has the the same matrix as relative to a suitably chosen basis for if and only if it has the same eigenvalues as and the same sizes (for ) of Jordan blocks corresponding to each eigenvalue (for ).
This solves the conjugation problem for complex vector spaces, since then each polynomial is a product of linear factors. In the real case the characteristic polynomial may have quadratic factors with non-real complex roots. This case will be discussed later.
According to the direct sum decomposition we can write as the direct sum of the generalized eigenspaces , where runs over the roots of . Since are invariant subspaces of under and since their order can be rearranged to any other order by conjugation with a permutation matrix, we can limit ourselves to the finding and studying of the Jordan normal form for each of . The result for restricted to each of the generalized subspaces follows directly from the above theorem The Jordan form with one eigenvalue.
The information on the sizes of the Jordan blocks suffices to determine the minimal polynomial: If is the maximum of the sizes (for ) of the Jordan blocks corresponding to eigenvalue (for ), then
To determine the conjugacy class of a complex square matrix we can use a table in which the Jordan blocks are described for each eigenvalue. To guarantee the uniqueness of the Jordan normal form we should order the occurring Jordan blocks along the diagonal in the Jordan normal form. Yet we do not worry about this, since we know that each order can be changed into every other order by a permutation matrix and because it is easy to check if tables with eigenvalues and sizes of corresponding Jordan blocks are equal to one another.
For the conjugacy class of a complex -matrix we have now found a good characteristic: the eigenvalues with the numbers of the Jordan blocks with size for each corresponding to each eigenvalue uniquely determine the conjugacy class.
This data is equivalent to the dimensions of the kernels of . We have already seen that the numbers follow from the dimensions thanks to the formula
Conversely it is easy to see that
where
is the multiplicity of
in the minimal polynomial
of
.
For the interpretation in terms of Jordan normal form the numbers are useful. For calculations the numbers are preferable: on the one hand they directly follow from determining the dimensions of the kernels (or images) of linear maps, on the other hand they form a strictly increasing sequence ending at index :
Consider the matrix
The characteristic polynomial of the matrix is equal to
. Therefore, the eigenvalues of
are
and
.
Which of the matrices below is a Jordan normal form of
?
Since the multiplicity of
in the characteristic polynomial is equal to
, the dimension of generalized eigenspace
equals
. Since the multiplicity of
in the characteristic polynomial is equal to
, the dimension of generalized eigenspace
equals
.
To determine the size of the Jordan blocks, we first calculate the dimensions
for the eigenvalues
,
and
. If
, this dimension is equal to
for
and if
, then this dimension is equal to
for
.
According to the theorem
The Jordan form with one eigenvalue, the numbers
,
,
of Jordan blocks of
corresponding to eigenvalue
of size
,
,
, respectively, can be determined as follows (not all steps are necessary because the dimension
has already reached the dimension of the generalized eigenspace after the first step):
Similarly, we find that only the number of Jordan blocks of size
is nonzero for eigenvalue
. We conclude that
has
- exactly one Jordan block of size on the generalized eigenspace corresponding to
- exactly one Jordan block of size on the generalized eigenspace corresponding to .
Because of the similarity in size of the Jordan blocks, we conclude that the Jordan form of
is equal to