simultaneous triangularisation of commuting matrices over any field
Let denote the (column) vector whose th position is and where all other positions are . Denote by the set. Denote by the set of all matrices over , and by the set of allinvertible elements of . Let be the functionwhich extracts the th diagonal element of a matrix, i.e., .
Theorem.
Let be a field, let be pairwise commuting matrices, and let be a field extensionof in which the characteristic polynomials
of all split (http://planetmath.org/SplittingField). Then there exists some such that
- 1.
is upper triangular for all ,and
- 2.
if are such that and for all, then for all as well.
The proof relies on two lemmas.
Lemma 1.
Let be a field, let be pairwise commuting matrices, and let be a field extensionof in which the characteristic polynomials of all split. Then there exists some nonzero whichis an eigenvector of for all .
Lemma 2.
For any sequence of uppertriangular pairwise commuting matrices and every row index, there exists such that
Proof.
This is by induction on . The induction hypothesis is that givenpairwise commuting matrices ,whose characteristic polynomials all split in , and asequence of arbitrary scalars ,there exists some such that:
- 1.
is upper triangular for all .
- 2.
If some are such that and for all ,then .
- 3.
If some is such that for all , then for all .
For this hypothesis is trivially fulfilled (all matrices are upper triangular). Assume that it holds for andconsider the case .
It is easy to see that condition 1 impliesthat must be an eigenvector that is common to all thematrices. If there exists a nonzero vector such that for all then this is such a common eigenvector, and in that case let for all . Otherwise there byLemma 1 exists a vector such that for some. Either way, one gets asuitable candidate for and eigenvalues that incidentally will satisfy for all .
Let be arbitraryvectors such that is a basis of. Let be the matrix whose th columnis for .11Byimposing extra conditions on the choice of the basis (such as for example requesting thatit is orthonormal) at this point, one can often prove a strongerclaim where the choice of is restricted to some smallergroup of matrices (for example the group of orthogonalmatrices), but this requires assuming additional things aboutthe fields and .Then is invertible
and for each the first column of is
Furthermore
for all and .
Now let be the matrix formed from rows and columns though of . Since byexpansion (http://planetmath.org/LaplaceExpansion) along the first column,it follows that the characteristic polynomial of splits in. Furthermore all the have side andcommute pairwise with each other, whence by the induction hypothesisthere exists some such that every is upper triangular. Let . Then the submatrix consisting of rows and columns through of is equal to and hencecontains no nonzero subdiagonal elements. Furthermore the firstcolumn of is equal to the first column of andthus the are all upper triangular, as claimed.
It also follows from the induction hypothesis that can be chosensuch that for all if there is any for which for all andmore generally if are such that for all then similarly for all . This has verifiedcondition 2 of the induction hypothesis.For the remaining condition 3, one may first observethat if there is some such that for all then by Lemma 2there exists a nonzero such that for all . Thismeans will fulfill the condition for choice of, and hence asclaimed.
The theorem now follows from the principle of induction.∎