A quadratic form on a vector space can be regarded as a homogeneous, second-degree polynomial in the coordinates of a vector relative to a fixed chosen basis of the vector space. An ordinary quadratic polynomial is the sum of a quadratic form, a linear map, and a constant function.
A quadratic polynomial function on a vector space is the sum of a quadratic form unequal to , a linear function and a constant function .
Let be a basis for . Then the function as above is determined by
for in , where is the matrix of with respect to and satisfies .
The quadric determined by is the set of vectors of that satisfy .
Two quadrics are called congruent if one can be obtained from the other by application of an isometry. If is an isometry, then the quadric determined by as above is congruent to the quadric determined by .
Often we write the coordinates of the vector as arguments of and rather than the vector itself. Thus, we do not make a distinction between and if is a vector of .
The quadratic polynomial on determined by can also be written as
A quadric is also called quadratic (hyper) surface. If the vector space has dimension , this is a quadratic curve of the plane. If the dimension is equal to , we also speak of a quadratic surface.
If is a quadric, then, up to a constant factor, there is a unique quadratic polynomial function determining . This can be proved by showing that, up to a constant factor, the polynomial function is fully determined by the fact that it has value on sufficiently many points of . A concrete lower limit on this number is equal to the dimension of the linear space of all quadratic forms on . In particular, a quadratic curve of the plane is determined by of its points with the property that no three of them lie on a single line.
Let be the quadric determined by and an invertible linear map . Then belongs to if and only if is on the quadric of . After all, so is equal to zero precisely if . This shows that the image under of the quadric determined by the standard polynomial is the quadric of .
The converse of the statement regarding congruence is also true. If is a quadric that is congruent to , then, by definition, there is an isometry which transforms into , so is the quadric determined by .
Consider the quadratic form on where The expression as a linear combination of two squares is not unique. For example, by
completing the square, the function can be written as a sum of quadratic functions as follows:In terms of matrices this equation can be written aswhere By replacing left and right by , we find where consists of the columns of Therefore, the matrix of the quadratic form with respect to the basis is also diagonal. But the matrix is not orthogonal. Therefore, we cannot conclude that the quadric determined by is congruent to the ellipse determined by . There is no
isometry that transforms the ellipse determined by into the ellipse determined by since the lengths of the axes (that is, line segments connecting pairs of points at largest distance on the ellipse) differ between and .
Theorem Quadratic forms and symmetric matrices says every quadratic form can be put in diagonal form by means of an orthogonal map. Here we show that every quadratic polynomial function can be brought in standard form by use of an isometry.
Let be a quadratic polynomial function on . Then there are an isometry , an index , and real numbers such that is unequal to , and
where .
The quadratic polynomial function can be written as follows as a function of a column vector
Here, is a symmetric matrix, is a column vector, and is a real number. We view the expression as the matrix product of a -matrix with an -matrix, which results in a real number; it is a different way of writing the inner product . Similarly, is equal to the inner product .
Let be an orthonormal basis of eigenvectors of with eigenvalues . Here we choose the order in such a way that the eigenvalues are equal to at the end. As a consequence, there is an index with for and for . We indicate the coordinates of with respect to by . The correspondence is
where the columns of are the vectors , so . In terms of vectors:Substitution in the function rule for gives Here, is the -diagonal matrix with the eigenvalues on the diagonal and is given by
Thus, the components of are the -coordinates of the vector .
If , then we can eliminate the linear term with by completing the square. Here we proceed by using translations: Let be a column vector in , and write so . Substituting this expression for in gives In order to eliminate the terms linear in for , we choose where is the diagonal matrix with as its -th diagonal entry for and zeros elsewhere. So, on the subspace perpendicular to the kernel of , the matrix is the inverse of , and The result is where is the vector with zeros in the first coordinates and for .
If , then the function rule for is as required, that is, as in the second case with . To see this, we choose so and If we replace the argument on both sides by , we find the required formula.
Assume, therefore, that is not equal to the zero vector. In the rest of the proof we will only consider isometries of which fix the first coordinates. So they will leave the following linear subspace invariant: Because the first coordinates of are equal to zero, this vector lies in .
Put . There is an orthogonal map which fixes each vector in and maps onto . An example is the orthogonal reflection . Because is orthogonal, according to property 5 of orthogonal maps it also leaves invariant, such that we find for an arbitrary vector in : Below, we use this result in the form . Write Then we have for all in , such that Finally, we choose the vector and we write so and This shows that we arrive at the function rule of the first case.
We verify that, also in the first case, the result can be formulated as indicated in the theorem. Composing all transformations involved, we see that is an isometry that satisfies : Using this in the function rule for , we find Finally, we replace by , and arrive at
The proof also provides a method for bringing a quadric defined by into standard form by means of isometries:
- Step 1: Find an orthonormal basis of consisting of eigenvectors of . This gives the orthogonal matrix such that is in diagonal form.
- Step 2: Cancel with the aid of a translation the linear terms of variables that also occur in the homogeneous quadratic part.
If there remains no linear term, then is an isometry as required. Otherwise, we carry out two more steps:
- Step 3: Rewrite the linear part with the aid of an orthogonal transformation to a constant multiple of , where is the rank of . The transformation must fix the first basis vectors. The orthogonal reflection satisfies, where is the vector of the coefficients of the linear terms.
- Step 4: Cancel the constant term by means of a translation .
Now is an isometry with the property that is in standard form. Examples of the working of this algorithm are given at the bottom of the page.
We consider the consequences of the theorem for quadratic curves. Up to congruence and multiplication by a nonzero constant, there are nine cases for the equation of the curve in terms of , where the polynomial function is scaled such that the coefficient of equals . We denote the coefficient of by , the coefficient of by , and the constant term by :
In general, the fact that the degree equals 2 means that every line intersects the curve in two points. Exceptions are the single line, the point, and the empty set. In the case of one line, this rule is often made valid by speaking of two overlapping lines. In the other two cases, the two points can be found after extension of the plane to a complex inner product space.
The type of geometric figure, where we count a circle as being an ellipse, is also invariant under affine transformations (that is, compositions of translations and invertible linear maps). The precise coefficients in the standard form will not be preserved, but the signs of the quadratic terms will be.
The shape and size of a quadric remains unchanged under an isometry . The isometry only changes its position in space. The quadric determined by does not change when is multiplied by a nonzero scalar.
The second step of the proof as well as the algorithm is in fact a completion of the square. The translation along can be described as follows: If , we write
For all with we replace by .
The origin is a special point for the quadratic polynomial functions in standard form. It is a sort of center of gravity and the point of intersection of the main axes (read: the -dimensional spans of linearly independent eigenvectors of ).
The standard form is not unique. For example, the ordering of diagonal entries of is not fixed (although we often order the diagonal entries according to decreasing absolute value), and the coefficient of the linear term with can be multiplied by .
Consider the quadratic function on given by
Determine a
standard form for .
We write The characteristic polynomial of in the variable is
Therefore, the eigenvalues of are
Since is already in diagonal form and the eigenvalues are ordered in such a way that all entries equal to are at the end, we do not need to carry out the first step of the algorithm for bringing into standard form. In order to get rid of terms that are linear in variables that also appear in quadratic terms, we use the translation vector Substitution of by gives
We now come to the third step of the algorithm. There is no linear term, so the polynomial function needs no further adjustment. We have arrived at a standard form:
The quadric given by can be described even simpler by dividing by . We conclude that the quadric of is congruent to the set of points satisfying The isometry that transforms the quadric of into the quadric of is
The quadric of , and therefore also of and its standard form , is a circle.