Resultant

This article is about the resultant of polynomials. For the result of adding two or more vectors, see Parallelogram rule. For the musical phenomenon, see Resultant tone.

In mathematics, the resultant of two polynomials is a polynomial expression of their coefficients, which is equal to zero if and only if the polynomials have a common root (possibly in a field extension), or, equivalently, a common factor (over their field of coefficients). In some older texts, the resultant is also called eliminant.^[1]

The resultant is widely used in number theory, either directly or through the discriminant, which is essentially the resultant of a polynomial and its derivative. The resultant of two polynomials with rational or polynomial coefficients may be computed efficiently on a computer. It is a basic tool of computer algebra, and is a built-in function of most computer algebra systems. It is used, among others, for cylindrical algebraic decomposition, integration of rational functions and drawing of curves defined by a bivariate polynomial equation.

The resultant of n homogeneous polynomials in n variables or multivariate resultant, sometimes called Macaulay's resultant, is a generalization, introduced by Macaulay, of the usual resultant.^[2] It is, with Gröbner bases, one of the main tools of effective elimination theory (elimination theory on computers).

Notation

The resultant of two univariate polynomials $A$ and $B$ is commonly denoted $\operatorname {res} (A,B)$ or $\operatorname {Res} (A,B).$

In many applications of the resultant, the polynomials depend on several indeterminates and may be considered as univariate polynomials in one of their indeterminates, with polynomials in the other indeterminates as coefficients. In this case, the indeterminate that is selected for defining and computing the resultant is indicated as a subscript: $\operatorname {res} _{x}(A,B)$ or $\operatorname {Res} _{x}(A,B).$

The degree of the polynomials are used in the definition of the resultant. However, a polynomial of degree $d$ may also be considered as a polynomial of higher degree such the leading coefficients are zero. If such a higher degree is used for the resultant, it is usually indicated as a subscript or a superscript, such as $\operatorname {res} _{d,e}(A,B)$ or $\operatorname {res} _{x}^{d,e}(A,B).$

Definition

The resultant of two univariate polynomials over a field or over a commutative ring is commonly defined as the determinant of their Sylvester matrix. More precisely, let

A=a_{0}x^{d}+a_{1}x^{d-1}+\cdots +a_{d}

and

B=b_{0}x^{e}+b_{1}x^{e-1}+\cdots +b_{e}

be nonzero polynomials of respective degrees $d$ and $e$ . Let us denote by ${\mathcal {P}}_{i}$ the vector space (or free module if the coefficients belong to a commutative ring) of dimension i whose elements are the polynomials of degree less than i. The map

\varphi :{\mathcal {P}}_{e}\times {\mathcal {P}}_{d}\rightarrow {\mathcal {P}}_{d+e}

such that

\varphi (P,Q)=AP+BQ

is a linear map between two spaces of the same dimension. Over the basis of the powers of $x$ , this map is represented by a square matrix of dimension $d + e$ , which called the Sylvester matrix of $A$ and $B$ (for many authors and in the article Sylvester matrix, the Sylvester matrix is defined as the transpose of this matrix; this convention is not used here, as it breaks the usual convention for writing the matrix of a linear map).

The resultant of $A$ and $B$ is thus the determinant

{\begin{vmatrix}a_{0}&0&\cdots &0&b_{0}&0&\cdots &0\\a_{1}&a_{0}&\cdots &0&b_{1}&b_{0}&\cdots &0\\a_{2}&a_{1}&\ddots &0&b_{2}&b_{1}&\ddots &0\\\vdots &\vdots &\ddots &a_{0}&\vdots &\vdots &\ddots &b_{0}\\\vdots &\vdots &\cdots &a_{1}&\vdots &\vdots &\cdots &b_{1}\\a_{d}&a_{d-1}&\cdots &\vdots &b_{e}&b_{e-1}&\cdots &\vdots \\0&a_{d}&\ddots &\vdots &0&b_{e}&\ddots &\vdots \\\vdots &\vdots &\ddots &a_{d-1}&\vdots &\vdots &\ddots &b_{e-1}\\0&0&\cdots &a_{d}&0&0&\cdots &b_{e}\end{vmatrix}},

which has $e$ columns of $a i$ and $d$ columns of $b j$ (for simplification, $d = e$ in the displayed determinant).

In the case of monic polynomials over an integral domain the resultant is equal to the product

\prod _{(x,y)\colon f(x)=g(y)=0}(x-y),

where $x$ and $y$ run over the roots of the polynomials over an algebraically closed field containing the coefficients. For non-monic polynomials with leading coefficients $a 0$ and $b 0$ , respectively, the above product is multiplied by $a_{0}^{e}b_{0}^{d}.$

Properties

In this section and its subsections, $A$ and $B$ are two polynomials in $x$ of respective degrees $d$ and $e$ , and their resultant is denoted $\operatorname {res} (A,B).$

Characterizing properties

If $d = 0$ (that is if $A=a_{0}$ is a nonzero constant) then $\operatorname {res} (A,B)=a_{0}^{e}.$ Similarly, if $e = 0$ , then $\operatorname {res} (A,B)=b_{0}^{d}.$
$\operatorname {res} (x-a_{0},y-b_{0})=b_{0}-a_{0}$
$\operatorname {res} (B,A)=(-1)^{de}\operatorname {res} (A,B)$
$\operatorname {res} (AB,C)=\operatorname {res} (A,C)\operatorname {res} (B,C)$
The preceding properties characterize the resultant. In other words, the resultant is the unique function of the coefficients of polynomials that has these properties.

Zeros

The resultant of two polynomials with coefficients in an integral domain is zero if and only if they have a common divisor of positive degree.
The resultant of two polynomials with coefficients in an integral domain is zero if and only if they have a common root in an algebraically closed field containing the coefficients.
There exist a polynomial $P$ of degree less than $e$ and a polynomial $Q$ of degree less than $d$ such that $\operatorname {res} (A,B)=AP+BQ.$ This is a generalization of Bézout's identity to polynomials over an arbitrary commutative ring. In other words, the resultant of two polynomials belongs to the ideal generated by these polynomials.

Invariance by ring homomorphisms

Let $A$ and $B$ be two polynomials of respective degrees $d$ and $e$ with coefficients in a commutative ring $R$ , and $\varphi \colon R\to S$ a ring homomorphism of $R$ into another commutative ring $S$ . Applying $\varphi$ to the coefficients of a polynomial extends $\varphi$ to a homomorphism of polynomial rings $R[x]\to S[x]$ , which is also denoted $\varphi .$ With this notation, we have:

If $\varphi$ preserve the degrees of $A$ and $B$ (that is if $\deg(\varphi (A))=d$ and $\deg(\varphi (B))=e$ ), then

\operatorname {res} (\varphi (A),\varphi (B))=\varphi (\operatorname {res} (A,B)).

If $\deg(\varphi (A))<d$ and $\deg(\varphi (B))<e,$ then

\operatorname {res} (\varphi (A),\varphi (B))=0.

If $\deg(\varphi (A))=d$ and $\deg(\varphi (B))=f<e,$ and the leading coefficient of $A$ is $a_{0}$ then

\varphi (a_{0})^{e-f}\operatorname {res} (\varphi (A),\varphi (B))=\varphi (\operatorname {res} (A,B)).

If $\deg(\varphi (A))=f<d$ and $\deg(\varphi (B))=e,$ and the leading coefficient of $B$ is $b_{0}$ then

\varphi (b_{0})^{d-f}\operatorname {res} (\varphi (A),\varphi (B))=(-1)^{e(d-f)}\varphi (\operatorname {res} (A,B)).

These properties are easily deduced from the definition of the resultant as a determinant. They are mainly used in two situations. For computing a resultant of polynomials with integer coefficients, it is generally faster to compute it modulo several primes and to retrieve the desired resultant with Chinese remainder theorem. When $R$ is a polynomial ring in other indeterminates, and $S$ is the ring obtained by specializing to numerical values some or all indeterminates of $R$ , these properties may be restated as if the degrees are preserved by the specialization, the resultant of the specialization of two polynomials is the specialization of the resultant. This property is fundamental, for example, for cylindrical algebraic decomposition.

Invariance under change of variable

$\operatorname {res} (A(x+a),B(x+a)=\operatorname {res} (A(x),B(x))$
$\operatorname {res} (A(ax),B(ax)=a^{de}\operatorname {res} (A(x),B(x))$
If $A_{r}(x)=x^{d}A(1/x)$ and $B_{r}(x)=x^{e}B(1/x)$ are the reciprocal polynomials of $A$ and $B$ , respectively, then

\operatorname {res} (A_{r},B_{r})=(-1)^{de}\operatorname {res} (A,B)

This means that the property of the resultant being zero is invariant under linear and projective changes of the variable

Invariance under change of polynomials

If $a$ and $b$ are nonzero constants (that is they are independent of the indeterminate $x$ ), and $A$ and $B$ are as above, then

\operatorname {res} (aA,bB)=a^{e}b^{d}\operatorname {res} (A,B)

If $d=\deg(A)\geq e=\deg(B),$ if a is a constant and $b_{0}$ is the leading coefficient of $B$ , and if $C$ is a polynomial of degree at most $d-e$ then

b_{0}^{d-e}\operatorname {res} (aA-CB,B)=a^{e}\operatorname {res} (A,B)

These properties imply that in Euclidean algorithm for polynomials, the resultant of two successive remainders differs from the resultant of the initial polynomials by a factor, which is easy to compute. Moreover, the constant $a$ in above second formula may be chosen in order that the successive remainders have their coefficients in the ring of coefficients of input polynomials. This is the starting idea of the subresultant-pseudo-remainder-sequence algorithm for computing the greatest common divisor and the resultant of two polynomials. This algorithms works for polynomials over the integers or, more generally, over an integral domain, without any other division than exact divisions (that is without involving fractions). It involves $O(de)$ arithmetic operations, while the computation of the determinant of the Sylvester matrix with standard algorithms require $O((d+e)^{3})$ arithmetic operations.

Generic properties

In this section, we consider two polynomials

A=a_{0}x^{d}+a_{1}x^{d-1}+\cdots +a_{d}

and

B=b_{0}x^{e}+b_{1}x^{e-1}+\cdots +b_{e}

whose $d + e +2$ coefficients are distinct indeterminates. Let

R={\mathbb {Z}}[a_{0},\ldots ,a_{d},b_{0},\ldots ,b_{e}]

be the polynomial ring over the integers defined by these indeterminates. The resultant $\operatorname {res} (A,B)$ is often called the generic resultant for the degrees $d$ and $e$ . It has the following properties.

$\operatorname {res} (A,B)$ is an absolutely irreducible polynomial.
If $I$ is the ideal of $R[x]$ generated by $A$ and $B$ , then $I\cap R$ is the principal ideal generated by $\operatorname {res} (A,B)$ .

Homogeneity

The generic resultant for the degrees $d$ and $e$ is homogeneous in various ways. More precisely:

It is homogeneous of degree $e$ in $a_{0},\ldots ,a_{d}.$
It is homogeneous of degree $d$ in $b_{0},\ldots ,b_{e}.$
It is homogeneous of degree $d + e$ in all the variables $a_{i}$ and $b_{j}.$
If $a_{i}$ and $b_{i}$ are given the weight $i$ (that is, the weight of each coefficient is its degree as elementary symmetric polynomial), then it is quasi-homogeneous of total weight $de$ .
If $P$ and $Q$ are homogeneous multivariate polynomials of respective degrees $d$ and $e$ , then their resultant in degrees $d$ and $e$ with respect to an indeterminate $x$ , denoted $\operatorname {res} _{x}^{d,e}(P,Q)$ in § Notation, is homogeneous of degree $de$ in the other indeterminates.

Elimination properties

Let $I=\langle A,B\rangle$ be the ideal generated by two polynomials $A$ and $B$ in a polynomial ring $R[x],$ where $R$ is itself a polynomial ring over a field. Then:

$I\cap R=R\cdot r$ is a principal ideal generated by $r$ for some $r\in R$
$\operatorname {res} _{x}(A,B)\in R\cdot r$
There exists a positive integer $k$ such that $\operatorname {res} _{x}(A,B)^{k}\in Rr.$

An example where $k > 1$ in the latter property is $R={\mathbb {R}}[y],$ $A=x^{2}+y^{2}-1$ (the unit circle), and $B = y - 2$ . In this case, $\operatorname {res} _{x}(A,B)=(y-2)^{2}$ and $\langle A,B\rangle \cap R=R\cdot (y-2).$ This example has been chosen for having a prime ideal $\langle A,B\rangle .$ For another field of coefficients and another constant term in $B$ , one has also $k > 1$ , but the ideal may be non-prime.

Computation

Theoretically, the resultant could be computed by using the formula expressing it as a product of roots differences. However, as the roots may generally not be computed exactly, such an algorithm would be inefficient and numerically unstable. As the resultant is a symmetric function of the roots of each polynomial, it could also be computed by using the fundamental theorem of symmetric polynomials, but this would be highly inefficient.

As the resultant is the determinant of the Sylvester matrix (and of the Bézout matrix), it may be computed by using any algorithm for computing determinants. This needs $O(n^{3})$ arithmetic operations. As one knows algorithms with a better complexity (see below), this method is not used in practice.

It follows from § Invariance under change of polynomials that the computation of a resultant is strongly related with Euclidean algorithm for polynomials. This shows that the computation of the resultant of two polynomials of degrees $d$ and $e$ may be done in $O(de)$ arithmetic operations in the field of coefficients.

However, when the coefficients are integers, rational numbers or polynomials, these arithmetic operations imply a number of GCD computations of coefficients which is of the same order and make the algorithm inefficient. The subresultant pseudo-remainder sequences were introduced to solve this problem and avoid any fraction and any GCD computation of coefficients. A more efficient algorithm is obtained by using the good behavior of the resultant under a ring homomorphism on the coefficients: to compute a resultant of two polynomials with integer coefficients, one computes their resultants modulo sufficiently many prime numbers and then reconstructs the result with the Chinese remainder theorem.

The use of fast multiplication of integers and polynomials allows algorithms for resultants and greatest common divisors that have a better time complexity, which is of the order of the complexity of the multiplication, multiplied by the logarithm of the size of the input ( $\log(s(d+e)),$ where $s$ is an upper bound of the number of digits of the input polynomials).

Application to polynomial systems

Resultants were introduced for solving systems of polynomial equations and provides the oldest proof that there exist algorithms for solving such systems. There are primarily intended for systems of two equations in two unknown, but allow also solving general systems.

Case of two equations in two unknowns

Let us consider two polynomials the system of equations

{\begin{aligned}P(x,y)&=0\\Q(x,y)&=0,\end{aligned}}

where $P$ and $Q$ are polynomials of respective total degrees $d$ and $e$ . Then $R=\operatorname {res} _{y}^{d,e}(P,Q)$ is a polynomial in $x$ , which is generically of degree $de$ (by properties of § Homogeneity). A value $\alpha$ of $x$ is a root of $R$ , if and only if, either there exist $\beta$ in an algebraically closed field containing the coefficients, such that $P(\alpha ,\beta )=Q(\alpha ,\beta )=0$ , or $\deg(P(\alpha ,y))<d$ and $\deg(Q(\alpha ,y))<d$ (in this case, one says that $P$ and $Q$ have a common root at infinity for $x=\alpha$ ).

Therefore, solving the system amounts computing the roots of $R$ , and for each root $\alpha,$ computing the common root(s) of $P(\alpha ,y),$ $Q(\alpha ,y),$ and $\operatorname {res} _{x}(P,Q).$

It is worth to remark that Bézout's theorem results of the value of $\deg \left(\operatorname {res} _{y}^{d,e}(P,Q)\right).$

General case

At first glance, it seems that resultants may be applied to a general polynomial system of equations

P_{1}(x_{1},\ldots ,x_{n})=0

\vdots

P_{k}(x_{1},\ldots ,x_{n})=0

by computing the resultants of every pair $(P_{i},P_{j})$ with respect to $x_{n}$ for eliminating one unknown, and repeating the process until getting univariate polynomials. Unfortunately, this introduce many spurious solutions, which are difficult to remove.

A method, introduced at the end of 19th century, works as follows: introduce $k - 1$ new indeterminates $U_{2},\ldots ,U_{k}$ and compute

\operatorname {res} _{x_{n}}(P_{1},U_{2}P_{2}+\cdots +U_{k}P_{k}).

This is a polynomial in $U_{2},\ldots ,U_{k}$ whose coefficients are polynomials in $x_{1},\ldots ,x_{n-1},$ which have the property that $\alpha _{1},\ldots ,\alpha _{n-1}$ is a common zero of these polynomial coefficients, if and only if the univariate polynomials $P_{i}(\alpha _{1},\ldots ,\alpha _{n-1},x_{n})$ have a common zero, possibly at infinity. This process may be iterated until finding univariate polynomials.

For getting a correct algorithm two complements have to be added to the method. Firstly, at each step, a linear change of variable may be needed in order that the degrees of the polynomials in the last variable are the same as their total degree. Secondly, if, at any step, the resultant is zero, this means that the polynomials have a common factor and that the solutions split in two components. One, were the common factor is zero, and the other which is obtained by factoring out this common factor before continuing.

This algorithm is very complicated and has a huge time complexity. Therefore, its interest is mainly historical.

Other applications

Number theory

The discriminant of a polynomial, which is a fundamental tool in number theory is the quotient by its leading coefficient of the resultant of the polynomial and its derivative.

If x and y are algebraic numbers such that $P(x)=Q(y)=0,$ with degree $Q$ of degree $n)$ , then $z=x+y$ is a root of the resultant $\operatorname {res} _{x}(P(x),Q(z-x)),$ and $t=xy$ is a root of $\operatorname {res} _{x}(P(x),x^{n}Q(t/x)).$ Combined with the fact that $1/y$ is a root of $y^{n}Q(1/y)$ , this shows that the set of algebraic numbers is a field.

Let $K(\alpha )$ be an algebraic field extension generated by an element $\alpha,$ which has $P(x)$ as minimal polynomial. Every element of $\beta \in K(\alpha )$ may be written as $\beta =Q(\alpha ),$ where $Q$ is a polynomial. Then $\beta$ is a root of $\operatorname {res} _{x}(P(x),z-Q(x)),$ and this resultant is a power of the minimal polynomial of $\beta .$

Algebraic geometry

Given two plane algebraic curves defined as the zeros of the polynomials $P (x, y)$ and $Q (x, y)$ , the resultant allows the computation of their intersection. More precisely, the roots of $\operatorname {res} _{y}(P,Q)$ are the x-coordinates of the intersection points and of the common vertical asymptotes, and the roots of $\operatorname {res} _{x}(P,Q)$ are the y-coordinates of the intersection points and of the common horizontal asymptotes.

A rational plane curve may be defined by a parametric equation

x={\frac {P(t)}{R(t)}},\qquad y={\frac {Q(t)}{R(t)}},

where $P$ , $Q$ and $R$ are polynomials. An implicit equation of the curve is given by

\operatorname {res} _{t}(xR-P,yR-Q).

The degree of this curve is the highest degree of $P$ , $Q$ and $R$ , which is equal to the total degree of the resultant.

Symbolic integration

In symbolic integration, for computing the antiderivative of a rational fraction, one uses partial fraction decomposition for decomposing the integral into a "rational part", which is a sum of rational fractions whose antiprimitives are rational fractions, and a "logarithmic part" which is a sum of rational fractions of the form

{\frac {P(x)}{Q(x)}},

where $Q$ is a square-free polynomial and $P$ is a polynomial of lower degree than $Q$ . The antiderivative of such a function involves necessarily logarithms, and generally algebraic numbers (the roots of $Q$ ). In fact, the antiderivative is

\int {\frac {P(x)}{Q(x)}}dx=\sum _{Q(\alpha )=0}{\frac {P(\alpha )}{Q'(\alpha )}}\log(x-\alpha ),

where the sum runs over all complex roots of $Q$ .

The number of algebraic numbers involved by this expression is generally equal to the degree of $Q$ , but it occurs frequently that an expression with less algebraic numbers may be computed. The Lazard–Rioboo–Trager method produced an expression, where the number of algebraic numbers is minimal, without any computation with algebraic numbers.

Let

S_{1}(r)S_{2}(r)^{2}\cdots S_{k}(r)^{k}=\operatorname {res} _{r}(rQ'(x)-P(x),Q(x))

be the square-free factorization of the resultant which appears on the right. Trager proved that the antiderivative is

\int {\frac {P(x)}{Q(x)}}dx=\sum _{i=1}^{k}\sum _{S_{i}(\alpha )=0}\alpha \log(T_{i}(\alpha ,x)),

where the internal sums run over the roots of the $S_{i}$ (if $S_{i}=1)$ the sum is zero, as being the empty sum), and $T_{i}(r,x)$ is a polynomial of degree $i$ in $x$ . The Lazard-Rioboo contribution is the proof that $T_{i}(r,x)$ is the subresultant of degree $i$ of $rQ'(x)-P(x)$ and $Q(x).$ It is thus obtained for free if the resultant is computed by the subresultant pseudo-remainder sequence.

Computer algebra

All preceding applications, and many others, show that the resultant is a fundamental tool in computer algebra. In fact most computer algebra systems include an efficient implementation of the computation of resultants.

Homogeneous resultant

The resultant is also defined for two homogeneous polynomial in two indeterminates. Given two homogeneous polynomials $P (x, y)$ and $Q (x, y)$ of respective total degrees $p$ and $q$ , their homogeneous resultant is the determinant of the matrix over the monomial basis of the linear map

(A,B)\mapsto AP+BQ,

where $A$ runs over the bivariate homogeneous polynomials of degree $q - 1$ , and $B$ runs over the homogeneous polynomials of degree $p - 1$ . In other words, the homogeneous resultant of $P$ and $Q$ is the resultant of $P (x, 1)$ and $Q (x, 1$ when they are considered as polynomials of degree $p$ and $q$ (their degree in $x$ may be lower than their total degree):

\operatorname {Res} (P(x,y),Q(x,y))=\operatorname {res} _{p,q}(P(x,1),Q(x,1)).

(The capitalization of "Res" is used here for distinguishing the two resultants, although there is no standard rule for the capitalization of the abbreviation).

The homogeneous resultant has essentially the same properties as the usual resultant, with essentially two differences: instead of polynomial roots, one considers zeros in the projective line, and the degree of a polynomial may not change under a ring homomorphism. That is:

The resultant of two homogeneous polynomials over an integral domain is zero if and only if they have a non-zero common zero over an algebraically closed field containing the coefficients.
If $P$ and $Q$ are two bivariate homogeneous polynomials with coefficients in a commutative ring $R$ , and $\varphi \colon R\to S$ a ring homomorphism of $R$ into another commutative ring $S$ , then extending $\varphi$ to polynomials over $R$ , ones has

\operatorname {Res} (\varphi (P),\varphi (Q))=\varphi (\operatorname {Res} (P,Q)).

The property of an homogeneous resultant to be zero is invariant under any projective change of variables.

Any property of the usual resultant may similarly extended to the homogeneous resultant, and the resulting property is either very similar or simpler than the corresponding property of the usual resultant.

Macaulay's resultant

Macaulay's resultant, named after Francis Sowerby Macaulay, also called the multivariate resultant, or the multipolynomial resultant,^[3] is a generalizationof the homogeneous resultant to $n$ homogeneous polynomials in $n$ indeterminates. Macaulay's resultant is a polynomial in the coefficients of these $n$ homogeneous polynomials that vanishes if and only if the polynomials have a common non-zero solution in an algebraically closed field containing containing the coefficients, or, equivalently, if the $n$ hyper surfaces defined by the polynomials have a common zero in the $n -1$ dimensional projective space. The multivariate resultant is, with Gröbner bases, one of the main tools of effective elimination theory (elimination theory on computers).

Like the homogeneous resultant, Macaulay's may be defined with determinants, and thus behaves well under ring homomorphisms. However, it cannot be defined by a single determinant. It follows that it is easier to define it first on generic polynomials.

Resultant of generic homogeneous polynomials

A homogeneous polynomial of degree $d$ in $n$ variables may have up to

{\binom {n+d-1}{n-1}}={\frac {(n+d-1)!}{(n-1)!\,d!}}

coefficients; it is said to be generic, if these coefficients are distinct indeterminates.

Let $P_{1},\ldots ,P_{n}$ be $n$ generic homogeneous polynomials in $n$ indeterminates, of respective degrees $d_{1},\dots ,d_{n}.$ Together, they involve

\sum _{i=1}^{n}{\binom {n+d_{i}-1}{n-1}}

indeterminate coefficients. Let $C$ be the polynomial ring over the integers, in all these indeterminate coefficients. The polynomials $P_{1},\ldots ,P_{n}$ belong thus to $C[x_{1},\ldots ,x_{n}],$ and their resultant (still to be defined) belongs to $C$ .

The Macaulay degree is the integer $D=d_{1}+\cdots +d_{n}-n+1,$ which is fundamental in Macaulay's theory. For defining the resultant, one considers the Macaulay matrix, which is the matrix over the monomial basis of the $C$ -linear map

(Q_{1},\ldots ,Q_{n})\mapsto Q_{1}P_{1}+\cdots +Q_{n}P_{n},

in which each $Q_{i}$ runs over the homogeneous polynomials of degree $D-d_{i},$ and the codomain is the $C$ -module of the homogeneous polynomials of degree $D$ .

If $n = 2$ , the Macaulay matrix is the Sylvester matrix, and is a square matrix, but this is no longer true for $n > 2$ . Thus, instead of considering the determinant, one considers all the maximal minors, that is the determinants of the square submatrices that have as many rows as the Macaulay matrix. Macaulay proved that the $C$ -ideal generated by these principal minors is a principal ideal, which is generated by the greatest common divisor of these minors. As one is working with polynomials with integer coefficients, this greatest common divisor is defined up its sign. The generic Macaulay resultant is the greatest common divisor which becomes $1$ , when, for each $i$ , zero is substituted for all coefficients of $P_{i},$ except the coefficient of $x_{i}^{d_{i}},$ for which one is substituted.

Properties of the generic Macaulay resultant

The generic Macaulay resultant is an irreducible polynomial.
It is homogeneous of degree $B/d_{i}$ in the coefficients of $P_{i},$ where $B=d_{1}\cdots d_{n}$ is the Bézout bound.
The product with the resultant of every monomial of degree $D$ in $x_1, \cdots, x_n$ belongs to the ideal of $C[x_{1},\cdots ,x_{n}]$ generated by $P_{1},\ldots ,P_{n}.$

Resultant of polynomials over a field

From now on, we consider that the homogeneous polynomials $P_{1},\ldots ,P_{n},$ of degrees $d_{1},\ldots ,d_{n}$ have their coefficients in a field $k$ , that is that they belong to $k[x_{1},\cdots ,x_{n}].$ Their resultant is defined as the element of $k$ obtained by replacing in the generic resultant the indeterminate coefficients by the actual coefficients of the $P_{i}.$

The main property of the resultant is that it is zero if only if $P_{1},\ldots ,P_{n},$ have a nonzero common zero in an algebraically closed extension of $k$ .

The "only if" part of this theorem results on the last property of the preceding paragraph, and is an effective version of Projective Nullstellensatz: If the resultant is nonzerozero, then

\langle x_{1},\ldots x_{n}\rangle ^{D}\subseteq \langle P_{1},\ldots ,P_{n}\rangle ,

where $D=d_{1}+\cdots +d_{n}-n+1$ is the Macaulay degree, and $\langle x_{1},\ldots x_{n}\rangle$ is the maximal homogeneous ideal. This implies that any common zero of $P_{1},\ldots ,P_{n}$ is a common zero of $x_{1},\ldots ,x_{n}.$

Computability

As the computation of a resultant may reduced to computing determinants and polynomial greatest common divisors, there are algorithms for computing resultants in a finite number of steps.

However, the generic resultant is a polynomial of very high degree (exponential in $n$ ) depending on a huge number of indeterminates. It follows that, except for very small $n$ and very small degrees of input polynomials, the generic resultant is, in practice, impossible to compute, even with modern computers. Moreover, the number of monomials of the generic resultant is so high, that, if it would be computable, the result could not be stored on available memory devices, even for rather small values of $n$ and of the degrees of the input polynomials.

Therefore, computing the resultant makes sense only for polynomials whose coefficients belong to a field or are polynomials in few indeterminates over a field.

In the case of input polynomials with coefficients in a field, the exact value of the resultant is rarely important, only its equality (or not) to zero matters. As the resultant is zero if and only if the rank of the Macaulay matrix is lower than its number of its rows, this equality to zero may by tested by applying Gaussian elimination to the Macaulay matrix. This provides a computational complexity $d^{O(n)},$ where $d$ is the maximum degree of input polynomials.

Another case where the computation of the resultant may provide useful information is when the coefficients of the input polynomials are polynomials in a small number of indeterminates, often called parameters. In this case, the resultant, if not zero, defines a hypersurface in the parameter space. A point belongs to this hyper surface, if and only if there are values of $x_{1},\ldots ,x_{n}$ which, together with the coordinates of the point are a zero of the input polynomials. In other words, the resultant is the result of the "elimination" of $x_{1},\ldots ,x_{n}$ from the input polynomials.

U-resultant

Macaulay's resultant provides a method, called "U-resultant" by Macaulay, for solving systems of polynomial equations.

Given $n - 1$ homogeneous polynomials $P_{1},\ldots ,P_{n-1},$ of degrees $d_{1},\ldots ,d_{n-1},$ in $n$ indeterminates $x_{1},\ldots ,x_{n},$ , over a field $k$ , their U-resultant is the resultant of the $n$ polynomials $P_{1},\ldots ,P_{n-1},P_{n},$ where

P_{n}=u_{1}x_{1}+\cdots +u_{n}x_{n}

is the generic linear form whose coefficients are new indeterminates $u_{1},\ldots ,u_{n}.$ Notation $u_{i}$ or $U_{i}$ for these generic coefficients is traditional, and is the origin of the term U-resultant.

The U-resultant is a homogeneous polynomial in $k[u_{1},\ldots ,u_{n}].$ It is zero if and only if the common zeros of $P_{1},\ldots ,P_{n-1}$ form an projective algebraic set of positive dimension (that is, there are infinitely many projective zeros over an algebraically closed extension of $k$ ). If the U-resultant is not zero, its degree is the Bézout bound $d_{1}\cdots d_{n-1}.$ The U-resultant factorizes over an algebraically closed extension of $k$ into a product of linear forms. If $\alpha _{1}u_{1}+\ldots +\alpha _{n}u_{n}$ is such a linear factor, then $\alpha _{1},\ldots ,\alpha _{n}$ are the homogeneous coordinates of a common zero of $P_{1},\ldots ,P_{n-1}.$ Moreover, every common zero may be obtained from one of these linear factors, and the multiplicity as a factor is equal to the intersection multiplicity of the $P_{i}$ at this zero. In other words, the U-resultant provides a completely explicit version of Bézout's theorem.

Extension to more polynomials and computation

The U-resultant, as defined by Macaulay was defined only for a number of homogeneous polynomials, which is one less than the number of indeterminates. In 1981, Daniel Lazard provided the following generalization to any number of polynomials, which may be computed by a single Gaussian elimination.

Let $P_{1},\ldots ,P_{k}$ be homogeneous polynomials in $x_{1},\ldots ,x_{n},$ of degrees $d_{1},\ldots ,d_{k},$ over a field $k$ . Without generality, one may suppose that $d_{2}\geq d_{3}\geq \cdots \geq d_{k}\geq d_{1}.$ Setting $d_{i}=1$ for $i > k$ , the Macaulay bound is $D=d_{1}+\cdots ,d_{n}-n+1.$

Let $u_{1},\ldots ,u_{n}$ be new indeterninates, and $P_{k+1}=u_{1}x_{1}+\cdots +u_{n}x_{n}.$ In this case, the Macaulay matrix is the matrix, over the basis of the monomials in $x_{1},\ldots ,x_{n},$ of the linear map

(Q_{1},\ldots ,Q_{k+1})\mapsto P_{1}Q_{1}+\cdots +P_{k+1}Q_{k+1},

where, for each $i$ , $Q_{i}$ runs over the homogeneous polynomials of degree $D-d_{i}.$

Gaussian elimination over $k$ of the Macaulay matrix, provides, after removing zero rows and zero columns, either an empty matrix or a block diagonal square matrix. The result is the empty matrix if and only if $P_{1},\ldots ,P_{k}$ have infinitely many common projective zeros over an algebraically closed extension of $k$ . In the other case, the block diagonal matrix consists of two blocks. One is an identity matrix, and the other is a square matrix of linear forms in $u_{1},\ldots ,u_{n}.$ The U-resultant is the determinant of the latter matrix, which is equal to the determinant of the block diagonal matrix.

As for the original U-resultant, this U-resultant factorizes into linear factors over an algebraically closed extension of $k$ . The coefficients of these linear factors are the homogeneous coordinates of the common zeros of $P_{1},\ldots ,P_{k},$ and the multiplicity of a common zero equals the multiplicity of the corresponding linear factor.

The number of rows of the Macaulay matrix is less than $(ed)^{n},$ where $e ~ 2.732$ is the usual mathematical constant, and $d$ is the arithmetic mean of the degrees of the $P_{i}.$ It follows that, if $k$ is not too large (say, if $k \leq 2 n$ ), a system of polynomial equations, that has a finite number of projective zeros, may be completely solved in time $d^{O(n)}.$

Although large, this bound is almost optimal. In fact, if all input degrees are equal, this implies that the time complexity is polynomial in the expected number of solutions (Bézout's theorem).

Notes

↑ Salmon 1885, lesson VIII, p. 66.
↑ Macaulay 1902.
↑ Cox, David; Little, John; O'Shea, Donal (2005), Using Algebraic Geometry, Springer Science+Business Media, ISBN 978-0387207339 , Chapter 3. Resultants

References

Gelfand, I. M.; Kapranov, M.M.; Zelevinsky, A.V. (1994), Discriminants, resultants, and multidimensional determinants, Boston: Birkhäuser, ISBN 978-0-8176-3660-9
Macaulay, F. S. (1902), "Some Formulæ in Elimination", Proc. London Math. Soc., 35: 3–27, doi:10.1112/plms/s1-35.1.3
Macaulay, F. S. (1916), The Algebraic Theory of Modular Systems, The Cornell Library of Historical Mathematical Monographs, Cambridge University Press, ISBN 978-1275570412
Salmon, George (1885) [1859], Lessons introductory to the modern higher algebra (4th ed.), Dublin, Hodges, Figgis, and Co., ISBN 978-0-8284-0150-0

External links

Weisstein, Eric W. "Resultant". MathWorld.

Polynomials and polynomial functions

By degree	Zero polynomial (degree undefined or −1 or −∞) Constant function (0) Linear function (1) Quadratic function (2) Cubic function (3) Quartic function (4) Quintic function (5) Sextic equation (6) Septic equation (7) Octic equation (8)

By number of terms and other properties	Monomial Binomial Trinomial Homogeneous Quasi-homogeneous

Tools and algorithms	Factorization Greatest common divisor Division Horner's method of evaluation Resultant Discriminant Gröbner basis

This article is issued from Wikipedia - version of the 12/3/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.