Roots of Polynomials
The Factor Theorem
The factor theorem states that if a non-zero polynomial has a root at , i.e. , then we can write for some polynomial .
Let , with and let be such that .
Then there exists such that .
We first consider a special case, then use this to prove the general case.
Consider the two cases:
-
Case x = 0. First we show the theorem when , then the claim is that . To see this observe that if then , and since we have . Therefore is of the form: If we define , we see that: As desired.
-
Case x 0. Suppose . Define the polynomial and observe that . Therefore, by the “x = 0” case, we conclude that for some . Next write:
If we define , then we see that:
Actually, the factor theorem can be broadened a bit: no part of the proof requires to be a integral domain, at no point did we use the fact that implies or . Therefore, the factor theorem applies even to polynomials over any commutative ring, for instance, polynomials over .
The Fundamental Theorem
Let be an integral domain and let be a non-zero polynomial of degree , then has at most roots in .
We prove this using induction over the degree of .
-
Base d = 0. If , then for some non-zero . Clearly, has roots.
-
Step d > 0. Let be a polynomial of degree . If has roots in , then we are done. Otherwise, let be a root of . Then, using the factor theorem, we can write for some polynomial of degree . By the inductive hypothesis, has at most roots and has at most root. Finally has at most roots because is an integral domain: since if is such that , then either or , in other words, must be a root of either or of which there are at most .
The theorem also allows us to conclude that if two polynomials and of degree share more than evaluations in , then the polynomials must be equal.
Let and be polynomials of degree over . Let be distinct elements. Then:
Define . Note that is a polynomial in of degree at most and observe that for all : has at least roots in . Therefore, must be the zero polynomial, i.e. .
Schwartz-Zippel Lemma
We can turn the corollary above into a probabilistic check of equality between polynomials, this technique is called the Schwartz-Zippel lemma and is widely used; we will use it a lot.
Let be distinct polynomials of degree and let be an arbitrary subset of the integral domain , then:
Note that the statement is vacuous if . When then the statement implies that there must exist a subset with such that: In which case the previous corollary shows that .
Multivariate Polynomial Roots
An important way for us to view multivariate polynomials will be as univariate polynomials over polynomial rings. To this end, it is useful for us to verify that multivariate polynomials form integral domains allowing us to apply the fundamental theorem:
Let be an integral domain, then is an integral domain.
We can multiply and add polynomials, it is also clear that polynomial multiplication is commutative (since is), i.e. for and , we have: Finally, let us verify that there are no zero divisors in , i.e. show: To see this let and be the degrees of and respectively, denote by and the leading coefficients of and . Note that and otherwise or respectively. Then the leading coefficient of is which is non-zero since is an integral domain. Therefore, if and only if or .
By applying the theorem above times, we can conclude that is an integral domain: let and for . Observe that:
This "iterative" construction of allows us to view -variate polynomials over as univariate polynomials over :
With this interpretation is a polynomial over and therefore can take any value in (not just ), i.e. we can evaluate for every -variate polynomial , which includes the constant polynomials, i.e. .
If we apply the fundamental theorem to this particular setting we get:
Let with be a -variate polynomial. Let be the degree in the variable, then there exist at most distinct -variate polynomials such that . In particular, there exist at most field elements (constant polynomials) such that .
If we apply this observation recursively, we can conclude that for sufficiently large tensor products, a polynomial vanishes over the tensor product if and only if the polynomial is the zero polynomial:
Let be a non-zero -variate polynomial with degree in each variable . Let be the tensor product of where : Then:
We prove this by induction:
-
Base k = 1. When the “multivariate” polynomial is a univariate polynomial , by applying the fundamental theorem with , we observe that the number of roots of is at most , however since the polynomial cannot evaluate to zero on all of . So the claim holds.
-
Step k > 1. Define and now rewrite as a polynomial with coefficients in : Since is an integral domain, we can apply the fundamental theorem, this time to , rather than . We conclude that at most values satisfy: And, in particular, there exist at most elements (constant polynomials) satisfying this. On the other hand, since there must exist at least one which is not a root, in other words: We then apply the induction hypothesis on to conclude that it does not vanish over . In other words, we conclude that there is at least one such that , which also allows us to conclude: So also cannot vanish over and the claim holds for as well.
Setting and as the -dimensional hypercube, we conclude that a non-zero multilinear polynomial cannot vanish on the hypercube, i.e. if then there exists at least one such that .
An easy, but very important, observation is that two multilinear polynomials can agree on the hypercube if and only if they actually are equal as polynomials.
Let be two multilinear polynomials such that: Then .
We can form: By assumption , therefore we conclude that by the theorem. Hence .
Multivariate Schwartz-Zippel
We can extend the techniques above to reason about the probability that a multivariate polynomial vanishes at a random point .
Let and let be a multivariate polynomial of individual degrees in . Then the probability that the polynomial vanishes at uniformly random can be bounded as follows: where .
We show this by induction:
-
Base k = 1. When the “multivariate” polynomial is a univariate polynomial , by applying the fundamental theorem with we observe that the number of roots of is at most . Hence for uniform , the probability that , i.e. that is one of the at most roots, is at most .
-
Step k > 1. Basically, there are two ways that could be zero:
- When we partially evaluate we get the zero polynomial
- Or, , but .
Define and view as a polynomial in : Since is an integral domain, we conclude that at most values satisfy: And, in particular, there exist at most elements which make , hence the probability that makes is at most .
On the other hand, if is not a root: We can apply the induction hypothesis on to conclude that: By applying a union bound on both these events we conclude that: