Vincent's theorem


In mathematics, Vincent's theorem—named after —is a theorem that isolates the real roots of polynomials with rational coefficients.
Even though Vincent's theorem is the basis of the fastest method for the isolation of the real roots of polynomials, it was almost totally forgotten, having been overshadowed by Sturm's theorem; consequently, it does not appear in any of the classical books on the theory of equations, except for Uspensky's book. Two variants of this theorem are presented, along with several real root isolation methods derived from them.

Sign variation

  1. If r = l+1 the numbers cl and cr have opposite signs.
  2. If rl+2 the numbers cl+1,..., cr−1 are all zero and the numbers cl and cr have opposite signs.
Two versions of this theorem are presented: the continued fractions version due to Vincent, and the bisection version due to Alesina and Galuzzi.

Vincent's theorem: Continued fractions version (1834 and 1836)

If in a polynomial equation with rational coefficients and without multiple roots, one makes successive transformations of the form
where are any positive numbers greater than or equal to one, then after a number of such transformations, the resulting transformed equation either has zero sign variations or it has a single sign variation. In the first case there is no root, whereas in the second case there is a single positive real root. Furthermore, the corresponding root of the proposed equation is approximated by the finite continued fraction:
Moreover, if infinitely many numbers satisfying this property can be found, then the root is represented by the corresponding continued fraction.
The above statement is an exact translation of the theorem found in Vincent's original papers; however, the following remarks are needed for a clearer understanding:
  • If denotes the polynomial obtained after n substitutions, then there exists N such that for all either has no sign variation or it has one sign variation. In the latter case has a single positive real root for all.
  • The continued fraction represents a positive root of the original equation, and the original equation may have more than one positive root. Moreover, assuming, we can only obtain a root of the original equation that is > 1. To obtain an arbitrary positive root we need to assume that.
  • Negative roots are obtained by replacing x by −x, in which case the negative roots become positive.

Vincent's theorem: Bisection version (Alesina and Galuzzi 2000)

Let p be a real polynomial of degree deg that has only simple roots. It is possible to determine a positive quantity δ so that for every pair of positive real numbers a, b with, every transformed polynomial of the form
has exactly 0 or 1 sign variations. The second case is possible if and only if p has a single root within.

The Alesina–Galuzzi "a_b roots test"

From equation the following criterion is obtained for determining whether a polynomial has any roots in the interval :
Perform on p the substitution
and count the number of sign variations in the sequence of coefficients of the transformed polynomial; this number gives an upper bound on the number of real roots p has inside the open interval. More precisely, the number ρab of real roots in the open interval —multiplicities counted—of the polynomial p in R, of degree deg, is bounded above by the number of sign variations varab, where
As in the case of Descartes' rule of signs if varab = 0 it follows that ρab = 0 and if varab = 1 it follows that ρab = 1.
A special case of the Alesina–Galuzzi "a_b roots test" is Budan's "0_1 roots test".

Sketch of a proof

A detailed discussion of Vincent's theorem, its extension, the geometrical interpretation of the transformations involved and three different proofs can be found in the work by Alesina and Galuzzi. A fourth proof is due to Ostrowski who rediscovered a special case of a theorem stated by Obreschkoff, p. 81, in 1920–1923.
To prove Vincent's theorem Alesina and Galuzzi show that after a series of transformations mentioned in the theorem, a polynomial with one positive root eventually has one sign variation. To show this, they use the following corollary to the theorem by Obreschkoff of 1920–1923 mentioned earlier; that is, the following corollary gives the necessary conditions under which a polynomial with one positive root has exactly one sign variation in the sequence of its coefficients; see also the corresponding figure.
Consider now the Möbius transformation
and the three circles shown in the corresponding figure; assume that
  • The circle
  • The two circles with center
From the above it becomes obvious that if a polynomial has a single positive root inside the eight-shaped figure and all other roots are outside of it, it presents one sign variation in the sequence of its coefficients. This also guarantees the termination of the process.

Historical background

Early applications of Vincent's theorem

In his fundamental papers, Vincent presented examples that show precisely how to use his theorem to isolate real roots of polynomials with continued fractions. However the resulting method had exponential computing time, a fact that mathematicians must have realized then, as was realized by Uspensky p. 136, a century later.
The exponential nature of Vincent's algorithm is due to the way the partial quotients ai are computed. That is, to compute each partial quotient ai Vincent uses Budan's theorem as a "no roots test"; in other words, to find the integer part of a root Vincent performs successive substitutions of the form xx+1 and stops only when the polynomials p and p differ in the number of sign variations in the sequence of their coefficients.
See the corresponding diagram where the root lies in the interval. It can be easily inferred that, if the root is far away from the origin, it takes a lot of time to find its integer part this way, hence the exponential nature of Vincent's method. Below there is an explanation of how this drawback is overcome.

Disappearance of Vincent's theorem

Vincent was the last author in the 19th century to use his theorem for the isolation of the real roots of a polynomial.
The reason for that was the appearance of Sturm's theorem in 1827, which solved the real root isolation problem in polynomial time, by defining the precise number of real roots a polynomial has in a real open interval. The resulting method for computing the real roots of polynomials has been the only one widely known and used ever since—up to about 1980, when it was replaced by methods derived from Vincent's theorem, the fastest one being the Vincent–Akritas–Strzeboński method.
Serret included in his Algebra, pp 363–368, Vincent's theorem along with its proof and directed all interested readers to Vincent's papers for examples on how it is used. Serret was the last author to mention Vincent's theorem in the 19th century.

Comeback of Vincent's theorem

In the 20th century Vincent's theorem cannot be found in any of the theory of equations books; the only exceptions are the books by Uspensky and Obreschkoff, where in the second there is just the statement of the theorem.
It was in Uspensky's book that Akritas found Vincent's theorem and made it the topic of his Ph.D. Thesis "Vincent's Theorem in Algebraic Manipulation", North Carolina State University, USA, 1978. A major achievement at the time was getting hold of Vincent's original paper of 1836, something that had eluded Uspensky—resulting thus in a great misunderstanding. Vincent's original paper of 1836 was made available to Akritas through the commendable efforts of a librarian in the Library of the University of Wisconsin–Madison, USA.

Real root isolation methods derived from Vincent's theorem

Isolation of the real roots of a polynomial is the process of finding open disjoint intervals such that each contains exactly one real root and every real root is contained in some interval. According to the French school of mathematics of the 19th century, this is the first step in computing the real roots, the second being their approximation to any degree of accuracy; moreover, the focus is on the positive roots, because to isolate the negative roots of the polynomial p replace x by −x and repeat the process.
The continued fractions version of Vincent's theorem can be used to isolate the positive roots of a given polynomial p of degree deg. To see this, represent by the Möbius transformation
the continued fraction that leads to a transformed polynomial with one sign variation in the sequence of its coefficients. Then, the single positive root of f corresponds to that positive root of p that is in the open interval with endpoints and. These endpoints are not ordered and correspond to M and M respectively.
Therefore, to isolate the positive roots of a polynomial, all that must be done is to compute—for each root—the variables a, b, c, d of the corresponding Möbius transformation
that leads to a transformed polynomial as in equation, with one sign variation in the sequence of its coefficients.
Crucial Observation: The variables a, b, c, d of a Möbius transformation
leading to a transformed polynomial—as in equation —with one sign variation in the sequence of its coefficients can be computed:
The "bisection part" of this all important observation appeared as a special theorem in the papers by Alesina and Galuzzi.
All methods described below need to compute an upper bound, ub, on the values of the positive roots of the polynomial under consideration. Exception is the VAS method where additionally lower bounds, lb, must be computed at almost every cycle of the main loop. To compute the lower bound lb of the polynomial p compute the upper bound ub of the polynomial and set.
Excellent bounds on the values of just the positive roots of polynomials have been developed by Akritas, Strzeboński and Vigklas based on previous work by Doru Stefanescu. They are described in P. S. Vigklas' Ph.D. Thesis and elsewhere. These bounds have already been implemented in the computer algebra systems Mathematica, SageMath, SymPy, Xcas etc.
All three methods described below follow the excellent presentation of François Boulier, p. 24.

Continued fractions method

Only one continued fractions method derives from Vincent's theorem. As stated above, it started in the 1830s when Vincent presented, in the papers several examples that show how to use his theorem to isolate the real roots of polynomials with continued fractions. However the resulting method had exponential computing time. Below is an explanation of how this method evolved.

Vincent–Akritas–Strzeboński (VAS, 2005)

This is the second method developed to handle the exponential behavior of Vincent's method.
The VAS continued fractions method is a direct implementation of Vincent's theorem. It was originally presented by Vincent from 1834 to 1938 in the papers in an exponential form; namely, Vincent computed each partial quotient ai by a series of unit increments aiai + 1, which are equivalent to substitutions of the form xx + 1.
Vincent's method was converted into its polynomial complexity form by Akritas, who in his 1978 Ph.D. Thesis computed each partial quotient ai as the lower bound, lb, on the values of the positive roots of a polynomial. This is called the ideal positive lower root bound that computes the integer part of the smallest positive root. To wit, now set ailb or, equivalently, perform the substitution xx + lb, which takes about the same time as the substitution xx + 1.
Finally, since the ideal positive lower root bound does not exist, Strzeboński introduced in 2005 the substitution, whenever ; in general and the value 16 was determined experimentally. Moreover, it has been shown that the VAS method is faster than the fastest implementation of the VCA method, a fact that was confirmed independently; more precisely, for the Mignotte polynomials of high degree VAS is about 50,000 times faster than the fastest implementation of VCA.
In 2007, Sharma removed the hypothesis of the ideal positive lower bound and proved that VAS is still polynomial in time.
VAS is the default algorithm for root isolation in Mathematica, SageMath, SymPy, Xcas.
For a comparison between Sturm's method and VAS use the functions realroot and time of Xcas. By default, to isolate the real roots of poly realroot uses the VAS method; to use Sturm's method write realroot. See also the External links for an application by A. Berkakis for Android devices that does the same thing.
Here is how VAS works, where for simplicity Strzeboński's contribution is not included:
  • Let p be a polynomial of degree deg such that p ≠ 0. To isolate its positive roots, associate with p the Möbius transformation M = x and repeat the following steps while there are pairs to be processed.
  • Use Descartes' rule of signs on p to compute, if possible, the number of its roots inside the interval. If there are no roots return the empty set, ∅ whereas if there is one root return the interval, where a = min, M), and b = max, M); if b = ∞ set b = ub, where ub is an upper bound on the values of the positive roots of p.
  • If there are two or more sign variations Descartes' rule of signs implies that there may be zero, one or more real roots inside the interval ; in this case consider separately the roots of p that lie inside the interval from those inside the interval. A special test must be made for 1.
  • *To guarantee that there are roots inside the interval the ideal lower bound, lb is used; that is the integer part of the smallest positive root is computed with the help of the lower bound,, on the values of the positive roots of p. If, the substitution is performed to p and M, whereas if use substitution xx+1 to find the integer part of the root.
  • *To compute the roots inside the interval perform the substitution to p and M and process the pair
Below is a recursive presentation of VAS.
VAS:
Input: A univariate, square-free polynomial, of degree deg, and the Möbius transformation
Output: A list of isolating intervals of the positive roots of p.
1 var ← the number of sign variations of p // Descartes' rule of signs;
2 if var = 0 then RETURN ∅;
3 if var = 1 then RETURN // a = min, M), b = max, M), but if b = ∞ set b = ub, where ub is an upper bound on the values of the positive roots of p;
4 lb ← the ideal lower bound on the positive roots of p;
5 if lb ≥ 1 then pp, MM;
6 p01deg p, M01M // Look for real roots in ;
7 mM // Is 1 a root?
8 p1∞p, M1∞M // Look for real roots in ;
9 if p ≠ 0 then
10 RETURN VAS ∪ VAS
11 else
12 RETURN VAS ∪ ∪ VAS
13 end
Remarks
  • For simplicity Strzeboński's contribution is not included.
  • In the above algorithm with each polynomial there is associated a Möbius transformation M.
  • In line 1 Descartes' rule of signs is applied.
  • If lines 4 and 5 are removed from VAS the resulting algorithm is Vincent's exponential one.
  • Any substitution performed on the polynomial p is also performed on the associated Möbius transformation M.
  • The isolating intervals are computed from the Möbius transformation in line 3, except for integer roots computed in line 7.

Example of VAS(''p'', ''M'')

We apply the VAS method to .

Iteration 1

VAS
1 var ← 2 // the number of sign variations in the sequence of coefficients of p = x3 − 7x + 7
4 lb ← 1 // the ideal lower bound—found using lbcomputed and substitution xx + 1
5 px3 + 3x2 − 4x + 1, Mx + 1
6 p01x3x2 − 2x + 1, M01
7 m ← 1
8 p1∞x3 + 6x2 + 5x + 1, M1∞x + 2
10 RETURN VAS ∪ VAS
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 2

VAS
1 var ← 2 // the number of sign variations in the sequence of coefficients of p = x3x2 − 2x + 1
4 lb ← 0 // the ideal lower bound—found using lbcomputed and substitution xx + 1
6 p01x3 + x2 − 2x − 1, M01
7 m
8 p1∞x3 + 2x2x − 1, M1∞
10 RETURN VAS ∪ VAS
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 3

VAS
1 var ← 1 // the number of sign variations in the sequence of coefficients of p = x3 + x2 − 2x − 1
3 RETURN
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 4

VAS
1 var ← 1 // the number of sign variations in the sequence of coefficients of p = x3 + 2x2x − 1
3 RETURN
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 5

VAS
1 var ← 0 // the number of sign variations in the sequence of coefficients of p = x3 + 6x2 + 5x + 1
2 RETURN
List of isolation intervals:
List of pairs to be processed:.
Finished.

Conclusion

Therefore, the two positive roots of the polynomial lie inside the isolation intervals and

Bisection methods

There are various bisection methods derived from Vincent's theorem; they are all presented and compared elsewhere. Here the two most important of them are described, namely, the Vincent–Collins–Akritas (VCA) method and the Vincent–Alesina–Galuzzi (VAG) method.
The Vincent–Alesina–Galuzzi (VAG) method is the simplest of all methods derived from Vincent's theorem but has the most time consuming test to determine if a polynomial has roots in the interval of interest; this makes it the slowest of the methods presented in this article.
By contrast, the Vincent–Collins–Akritas (VCA) method is more complex but uses a simpler test than VAG. This along with certain improvements have made VCA the fastest bisection method.

Vincent–Collins–Akritas (VCA, 1976)

This was the first method developed to overcome the exponential nature of Vincent's original approach, and has had quite an interesting history as far as its name is concerned. This method, which isolates the real roots, using Descartes' rule of signs and Vincent's theorem, had been originally called modified Uspensky's algorithm by its inventors Collins and Akritas. After going through names like "Collins–Akritas method" and "Descartes' method", it was finally François Boulier, of Lille University, who gave it the name Vincent–Collins–Akritas method, p. 24, based on the fact that "Uspensky's method" does not exist and neither does "Descartes' method". The best implementation of this method is due to Rouillier and Zimmerman, and to this date, it is the fastest bisection method. It has the same worst case complexity as Sturm's algorithm, but is almost always much faster. It has been implemented in Maple's RootFinding package.
Here is how VCA works:
  • Given a polynomial porig of degree deg, such that porig ≠ 0, whose positive roots must be isolated, first compute an upper bound, ub on the values of these positive roots and set p = porig and =. The positive roots of p all lie in the interval and there is a bijection between them and the roots of porig, which all lie in the interval = ; this bijection is expressed by α = a +α. Likewise, there is a bijection between the intervals and.
  • Repeat the following steps while there are pairs to be processed.
  • Use Budan's "0_1 roots test" on p to compute the number of its roots inside the interval. If there are no roots return the empty set, ∅ and if there is one root return the interval.
  • If there are two or more sign variations Budan's "0_1 roots test" implies that there may be zero, one, two or more real roots inside the interval. In this case cut it in half and consider separately the roots of p inside the interval —and that correspond to the roots of porig inside the interval from those inside the interval and correspond to the roots of porig inside the interval ; that is, process, respectively, the pairs
Below is a recursive presentation of the original algorithm VCA.
VCA
Input: A univariate, square-free polynomial pZ, p ≠ 0 of degree deg, and the open interval =, where ub is an upper bound on the values of the positive roots of p. are all in the open interval ).

Output: A list of isolating intervals of the positive roots of p
1 var ← the number of sign variations of degp // Budan's "0_1 roots test";
2 if var = 0 then RETURN ∅;
3 if var = 1 then RETURN ;
4 p0 ← 2degp // Look for real roots in ;
5 m ← // Is a root?
6 p1 ← 2degp // Look for real roots in ;
7 if p ≠ 0 then
8 RETURN VCA ∪ VCA
9 else
10 RETURN VCA ∪ ∪ VCA
11 end
Remark
  • In the above algorithm with each polynomial there is associated an interval. As shown elsewhere, p. 11, a Möbius transformation can also be associated with each polynomial in which case VCA looks more like VAS.
  • In line 1 Budan's "0_1 roots test" is applied.

Example of VCA(''p'', (''a'',''b''))

Given the polynomial and considering as an upper bound on the values of the positive roots the arguments of the VCA method are: and.

Iteration 1

1 var ← 2 // the number of sign variations in the sequence of coefficients of 3p = 7x3 − 7x2 − 35x + 43
4 p0 ← 64x3 − 112x + 56
5 m ← 2
6 p1 ← 64x3 + 192x2 + 80x + 8
7 p = 1
8 RETURN VCA ∪ VCA
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 2

VCA
1 var ← 2 // the number of sign variations in the sequence of coefficients of 3p = 56x3 + 56x2 − 56x + 8
4 p0 ← 64x3 − 448x + 448
5 m ← 1
6 p1 ← 64x3 + 192x2 − 256x + 64
7 p = 8
8 RETURN VCA ∪ VCA
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 3

VCA
1 var ← 0 // the number of sign variations in the sequence of coefficients of 3p = 448x3 + 896x2 + 448x + 64
2 RETURN
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 4

VCA
1 var ← 2 // the number of sign variations in the sequence of coefficients of 3p = 64x3 − 64x2 − 128x + 64
4 p0 ← 64x3 + 384x2 − 1024x + 512
5 m
6 p1 ← 64x3 + 576x2 − 64x + 64
7 p = −8
8 RETURN VCA ∪ VCA
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 5

VCA
1 var ← 1 // the number of sign variations in the sequence of coefficients of 3p = 512x3 + 512x2 − 128x − 64
3 RETURN
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 6

VCA
1 var ← 1 // the number of sign variations in the sequence of coefficients of 3p = −64x3 − 256x2 + 256x + 512
3 RETURN
List of isolation intervals:
List of pairs to be processed:
Remove the first and process it.

Iteration 7

VCA
1 var ← 0 // the number of sign variations in the sequence of coefficients of 3p = 8x3 + 104x2 + 376x + 344
2 RETURN
List of isolation intervals:
List of pairs to be processed:.
Finished.

Conclusion

Therefore, the two positive roots of the polynomial lie inside the isolation intervals and

Vincent–Alesina–Galuzzi (VAG, 2000)

This was developed last and is the simplest real root isolation method derived from Vincent's theorem.
Here is how VAG works:
  • Given a polynomial p of degree deg, such that p ≠ 0, whose positive roots must be isolated, first compute an upper bound, ub on the values of these positive roots and set =. The positive roots of p all lie in the interval.
  • Repeat the following steps while there are intervals to be processed; in this case the polynomial p stays the same.
  • Use the Alesina–Galuzzi "a_b roots test" on p to compute the number of its roots inside the interval. If there are no roots return the empty set, ∅ and if there is one root return the interval.
  • If there are two or more sign variations the Alesina–Galuzzi "a_b roots test" implies that there may be zero, one, two or more real roots inside the interval. In this case cut it in half and consider separately the roots of p inside the interval from those inside the interval ; that is, process, respectively, the intervals and. It may well turn out that is a root of p, in which case the isolation interval reduces to a point.
Below is a recursive presentation of VAG.
VAG

Input: A univariate, square-free polynomial pZ, p ≠ 0 of degree deg and the open interval =, where ub is an upper bound on the values of the positive roots of p.

Output: A list of isolating intervals of the positive roots of p.
1 var ← the number of sign variations of deg p // The Alesina–Galuzzi "a_b roots test";
2 if var = 0 then RETURN ∅;
3 if var = 1 then RETURN ;
4 m ← // Subdivide the interval in two equal parts;
5 if p ≠ 0 then
6 RETURN VAG ∪ VAG
7 else
8 RETURN VAG ∪ ∪ VAG
9 end
Remarks
  • Compared to VCA the above algorithm is extremely simple; by contrast, VAG uses the time consuming "a_b roots test" and that makes it much slower than VCA.
  • As Alesina and Galuzzi point out, p. 189, there is a variant of this algorithm due to Donato Saeli. Saeli suggested that the mediant of the endpoints be used instead of their midpoint. However, it has been shown that using the mediant of the endpoints is in general much slower than the "mid-point" version.

Example of VAG(''p'', (''a'',''b''))

Given the polynomial p = x3 − 7x + 7 and considering as an upper bound on the values of the positive roots ub = 4 the arguments of VAG are: p = x3 − 7x + 7 and =.

Iteration 1

1 var ← 2 // the number of sign variations in the sequence of coefficients of 3p = 43x3 − 35x2 − 7x + 7
4 m ← = 2
5 p = 1
8 RETURN VAG ∪ VAG
List of isolation intervals:.
List of intervals to be processed:.
Remove the first and process it.

Iteration 2

VAG
1 var ← 2 // the number of sign variations in the sequence of coefficients of 3p = x3 − 7x2 + 7x + 7
4 m ← = 1
5 p = 1
8 RETURN VAG ∪ VAG
List of isolation intervals:.
List of intervals to be processed:.
Remove the first and process it.

Iteration 3

VAG
1 var ← 0 // the number of sign variations in the sequence of coefficients of 3p = x3 + 7x2 + 14x + 7
2 RETURN
List of isolation intervals:.
List of intervals to be processed:.
Remove the first and process it.

Iteration 4

VAG
1 var ← 2 // the number of sign variations in the sequence of coefficients of 3p = x3 − 2x2x + 1
4 m ← =
5 p = −
8 RETURN VAG ∪ VAG
List of isolation intervals:.
List of intervals to be processed:.
Remove the first and process it.

Iteration 5

VAG
1 var ← 1 // the number of sign variations in the sequence of coefficients of 233p = x3 + 2x2 − 8x − 8
3 RETURN
List of isolation intervals:.
List of intervals to be processed:.
Remove the first and process it.

Iteration 6

VAG
1 var ← 1 // the number of sign variations in the sequence of coefficients of 233p = 8x3 + 4x2 − 4x − 1
3 RETURN
List of isolation intervals:.
List of intervals to be processed:.
Remove the first and process it.

Iteration 7

VAG
1 var ← 0 // the number of sign variations in the sequence of coefficients of 3p = 344x3 + 376x2 + 104x + 8
2 RETURN
List of isolation intervals:.
List of intervals to be processed: ∅.
Finished.

Conclusion

Therefore, the two positive roots of the polynomial lie inside the isolation intervals and