Horner's method
In mathematics and computer science, Horner's method is an algorithm for polynomial evaluation. It is named after William George Horner, although it is much older, attributed by Horner to Joseph-Louis Lagrange, and was discovered hundreds of years earlier by Chinese and Persian mathematicians. After the introduction of computers, this algorithm became fundamental for computing efficiently with polynomials.
The algorithm is based on Horner's rule, in which a polynomial is written in nested form:
This allows the evaluation of a polynomial of degree with only multiplications and additions. This is optimal, since there are polynomials of degree that cannot be evaluated with fewer arithmetic operations.
Horner's method and also refers to a method for approximating the roots of polynomials, described by Horner in 1819. It is a variant of the Newton–Raphson method made more efficient for hand calculation by application of Horner's rule. It was widely used until computers came into general use around 1970.
Polynomial evaluation and long division
Given the polynomialwhere are constant coefficients, the problem is to evaluate the polynomial at a specific value ofFor this, a new sequence of constants is defined recursively as follows:
Then is the value of.
To see why this works, the polynomial can be written in the form
Thus, by iteratively substituting the into the expression,
Similarly, it can be shown that:
Suggesting a convenient procedure for determining the result of the polynomial division
with being the division's remainder. If is a root of, then and a factor of.
Examples
Evaluate for.We use synthetic division as follows:
x│ x x x x
3 │ 2 −6 2 −1
│ 6 0 6
└────────────────────────
2 0 2 5
The entries in the third row are the sum of those in the first two. Each entry in the second row is the product of the -value with the third-row entry immediately to the left. The entries in the first row are the coefficients of the polynomial to be evaluated. Then the remainder of on division by is.
But by the polynomial remainder theorem, we know that the remainder is. Thus,.
In this example, if we can see that, the entries in the third row. So, synthetic division is easier to use; it can be shown to be equivalent to Horner's method.
As a consequence of the polynomial remainder theorem, the entries in the third row are the coefficients of the second-degree polynomial, the quotient of on division by.
The remainder is. This makes Horner's method useful for polynomial long division.
Divide by :
2 │ 1 −6 11 −6
│ 2 −8 6
└────────────────────────
1 −4 3 0
The quotient is.
Let and. Divide by using Horner's method.
0.5 │ 4 −6 0 3 −5
│ 2 −2 −1 1
└───────────────────────
2 −2 −1 1 −4
The third row is the sum of the first two rows, divided by. Each entry in the second row is the product of with the third-row entry to the left. The answer is
Efficiency
Evaluation using the monomial form of a degree polynomial requires at most additions and multiplications, if powers are calculated by repeated multiplication and each monomial is evaluated individually. The cost can be reduced to additions and multiplications by evaluating the powers of by iteration.If numerical data are represented in terms of digits, then the naive algorithm also entails storing approximately times the number of bits of : the evaluated polynomial has approximate magnitude, and one must also store itself. By contrast, Horner's method requires only additions and multiplications, and its storage requirements are only times the number of bits of. Alternatively, Horner's method can be computed with fused multiply–adds. Horner's method can also be extended to evaluate the first derivatives of the polynomial with additions and multiplications.
Horner's method is optimal, in the sense that any algorithm to evaluate an arbitrary polynomial must use at least as many operations. Alexander Ostrowski proved in 1954 that the number of additions required is minimal. Victor Pan proved in 1966 that the number of multiplications is minimal. However, when is a matrix, Horner's method is not optimal.
This assumes that the polynomial is evaluated in monomial form and no preconditioning of the representation is allowed, which makes sense if the polynomial is evaluated only once. However, if preconditioning is allowed and the polynomial is to be evaluated many times, then faster algorithms are possible. They involve a transformation of the representation of the polynomial. In general, a degree- polynomial can be evaluated using only +2 multiplications and additions.
Parallel evaluation
A disadvantage of Horner's rule is that all of the operations are sequentially dependent, so it is not possible to take advantage of instruction level parallelism on modern computers. In most applications where the efficiency of polynomial evaluation matters, many low-order polynomials are evaluated simultaneously, so it is not necessary to find parallelism within a single polynomial evaluation.If, however, one is evaluating a single polynomial of very high order, it may be useful to break it up as follows:
More generally, the summation can be broken into k parts:
where the inner summations may be evaluated using separate parallel instances of Horner's method. This requires slightly more operations than the basic Horner's method, but allows k-way SIMD execution of most of them. Modern compilers generally evaluate polynomials this way when advantageous, although for floating-point calculations this requires enabling reassociative math. Another use of breaking a polynomial down this way is to calculate steps of the inner summations in an alternating fashion to take advantage of instruction-level parallelism.
Application to floating-point multiplication and division
Horner's method is a fast, code-efficient method for multiplication and division of binary numbers on a microcontroller with no hardware multiplier. One of the binary numbers to be multiplied is represented as a trivial polynomial, where, and. Then, x is repeatedly factored out. In this binary numeral system,, so powers of 2 are repeatedly factored out.Example
For example, to find the product of two numbers and m:Method
To find the product of two binary numbers d and m:- A register holding the intermediate result is initialized to d.
- Begin with the least significant non-zero bit in m.
- If all the non-zero bits were counted, then the intermediate result register now holds the final result. Otherwise, add d to the intermediate result, and continue in step 2 with the next most significant bit in m.
Derivation
In general, for a binary number with bit values the product isAt this stage in the algorithm, it is required that terms with zero-valued coefficients are dropped, so that only binary coefficients equal to one are counted, thus the problem of multiplication or division by zero is not an issue, despite this implication in the factored equation:
The denominators all equal one, so this reduces to
or equivalently
In binary math, multiplication by a power of 2 is merely a register shift operation. Thus, multiplying by 2 is calculated in base-2 by an arithmetic shift. The factor is a right arithmetic shift, a results in no operation, and a results in a left arithmetic shift.
The multiplication product can now be quickly calculated using only arithmetic shift operations, addition and subtraction.
The method is particularly fast on processors supporting a single-instruction shift-and-addition-accumulate. Compared to a C floating-point library, Horner's method sacrifices some accuracy, however it is nominally 13 times faster and uses only 20% of the code space.
Other applications
Horner's method can be used to convert between different positional numeral systems – in which case x is the base of the number system, and the ai coefficients are the digits of the base-x representation of a given number – and can also be used if x is a matrix, in which case the gain in computational efficiency is even greater. However, for such cases faster methods are known.Polynomial root finding
Using the long division algorithm in combination with Newton's method, it is possible to approximate the real roots of a polynomial. The algorithm works as follows. Given a polynomial of degree with zeros make some initial guess such that. Now iterate the following two steps:- Using Newton's method, find the largest zero of using the guess.
- Using Horner's method, divide out to obtain. Return to step 1 but use the polynomial and the initial guess.
Example
Consider the polynomialwhich can be expanded to
From the above we know that the largest root of this polynomial is 7 so we are able to make an initial guess of 8. Using Newton's method the first zero of 7 is found as shown in black in the figure to the right. Next is divided by to obtain
which is drawn in red in the figure to the right. Newton's method is used to find the largest zero of this polynomial with an initial guess of 7. The largest zero of this polynomial which corresponds to the second largest zero of the original polynomial is found at 3 and is circled in red. The degree 5 polynomial is now divided by to obtain
which is shown in yellow. The zero for this polynomial is found at 2 again using Newton's method and is circled in yellow. Horner's method is now used to obtain
which is shown in green and found to have a zero at −3. This polynomial is further reduced to
which is shown in blue and yields a zero of −5. The final root of the original polynomial may be found by either using the final zero as an initial guess for Newton's method, or by reducing and solving the linear equation. As can be seen, the expected roots of −8, −5, −3, 2, 3, and 7 were found.
Divided difference of a polynomial
Horner's method can be modified to compute the divided difference Given the polynomialproceed as follows
At completion, we have
This computation of the divided difference is subject to less round-off error than evaluating and separately, particularly when.
Derivative of a polynomial
Substituting in this method gives, the derivative of. Evaluating a polynomial and its derivative at a point is useful for root-finding via Newton's method.History
Horner's paper, titled "A new method of solving numerical equations of all orders, by continuous approximation", was before the Royal Society of London, at its meeting on July 1, 1819, with a sequel in 1823. Horner's paper in Part II of Philosophical Transactions of the Royal Society of London for 1819 was warmly and expansively welcomed by a in the issue of The Monthly Review: or, Literary Journal for April, 1820; in comparison, a technical paper by Charles Babbage is dismissed curtly in this review. The sequence of reviews in The Monthly Review for September, 1821, concludes that Holdred was the first person to discover a direct and general practical solution of numerical equations. Fuller showed that the method in Horner's 1819 paper differs from what afterwards became known as "Horner's method" and that in consequence the priority for this method should go to Holdred.Unlike his English contemporaries, Horner drew on the Continental literature, notably the work of Arbogast. Horner is also known to have made a close reading of John Bonneycastle's book on algebra, though he neglected the work of Paolo Ruffini.
Although Horner is credited with making the method accessible and practical, it was known long before Horner. In reverse chronological order, Horner's method was already known to:
- Paolo Ruffini in 1809
- Isaac Newton in 1669
- the Chinese mathematician Zhu Shijie in the 14th century
- the Chinese mathematician Qin Jiushao in his Mathematical Treatise in Nine Sections in the 13th century
- the Persian mathematician Sharaf al-Dīn al-Ṭūsī in the 12th century
- the Chinese mathematician Jia Xian in the 11th century The Nine Chapters on the Mathematical Art, a Chinese work of the Han dynasty edited by Liu Hui.
Ulrich Libbrecht concluded: It is obvious that this procedure is a Chinese invention ... the method was not known in India. He said, Fibonacci probably learned of it from Arabs, who perhaps borrowed from the Chinese. The extraction of square and cube roots along similar lines is already discussed by Liu Hui in connection with Problems IV.16 and 22 in Jiu Zhang Suan Shu, while Wang Xiaotong in the 7th century supposes his readers can solve cubics by an approximation method described in his book Jigu Suanjing.