Kantorovitch’s theorem
Let be a point in an open neighborhood of in and adifferentiable mapping, with its derivative invertible. Define
If and the derivative satisfies thehttp://planetmath.org/node/765Lipschitz condition
for all points , and if the inequality
is satisfied, the equation has a uniquesolution in , and Newton’s method with initial guess converges to it. If we replace with , then it can be shownthat Newton’s method http://planetmath.org/node/793superconverges! If you want aneven stronger version, one can replace with the norm.
Logic behind the theorem:
Let’s look at the useful part of the theorem:
It is a product of three distinct properties of your function suchthat the product is less than or equal to a certain number, orbound. If we call the product , then it says that mustbe within a ball of radius . It also says that the solution is within this same ball. How was this ball defined?
The first term, , is a measure of how far thefunction is from the domain; in the Cartesian plane, it would be howfar the function is from the x-axis. Of course, if we’re solving for, we want this value to be small, because itmeans we’re closer to the axis. However a function can be annoyinglyclose to the axis, and yet just happily curve away from the axis. Thuswe need more.
The second term, is a little moredifficult. This is obviously a measure of how fast the function ischanging with respect to the domain (x-axis in the plane). The largerthe derivative, the faster it’s approaching wherever it’s going(hopefully the axis). Thus, we take the inverse of it, since we wantthis product to be less than a number. Why it’s squared though,is because it is the denominator where a product of two terms of likeunits is the numerator. Thus to conserve units with the numerator, itis multiplied by itself. Combined with the first term, this also seemsto be enough, but what if the derivative changes sharply, but itchanges the wrong way?
The third term is the Lipschitz ratio . This measures sharp changesin the first derivative, so we can be sure that if this is small, thatthe function won’t try to curve away from our goal on us too sharply.
By the way, the number is unitless, so all the units onthe left side cancel. Checking units is essential in applications,such as physics and engineering, where Newton’s method is used.