proof of arithmetic-geometric means inequality using Lagrange multipliers
As an interesting example of the Lagrange multiplier method,we employ it to prove the arithmetic-geometric means inequality:
with equality if and only if all the are equal.
To begin with, define by .
Fix (the arithmetic mean),and consider the set .This is clearly a closed and bounded set in , and hence is compact
.Therefore on has both a maximum and minimum.
is almost a manifold (a differentiable surface), except that it has a boundary consistingof points with for some .Clearly attains the minimum zero on this boundary,and on the “interior” of ,that is , is positive
. Now is a manifold, and the maximum of on is evidently a local maximum
on . Hencethe Lagrange multiplier method may be applied.
We have the constraint11Although does not constrain that ,this does not really matter — for those who are worried about suchthings, a rigorous way to escape this annoyance is to simplydefine to be zero whenever for some .Clearly the new cannot have a maximum at such points,so we may simply ignore the points with when solving the system.And for the same reason, the fact that fails to be differentiable at the boundarypoints is immaterial. under which we are to maximize .
We compute:
If we take the product, , of the extreme left- and right- hand sides of this equation,we obtain
from which it follows that ( is obviously inadmissible).Then substituting back, we get , which implies that the are all equal to ,The value of at this point is .
Since we have obtained a unique solution to the Lagrange multiplier system,this unique solution must be the unique maximum point of on .So amongst all numbers whose arithmetic mean is ,with equality occurring if and only if all the are equal to .
The case , which was not included in the above analysis, is trivial.
Proof of concavity
The question of whether the point obtained from solving Lagrange systemis actually a maximum for was taken care of by our preliminary compactness argument. Another popular way to determine whether a given stationary pointis a local maximum or minimum is by studying the Hessian
of the functionat that point.For the sake of illustrating the relevant techniques,we will prove again that is a local maximum for ,this time by showing that the Hessian is negative definite
.
Actually, it turns out to be not muchharder to show that is weakly concave everywhereon , not just on .A plausibility argument for this factis that the graph of is concave, and since also involves an th root,it should be concave too. We will prove concavity of byshowing that the Hessian of is negative semi-definite.
Since is a convex22If is not convex, then the arguments that follow will not work.In general, a prescription to determine whether a critical pointis a local minimum or maximum can be foundin tests for local extrema in Lagrange multiplier method. set, the restrictionof to will also be weakly concave;then we can conclude thatthe critical point on must be a global minimum on .
We begin by calculating second-order derivatives:
(The symbol is the Kronecker delta.)Thus the Hessian matrix is:
Now if we had actual numbers to substitute for the ,then we can go to a computer and ask if the matrix is negative definite or not.But we want to prove global concavity, so we have to employ some cleveralgebra, and work directly from the definition.
Let be a test vector.Negative semi-definiteness means for all .So let us write out:
We would be done if we can prove
(1) |
But look, this is just the Cauchy-Schwarz inequality,concerning the dot product of the vectors and !So is negative semi-definite everywhere,i.e. is weakly concave.
Moreover, the case of equality in the Cauchy-Schwarz inequality (1)only occurs when is parallel to ,i.e. for some scalar .Notice that is the normal vector tothe tangent space
of ,so unless .This means, equality never holds in (1)if is restricted to the tangent space of .In turn, this means that is strictly concaveon the convex set , andso is a strict global minimum of on .
Proof of inequality by symmetry argument
Observe that the maximum point is pleasantly symmetric.We can show that the solution has to be symmetric,by exploiting the symmetry
of the function and the set .
We have already calculated that is strictly concave on ;this was independent of the Lagrange multiplier method.Since is strictly concave on a closed convex set,it has a unique maximum there; call it .
Pick any indices and , ranging from to , and formthe linear transformation , which switches the th and th coordinatesof its argument.
By definition, for all .But , so this means for all .But is mapped to itself by the transformation ,so this is the same as saying that for all .But the global maximum is unique, so ,that is, .
Note that the argument in this last proofis extremely general — it takes the form:if is the unique maximum of ,and is any symmetry of such that ,then . This kind of symmetry argument was used to great effectby Jakob Steiner in his famous attempted proof of the isoperimetric inequality:among all closed curves of a given arc length
, the circle maximizesthe area enclosed. However, Steiner did not realize thatthe assumption
that there is a unique maximizing curvehas to be proved. Actually, that is the hardest part of the proof —if the assumption is known, then one may easily calculate using the Lagrange multiplier methodthat the maximizing curve must be a circle!