请输入您要查询的字词:

 

单词 LagrangeMultipliersOnManifolds
释义

Lagrange multipliers on manifolds


We discuss in this article the theoretical aspects ofthe Lagrange multiplier method.

To enhance understanding,proofs and intuitive explanations of the Lagrange multipler methodwill be given from several different viewpoints,both elementary and advanced.

Contents:
  • 1 Statements of theorem
    • 1.1 Formulation with differential forms
    • 1.2 Formulation with gradients
    • 1.3 Formulation with tangent maps
  • 2 Proofs
    • 2.1 Beautiful abstract proof
    • 2.2 Clumsy, but down-to-earth proof
  • 3 Intuitive interpretations
    • 3.1 Normals to tangent hyperplanes
    • 3.2 With infinitesimals
    • 3.3 As rates of substitution
  • 4 Stationary points

1 Statements of theorem

Let N be a n-dimensional differentiable manifold (without boundary), and f:N, andgi:N, for i=1,,k, be continuously differentiable.Set M=i=1kgi-1({0}).

1.1 Formulation with differential forms

Theorem 1.

Suppose dgi are linearly independentMathworldPlanetmathat each point of M.If pM is a local minimumMathworldPlanetmath or maximum point of f restricted to M,then there exist Lagrange multipliersλ1,,λkR, depending on p,such that

df(p)=λ1dg1(p)++λkdgk(p).

Here, d denotes the exterior derivativeMathworldPlanetmath.

Of course, as in one-dimensional calculus, the condition df(p)=iλidgi(p) by itself does not guarantee p is aminimum or maximum point, even locally.

1.2 Formulation with gradients

The version of Lagrange multipliers typicallyused in calculus is the special caseN=n in Theorem 1.In this case,the conclusionMathworldPlanetmath of thetheorem can also be writtenin terms of gradientsMathworldPlanetmath instead of differential forms:

Theorem 2.

Suppose gi are linearly independentat each point of M.If pM is a local minimum or maximum point of f restricted to M,then there exist Lagrange multipliersλ1,,λkR, depending on p,such that

f(p)=λ1g1(p)++λkgk(p).

This formulation and the first oneare equivalentMathworldPlanetmathPlanetmathPlanetmathPlanetmathPlanetmath sincethe 1-form df can be identified with the gradientf, via the formulaMathworldPlanetmathPlanetmath f(p)v=df(p;v)=dfp(v).

1.3 Formulation with tangent maps

The functions gi can also be coalesced into a vector-valued functionPlanetmathPlanetmathg:Nk. Then we have:

Theorem 3.

Let g=(g1,,gk):NRk.Suppose the tangent map Dg is surjectivePlanetmathPlanetmathat each point of M.If pM is a local minimum or maximum point of f restricted to M,thenthere exists a Lagrange multiplier vector λ(Rk)*,depending on p, such that

Df(p)=Dg(p)*λ.

Here, Dg(p):(Rk)*(TpN)*denotes the pullback of the lineartransformation (http://planetmath.org/DualHomomorphism) Dg(p):TpNRk.

If Dg is represented by its Jacobian matrix,then the condition that it be surjective is equivalent toits Jacobian matrix having full rank.

Note the deliberate use of the space (k)* instead of k— to which the former is isomorphic to —for the Lagrange multiplier vector. It turns out that theLagrange multiplier vector naturallylives in the dual spaceMathworldPlanetmathPlanetmathPlanetmath and not the original vector spaceMathworldPlanetmath k.This distinction is particularly important in the infinite-dimensionalgeneralizationsPlanetmathPlanetmath of Lagrange multipliers.But even in the finite-dimensional setting,we do see hints that the dual spacehas to be involved, because a transposeMathworldPlanetmath is involvedin the matrix expression for Lagrange multipliers.

If the expression Dg(p)*λ is writtenout in coordinatesMathworldPlanetmathPlanetmath, then it is apparent that the components λiof the vector λ are exactly thoseLagrange multipliers from Theorems 1 and 2.

2 Proofs

The proof of the Lagrange multiplier theorem is surprisingly short and elegant,when properly phrased in the languagePlanetmathPlanetmath of abstract manifolds anddifferential forms.

However, for the benefit of the readers not versed in these topics,we provide, in addition to the abstract proof, a concrete translationMathworldPlanetmathPlanetmath of the argumentsMathworldPlanetmathin the more familiar setting N=n.

2.1 Beautiful abstract proof

Proof.

Since dgi are linearly independent at each point of M=i=1kgi-1({0}),M is an embedded submanifold of N,of dimensionMathworldPlanetmathPlanetmath m=n-k. Let α:UM, with U open in m, bea coordinate chart for M such that α(0)=p.Then α*f has a local minimum or maximum at 0,and therefore 0=d(α*f)=α*df at 0.But α* at p is an isomorphismPlanetmathPlanetmathPlanetmath (TpM)*(T0m)*,so the preceding equation says that df vanishes on TpM.

Now, by the definition of gi, we have α*gi=0,so 0=d(α*gi)=α*dgi. So like df, dgi vanishes on TpM.

In other words, dgi(p) is in the annihilatorMathworldPlanetmathPlanetmathPlanetmath (http://planetmath.org/AnnihilatorOfVectorSubspace)(TpM)0 of the subspaceMathworldPlanetmathPlanetmathPlanetmath TpMTpN.Since TpM has dimension m=n-k, and TpN has dimension n,the annihilator (TpM)0 has dimension k.Now dgi(p)(TpM)0 are linearly independent,so they must in fact be a basis for (TpM)0.But we had argued that df(p)(TpM)0.Therefore df(p) may be written as a unique linear combinationMathworldPlanetmathof the dgi(p):

df(p)=λ1dg1(p)++λkdgk(p).

The last paragraph of the previous proof can also be rephrased,based on the same underlying ideas,to make evident the fact that the Lagrange multiplier vectorlives in the dual space (k)*.

Alternative argument..

A general theorem in linear algebra states that for any lineartransformation L,the image of the pullback L*is the annihilator of the kernel of L.Since kerDg(p)=TpMand df(p)(TpM)0,it immediately follows that λ(k)*exists such that df(p)=Dg(p)*λ.∎

Yet another proofcould be devised by observingthat the result is obvious if N=n and the constraint functionsare just coordinate projections on n:

gi(y1,,yn)=yi,i=1,,k.

We clearly must havef/yk+1==f/yn=0at a point p that minimizes f(y) over y1==yk=0.The general case can be deduced to this by a coordinate change:

Alternate argument..

Since dgi are linearly independent,we can find a coordinate chart for N about the point p,with coordinate functions y1,,yn:Nsuch that yi=gi for i=1,,k.Then

df=fy1dy1++fyndyn
=fg1dg1++fgkdgk+fyk+1dyk+1++fyndyn,

but f/yk+1==f/yn=0at the point p. Set λi=f/gi at p.∎

2.2 Clumsy, but down-to-earth proof

Proof.

We assume that N=n.Consider the list vector g=(g1,,gk) discussed earlier,and its Jacobian matrix Dg in EuclideanPlanetmathPlanetmath coordinates.The ith row of this matrixis

[gix1gixn]=(gi)T.

So the matrix Dg has full rank (i.e. rankDg=k) if and onlyif the k gradients gi are linearly independent.

Consider each solution qM of g(q)=0.Since Dg has full rank, we can apply the implicit function theorem,which states that there exist smooth solution parameterizationsα:UM around each point qM. (U is an open set in m, m=n-k.)These α are the coordinate charts which give to M=g-1({0}) a manifold structureMathworldPlanetmath.

We now consider specially the point q=p; without loss of generality, assume α(0)=p.Then fα is a function on Euclidean spaceMathworldPlanetmath having a local minimum or maximum at 0,so its derivativePlanetmathPlanetmath vanishes at 0.Calculating by the chain ruleMathworldPlanetmath, we have0=D(fα)(0)=Df(p)Dα(0).In other words, kerDf(p)range of Dα(0)=TpM.Intuitively, this says that the directional derivativesMathworldPlanetmathat p of f lying in the tangent spaceMathworldPlanetmathPlanetmath TpM of the manifold M vanish.

By the definition of g and α, we have gα=0.By the chain rule again, we derive 0=Dg(p)Dα(0).

Let the columns of Dα(0) be the column vectors v1,,vm, which span them-dimensional space TpM,and look at the matrix equation 0=Df(p)Dα(0) again.The equation for each entry of this matrix, which consists of only one row, is:

f(p)vj=0,j=1,,m.

In other words, f(p) is orthogonalMathworldPlanetmathPlanetmathPlanetmath to v1,,vm,and hence it is orthogonal to the entire tangent space TpM.

Similarly, the matrix equation 0=Dg(p)Dα(0) can be split into individualscalar equations:

gi(p)vj=0,i=1,,k,j=1,,m.

Thus gi(p) is orthogonal to TpM.But gi(p) are, by hypothesisMathworldPlanetmath, linearly independent,and there are k of these gradients, so they must form a basis forthe orthogonal complementMathworldPlanetmath of TpM, of n-m=k dimensions.Hence f(p) can be written as a unique linear combination of gi(p):

f(p)=λ1g1(p)++λkgk(p).

3 Intuitive interpretations

We now discuss the intuitive and geometricinterpretationsMathworldPlanetmath of Lagrange multipliers.

3.1 Normals to tangent hyperplanes

Each equation gi=0 defines a hypersurface Mi in n, a manifold of dimension n-1.If we consider the tangentPlanetmathPlanetmathPlanetmath hyperplaneMathworldPlanetmathPlanetmath at p of these hypersurfaces, TpMi, the gradient gi(p)gives the normal vectorMathworldPlanetmath to these hyperplanes.

The manifold M is the intersectionMathworldPlanetmath of the hypersurfaces Mi.Presumably, the tangent space TpM is the intersection of the TpMi, and the subspace perpendicularMathworldPlanetmathPlanetmath to TpM would be spanned by the normals gi(p).Now, the direction derivatives at p of f with respect to each vector in TpM, as we have proved,vanish. So the direction of f(p), the directionof the greatest change in f at p, should be perpendicularto TpM. Hence f(p) can be written as a linear combination of the gi(p).

Note, however, that this geometric picture, and the manipulations with the gradients f(p)and gi(p), do not carry over to abstract manifolds.The notions of gradients and normals to surfaces depend on theinner productMathworldPlanetmath structure of n, which isnot present in an abstract manifold (without a Riemannian metricMathworldPlanetmath).

On the other hand, this explains the mysterious appearance of annihilators inthe last paragraph of the abstract proof.Annihilators and dual space theory serve as the proper toolsto formalize the manipulations we made with the matrix equations 0=Df(p)Dα(0) and 0=Dg(p)Dα(0), without resorting to Euclidean coordinates, which, of course, are not even defined on an abstract manifold.

3.2 With infinitesimals

If we are willing to interpret the quantities df and dgi as infinitesimalsMathworldPlanetmathPlanetmath,even the abstract version of the result has an intuitive explanation.Suppose we are at the point p of the manifold M,and consider an infinitesimal movement Δp about this point.The infinitesimal movement Δp is a vector in the tangent spaceTpM, because, near p, M looks like the linear space TpM.And as p moves, the function f changes by a corresponding infinitesimal amount dfthat is approximately linear in Δp.

Furthermore, the change df may be decomposedas the sum of a change as p moves along the manifold M,and a change as p moves out of the manifold M.But if f has a local minimum at p, then there cannot beany change of f along M; thus f only changeswhen moving out of M.Now M is described by the equations gi=0,so a movement out of M is described by the infinitesimal changesdgi.As df is linear in the change Δp,we ought to be able to write it as a weighted sum of the changes dgi.The weights are, of course, the Lagrange multipliers λi.

The linear algebra performed in the abstract proof can be regarded as the precise, rigoroustranslation of the preceding argument.

3.3 As rates of substitution

Observe that the formula for Lagrange multipliers is formallyvery similarMathworldPlanetmathPlanetmath to the standard formula for expressinga differential form in terms of a basis:

df(p)=fy1dy1++fykdyk.

In fact, if dgi(p) are linearly independent,then they do form a basis for (TpM)0,that can be extended to a basis for (TpN)*.By the uniqueness of the basis representation,we must have

λi=fgi.

That is, λi is the differentialMathworldPlanetmathof f with respect to changes in gi.

In applications of Lagrange multipliers to economicproblems, the multipliers λi are rates of substitution —they give the rate of improvement in the objective function fas the constraints gi are relaxed.

4 Stationary points

In applications,sometimes we are interested infinding stationary points p of f — defined aspoints p such that df vanishes on TpM, or equivalently,that the Taylor expansion of f at p, under any system of coordinatesfor M, has no terms of first order. Then the Lagrange multiplier methodworks for this situation too.

The following theorem incorporatesthe more general notion of stationary points.

Theorem 4.

Let N be a n-dimensional differentiable manifold (without boundary), and f:NR,gi:NR, for i=1,,k,be continuously differentiable. Suppose pM=i=1kgi-1({0}), anddgi(p) are linearly independent.

Then p is a stationary point (e.g. a local extremum point) of f restricted to M,if and only if there exist λ1,,λkRsuch that

df(p)=λ1dg1(p)++λkdgk(p).

The Lagrange multipliers λi, which depend on p, are unique when they exist.

In this formulation, M is not necessarily a manifold,but it is one when intersected with a sufficiently small neighborhood about p.So it makes sense to talk about TpM, although we are abusing notation here.The subspace in question can be more accurately described as theannihilated subspace of span{dgi(p)}.

It is also enough that dgi be linearly independentonly at the point p.For dgi are continuousMathworldPlanetmathPlanetmath, so they will belinearly independent for points nearp anyway,and we may restrict our viewpoint to a sufficiently small neighborhoodaround p, and the proofs carry through.

The proof involves only simple modifications tothat of Theorem 1 — for instance,the converse implication followsbecause we have already proved that the dgi(p) form a basis for the annihilator of TpM,independently of whether or not p is a stationary point of f on M.

References

  • 1 Friedberg, Insel, Spence. Linear Algebra. Prentice-Hall, 1997.
  • 2 David Luenberger. Optimization by Vector Space Methods.John Wiley & Sons, 1969.
  • 3 James R. Munkres. Analysis on Manifolds. Westview Press, 1991.
  • 4 R. Tyrrell Rockafellar. “Lagrange Multipliers and Optimality”. SIAM Review. Vol. 35, No. 2, June 1993.
  • 5 Michael Spivak. Calculus on Manifolds. Perseus Books, 1998.
随便看

 

数学辞典收录了18232条数学词条,基本涵盖了常用数学知识及数学英语单词词组的翻译及用法,是数学学习的有利工具。

 

Copyright © 2000-2023 Newdu.com.com All Rights Reserved
更新时间:2025/5/4 5:36:53