chain rule (several variables)
The chain rule is a theorem of analysis that governs derivatives ofcomposed functions. The basic theorem is the chain rule for functionsof one variables (see here (http://planetmath.org/ChainRule)). This entryis devoted to the more general version involving functions of severalvariables and partial derivatives
. Note: the symbol will beused to denote the partial derivative with respect to the variable.
Let and be differentiablefunctions of several variables, and let
be the function determined by the composition of withThe partial derivatives of are given by
The chain rule can be more compactly (albeit less precisely) expressedin terms of the Jacobi-Legendre partial derivative symbols(http://members.aol.com/jeff570/calculus.htmlhistorical note). Just as inthe Leibniz system, the basic idea is that of one quantity (i.e.variable) depending on one or more other quantities. Thus we wouldspeak about a variable depends differentiably on ,which in turn depend differentiably on variables . Wewould then write the chain rule as
The most general, and conceptually clear approach to themulti-variable chain is based on the notion of a differentiablemapping, with the Jacobian matrix of partial derivatives playing therole of generalized derivative. Let, and be open domains and let
be differentiable mappings. In essence, the symbol represents functions of variables each:
whereas represents functions of variables each.The derivative of such mappings is no longer a function, but rather amatrix of partial derivatives, customarily called the Jacobian matrix.Thus
The chain rule now takes the same form as it did for functions of onevariable:
albeit with matrixmultiplication taking the place of ordinary multiplication.
This form of the chain rule also generalizes quite nicely to the evenmore general setting where one is interested in describing thederivative of a composition of mappings between manifolds.