释义 |
Least Squares FittingA mathematical procedure for finding the best fitting curve to a given set of points by minimizing the sum of the squares of theoffsets (``the residuals'') of the points from the curve. The sum of the squares of the offsets is used instead of theoffset absolute values because this allows the residuals to be treated as a continuous differentiable quantity. However, becausesquares of the offsets are used, outlying points can have a disproportionate effect on the fit, a property which may or may notbe desirable depending on the problem at hand.
data:image/s3,"s3://crabby-images/c3deb/c3debff0c1cffc59d515b5c4e42764b930ac185b" alt="" In practice, the vertical offsets from a line are almost always minimized instead of the perpendicular offsets. Thisallows uncertainties of the data points along the - and -axes to be incorporated simply, and also provides a much simpleranalytic form for the fitting parameters than would be obtained using a fit based on perpendicular distances. In addition, thefitting technique can be easily generalized from a best-fit line to a best-fit polynomial when sums of verticaldistances are used (which is not the case using perpendicular distances). For a reasonable number of noisy data points, thedifference between vertical and perpendicular fits is quite small.
The linear least squares fitting technique is the simplest and most commonly applied form of Linear Regression andprovides a solution to the problem of finding the best fitting straight line through a set of points. In fact, if thefunctional relationship between the two quantities being graphed is known to within additive or multiplicative constants, it iscommon practice to transform the data in such a way that the resulting line is a straight line, say by plotting vs. instead of vs. . For this reason, standard forms for Exponential, Logarithmic, and Powerlaws are often explicitly computed. The formulas for linear least squares fitting were independently derived by Gauß andLegendre. data:image/s3,"s3://crabby-images/6a0ce/6a0cef61e660f6b4f2958d9d9d2330f92837b84a" alt=""
For Nonlinear Least Squares Fitting to a number of unknown parameters, linear least squares fitting may be appliediteratively to a linearized form of the function until convergence is achieved. Depending on the type of fit and initialparameters chosen, the nonlinear fit may have good or poor convergence properties. If uncertainties (in the most general case,error ellipses) are given for the points, points can be weighted differently in order to give the high-quality points moreweight.
The residuals of the best-fit line for a set of points using unsquared perpendicular distances of points are given by
data:image/s3,"s3://crabby-images/759f9/759f912a4eea27f6e5e2bc59f0b52f671648f667" alt="" | (1) |
Since the perpendicular distance from a line to point is given by
data:image/s3,"s3://crabby-images/14978/14978da7b4b06af8526d7118a550bba603ab6b7e" alt="" | (2) |
the function to be minimized is
data:image/s3,"s3://crabby-images/ee38b/ee38b82457abc071170166b8ee0c5bf2fc13703b" alt="" | (3) |
Unfortunately, because the absolute value function does not have continuous derivatives, minimizing is notamenable to analytic solution. However, if the square of the perpendicular distances
data:image/s3,"s3://crabby-images/71789/717896f6956029c075ad507c866bae489ec74321" alt="" | (4) |
is minimized instead, the problem can be solved in closed form. is a minimum when (suppressing the indices)
data:image/s3,"s3://crabby-images/973d0/973d0b57656d6691fda2d386dd464a4a41e848e4" alt="" | (5) |
and
data:image/s3,"s3://crabby-images/774ab/774ab779da497c538cd139ae7b2be1cd6e5f73d5" alt="" | (6) |
The former gives
data:image/s3,"s3://crabby-images/2b02d/2b02dbaceac6e02781c6229ff75c227828a31395" alt="" | (7) |
and the latter
data:image/s3,"s3://crabby-images/89e4f/89e4f77299626986ddf31089078c3b41dfc790a3" alt="" | (8) |
But
data:image/s3,"s3://crabby-images/56be6/56be68132343ba6e9bc72912b09787448c268510" alt="" | (9) |
so (8) becomes
data:image/s3,"s3://crabby-images/491a5/491a597ff8bd1e300a6124cb8d85f6a49bb4cc65" alt="" | (10) |
data:image/s3,"s3://crabby-images/19209/192098e5f00ccc0ef2f4c3500977ee26e7e1d0a8" alt="" | | data:image/s3,"s3://crabby-images/828e0/828e01dd50ce0d3512ba015d3b8903ceafa6821c" alt="" | (11) |
data:image/s3,"s3://crabby-images/bca9c/bca9c590e731350072c611f539c494561aef3b2f" alt="" | (12) |
Plugging (7) into (12) then givesdata:image/s3,"s3://crabby-images/d5414/d541428f3d5cc321bf6e593a9c1935501cb736a6" alt="" | | data:image/s3,"s3://crabby-images/3f12d/3f12d5333a0a1e83c2a0cc55378c790269d06305" alt="" | (13) | After a fair bit of algebra, the result is
data:image/s3,"s3://crabby-images/2a4ae/2a4aede66944dfb62d2b66beea9b61ac12e00482" alt="" | (14) |
So define
and the Quadratic Formula gives
data:image/s3,"s3://crabby-images/d8a2d/d8a2d63e1be9c101f8e2f4b7a9687835ec20a811" alt="" | (16) |
with found using (7). Note the rather unwieldy form of the best-fit parameters in the formulation. In addition,minimizing for a second- or higher-order Polynomial leads to polynomial equations having higher order, so this formulation cannot be extended.
Vertical least squares fitting proceeds by finding the sum of the squares of the vertical deviations of a setof data points
data:image/s3,"s3://crabby-images/13eca/13ecacad4d9c73f1990666b1848d579be230f838" alt="" | (17) |
from a function . Note that this procedure does not minimize the actual deviations from the line (which would bemeasured perpendicular to the given function). In addition, although the unsquared sum of distances might seem a moreappropriate quantity to minimize, use of the absolute value results in discontinuous derivatives which cannot be treatedanalytically. The square deviations from each point are therefore summed, and the resulting residual is then minimized tofind the best fit line. This procedure results in outlying points being given disproportionately large weighting.
The condition for to be a minimum is that
data:image/s3,"s3://crabby-images/ba5a0/ba5a08f8eb0fef537a9c862853921a43bdbf585e" alt="" | (18) |
for , ..., . For a linear fit,
data:image/s3,"s3://crabby-images/7870f/7870fc640b858f5c544f4a168eb6295fd11486a1" alt="" | (19) |
so
data:image/s3,"s3://crabby-images/bde60/bde60e722d241a6b701abedc928d333f2d1e581e" alt="" | (20) |
data:image/s3,"s3://crabby-images/e912d/e912d2e04546a5b0da25ab65a781c2e05edf5cb3" alt="" | (21) |
data:image/s3,"s3://crabby-images/1049b/1049b51e34d962f476dbc17b3849f3938c8df26e" alt="" | (22) |
These lead to the equations
data:image/s3,"s3://crabby-images/2ec22/2ec22134cc914a23207a77599639d67c4dd01466" alt="" | (23) |
data:image/s3,"s3://crabby-images/47db7/47db73282a0d9b8e9aa636227302669b21ffcc85" alt="" | (24) |
where the subscripts have been dropped for conciseness. In Matrix form,
data:image/s3,"s3://crabby-images/538a0/538a09c606867b366438f9d2b9a99cb26a3ff8e8" alt="" | (25) |
so
data:image/s3,"s3://crabby-images/47a36/47a361245833bde876b5a816afa7c198f782e6ff" alt="" | (26) |
The Matrix Inverse is
data:image/s3,"s3://crabby-images/f586f/f586f0f3f03c6bf52bfd01a1c1b6d867a95ac6e6" alt="" | (27) |
so
(Kenney and Keeping 1962). These can be rewritten in a simpler form by defining the sums of squares
which are also written as
Here, is the Covariance and and are variances. Note that the quantities and can also be interpreted as the Dot Products
In terms of the sums of squares, the Regression Coefficient is given by
data:image/s3,"s3://crabby-images/606d4/606d4af1719d248b55f4605b37d6913b976628b3" alt="" | (40) |
and is given in terms of using (24) as
data:image/s3,"s3://crabby-images/69dff/69dff0096e7d312321ec055bb39ed0f810d71810" alt="" | (41) |
The overall quality of the fit is then parameterized in terms of a quantity known as the Correlation Coefficient,defined by
data:image/s3,"s3://crabby-images/56584/56584022ec3ddd5a9ca50bcdcfd05722f6b8987e" alt="" | (42) |
which gives the proportion of which is accounted for by the regression.
The Standard Errors for and are
Let be the vertical coordinate of the best-fit line with -coordinate , so
data:image/s3,"s3://crabby-images/e86c3/e86c326017433d37ba4b1e8a2ca91f4bfb2bccea" alt="" | (45) |
then the error between the actual vertical point and the fitted point is given by
data:image/s3,"s3://crabby-images/2d376/2d376abdd9fee34f718e36c6cceacc2c013409cb" alt="" | (46) |
Now define as an estimator for the variance in ,
data:image/s3,"s3://crabby-images/95173/95173c9c701cf3eba000b215c946f53b0f2d32cb" alt="" | (47) |
Then can be given by
data:image/s3,"s3://crabby-images/e72fc/e72fc04eafe13f8525c2aaac5d1c2f658133347c" alt="" | (48) |
(Acton 1966, pp. 32-35; Gonick and Smith 1993, pp. 202-204).
Generalizing from a straight line (i.e., first degree polynomial) to a th degree Polynomial
data:image/s3,"s3://crabby-images/1b352/1b3524b53e2bd2419884a9078b6c741307509809" alt="" | (49) |
the residual is given by
data:image/s3,"s3://crabby-images/a20ff/a20ffafbba204094c4172f9d46a074232c61e72b" alt="" | (50) |
The Partial Derivatives (again dropping superscripts) are
data:image/s3,"s3://crabby-images/324e8/324e89ae8593423805a255891fe37380f9a81505" alt="" | (51) |
data:image/s3,"s3://crabby-images/dc660/dc6600a79938fd9c05a3b0d4c789962bec338efd" alt="" | (52) |
data:image/s3,"s3://crabby-images/6b04f/6b04f89f84af3478242ae86c72ed6241923e30fc" alt="" | (53) |
These lead to the equations
data:image/s3,"s3://crabby-images/4df1c/4df1c20180e51f11d2dfaa2c5f8c7263c071d2cc" alt="" | (54) |
data:image/s3,"s3://crabby-images/e5d09/e5d099f15013d4b21fa0b69560f60f60e1c3068a" alt="" | (55) |
data:image/s3,"s3://crabby-images/a904a/a904a1901eba04ec57ef38ac0f17353745bc3d65" alt="" | (56) |
or, in Matrix formdata:image/s3,"s3://crabby-images/f3840/f3840b3d090a1b88eeef4f5c7371e17f7f5ad299" alt="" | | | (57) | This is a Vandermonde Matrix. We can also obtain the Matrix for a least squares fit by writing
data:image/s3,"s3://crabby-images/f8560/f8560ad4accbb2552b6f18aeab86e0bdb831f65a" alt="" | (58) |
Premultiplying both sides by the Transpose of the first Matrix then gives
data:image/s3,"s3://crabby-images/293da/293da43e44d94d0bb59df7c32468187dd8a91380" alt="" | (59) |
so
data:image/s3,"s3://crabby-images/58dc1/58dc1f82ecb797f6b07b1e1966051edec4da6aef" alt="" | (60) |
As before, given points and fitting with Polynomial Coefficients , ..., gives
data:image/s3,"s3://crabby-images/b06b6/b06b67c5053f9736435e0c2050d2bd2a202f229f" alt="" | (61) |
In Matrix notation, the equation for a polynomial fit is given by
data:image/s3,"s3://crabby-images/654fb/654fb0c2b9e2b9c6a62dcc2c9c2f24b1eeadd586" alt="" | (62) |
This can be solved by premultiplying by the Matrix Transpose ,
data:image/s3,"s3://crabby-images/25468/25468e1e8b9dee0ce5ee589f3311d58fae1ae1d0" alt="" | (63) |
This Matrix Equation can be solved numerically, or can be inverted directly if it is well formed, to yield thesolution vector
data:image/s3,"s3://crabby-images/378e8/378e86c25ab35f644b89e6d89f5c12dda731759e" alt="" | (64) |
Setting in the above equations reproduces the linear solution. See also Correlation Coefficient, Interpolation, Least Squares Fitting--Exponential, Least Squares Fitting--Logarithmic, LeastSquares Fitting--Power Law, Moore-Penrose Generalized Matrix Inverse,Nonlinear Least Squares Fitting, Regression Coefficient, Spline References
Acton, F. S. Analysis of Straight-Line Data. New York: Dover, 1966.Bevington, P. R. Data Reduction and Error Analysis for the Physical Sciences. New York: McGraw-Hill, 1969. Gonick, L. and Smith, W. The Cartoon Guide to Statistics. New York: Harper Perennial, 1993. Kenney, J. F. and Keeping, E. S. ``Linear Regression, Simple Correlation, and Contingency.'' Ch. 8 in Mathematics of Statistics, Pt. 2, 2nd ed. Princeton, NJ: Van Nostrand, pp. 199-237, 1951. Kenney, J. F. and Keeping, E. S. ``Linear Regression and Correlation.'' Ch. 15 in Mathematics of Statistics, Pt. 1, 3rd ed. Princeton, NJ: Van Nostrand, pp. 252-285, 1962. Lancaster, P. and Salkauskas, K. Curve and Surface Fitting: An Introduction. London: Academic Press, 1986. Lawson, C. and Hanson, R. Solving Least Squares Problems. Englewood Cliffs, NJ: Prentice-Hall, 1974. Nash, J. C. Compact Numerical Methods for Computers: Linear Algebra and Function Minimisation, 2nd ed. Bristol, England: Adam Hilger, pp. 21-24, 1990. Press, W. H.; Flannery, B. P.; Teukolsky, S. A.; and Vetterling, W. T. ``Fitting Data to a Straight Line'' ``Straight-Line Data with Errors in Both Coordinates,'' and ``General Linear Least Squares.'' §15.2, 15.3, and 15.4 in Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd ed. Cambridge, England: Cambridge University Press, pp. 655-675, 1992. York, D. ``Least-Square Fitting of a Straight Line.'' Canad. J. Phys. 44, 1079-1086, 1966.
|