linear least squares fit
One of the most common uses of least squares fitting is fitting a straight line todata. Whilst, in general, it is difficult to determine the curve which best fitsthe data, in this case there is a relatively simple formula which can be used.
Theorem 1.
Suppose we have a data set . Then the straightline which best fits this set is given as
where
(1) | ||||
(2) | ||||
(3) | ||||
(4) |
Proof.
Being the best fitting line means minimizing the merit function , given as
with respect to the parameters and . Expanding the square, this can bewritten as
where are as above and
This function is a quadratic polynomial; moreover, from its definition as asum of squares, it is clear that the highest order terms are positive definite,hence it has a minimum and all that remains is to find that minimum. To do this,we set the derivatives equal to zero to obtain the following equations:
(5) | ||||
(6) |
These equations are easily solved to give
(7) | ||||
(8) |
substituting in the equation for a straight line, we obtainthe answer given above.∎
Because of the ease with which one can make a least squares fit of a line, thistechnique is often adapted to fitting other sorts of curves by making a changeof variables. Two common cases of this practice are power laws and exponentials.
Suppose that one wants to fit some data to a curve of the form .Making a change of variable and defining , the equation ofthe curve becomes . One can therefore fit the data set to a straight line.
Suppose that one wants to fit some data to a curve of the form . Makinga change of variable , and defining , the equation ofthe curve becomes . One can therefore fit the data set to a straight line.
Although convenient and common, this procedure can be a cheat because changing variablesand making a least squares fit of a line is not the same as making a least squares fitto a curve. The reason for this is that the merit functions are different and will not,in general have a minimum in the same place. However, if the data happen to approximatelylie on a power curve or an exponential, then the answer obtained by changing variablesand fitting will be an approximation to the correct answer. Depending on what one isdoing, this approximation may be good enough or one may use it as a starting point forsome algorithm to compute the correct minimum.