|
|
Re: Why doesn't multiple regression maximise R2
Posted:
Apr 25, 2012 2:22 PM
|
|
On Apr 25, 10:27 am, dc353 <dc...@hotmail.com> wrote: > On Wednesday, April 25, 2012 10:25:28 AM UTC-4, dc353 wrote: >> Attached is sample data that produces the following: >> >> We regress VOL on STRK_PCT and AVG_VOL and get the following equation: >> >> VOL = -.38446 + .004353 * STRK_PCT + 2.963829*AVG_VOL >> >> This produces an R2 of .940671 >> >> We then modify the equation to: >> >> VOL = -.38 + .0044*STRK_PCT +2.96*AVG_VOL and calculate the predicated values. The R2 is now .960361 >> >> My recollection is that a regression minimizes the sum of the squared vertical distances from the observations to line, while R2 is the explained variance divided by the total variance. Is there any reason why minimizing the former maximizes the latter and if so how do we explain the example in the spread sheet? > >> DATA [deleted] > > How do you scale the estimate to a least square fit?
Suppose you start with a dependent variable y and several predictors, x1,x2,... .
You solve for the least squares weights b0,b1,b2,... and form an estimate y' = b0 + b1*x1 + b2*x2 + ... .
var[y']/var[y] = R^2[y,y'],
*because* the least squares weights were used.
Then suppose you pick arbitrary weights a0,a1,a2,... and form a new estimate y" = a0 + a1*x1 + a2*x2 + ... .
var[y"]/var[y] /= R^2[y,y"],
because the least squares weights were *not* used.
To scale y", regress it on y: get least squares weights c0,c1 and form a third estimate y"' = c0 + c1*y".
var[y"']/var[y] = R^2[y,y"'] = R^2[y,y"] < R^2[y,y'].
(Actually, the intercepts b0,a0,c0 are irrelevant, because they affect neither the variance nor the correlation. They become relevant only if you expect the estimates to have the same mean as y.)
|
|