The Math Forum

Search All of the Math Forum:

Views expressed in these public forums are not endorsed by NCTM or The Math Forum.

Math Forum » Discussions » sci.math.* » sci.stat.math

Notice: We are no longer accepting new posts, but the forums will continue to be readable.

Topic: Which sample variance should I choose?
Replies: 7   Last Post: Sep 5, 2011 3:50 AM

Advanced Search

Back to Topic List Back to Topic List Jump to Tree View Jump to Tree View   Messages: [ Previous | Next ]
Steven D'Aprano

Posts: 21
Registered: 3/22/11
Which sample variance should I choose?
Posted: Aug 31, 2011 1:06 AM
  Click to see the message monospaced in plain text Plain Text   Click to reply to this topic Reply

The population variance is given by:

?^2 = ?(x - µ)^2 / n

with µ = population mean, the summation being over all the x in the

(for brevity, I haven't attempted to show subscript-i on the x).

If you don't have the entire population, you can estimate the variance with
the sample variance:

s^2 = ?(x - m)^2 / n (Eq. 1)

where m = sample mean (usually written as x bar), and the sum is over all
the x in the sample. A second estimator is:

s^2 = ?(x - m)^2 / (n-1) (Eq. 2)

which some people prefer because it is unbiased (that is, the average of all
the possible sample variances equals the true population variance if you
use the (n-1) version).

See also

I have a set of data with an (allegedly) known population mean µ but an
unknown ?^2. I wish to estimate ?^2. Under what circumstances should I
prefer Eq.1 over Eq.2?

Or should I ignore the sample mean altogether, and plug the known population
mean into one of the two equations? I.e.:

s^2 = ?(x - µ)^2 / n (Eq. 3)
s^2 = ?(x - µ)^2 / (n-1) (Eq. 4)

Under what circumstances should I prefer each of these four estimators of
?^2 and what are the pros and cons of each?

Thanks in advance,


Point your RSS reader here for a feed of the latest messages in this topic.

[Privacy Policy] [Terms of Use]

© The Math Forum at NCTM 1994-2018. All Rights Reserved.