Python numpy.var returning wrong values

Question

I'm trying to do a simple variance calculation on a set of 3 numbers:

numpy.var([0.82159889, 0.26007962, 0.09818412])

which returns

0.09609366366174843

However, when you calculate the variance it should actually be

0.1441405

Seems like such a simple thing, but I haven't been able to find an answer yet.

DSM · Accepted Answer · 2014-10-09 02:52:23Z

11

As the documentation explains:

ddof : int, optional
    "Delta Degrees of Freedom": the divisor used in the calculation is
    ``N - ddof``, where ``N`` represents the number of elements. By
    default `ddof` is zero.

And so you have:

>>> numpy.var([0.82159889, 0.26007962, 0.09818412], ddof=0)
0.09609366366174843
>>> numpy.var([0.82159889, 0.26007962, 0.09818412], ddof=1)
0.14414049549262264

Both conventions are common enough that you always need to check which one is being used by whatever package you're using, in any language.

answered Oct 9, 2014 at 2:52

DSM

355k67 gold badges606 silver badges504 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

pajarraco Over a year ago

Thanks! I just figured it out before I came back to check out the answer.

Aaron Hall · Accepted Answer · 2014-10-09 03:10:22Z

3

np.var by default calculates the population variance.

The Sum of Squared Errors can be calculated as follows:

>>> vals = [0.82159889, 0.26007962, 0.09818412]
>>> mean = sum(vals)/3.0
>>> mean
0.3932875433333333
>>> sum((mean-val)**2 for val in vals)
0.2882809909852453
>>> sse = sum((mean-val)**2 for val in vals)

This is the population variance:

>>> sse/3 
0.09609366366174843
>>> np.var(vals)
0.09609366366174843

This is the sample variance:

>>> sse/(3-1)
0.14414049549262264
>>> np.var(vals, ddof=1)
0.14414049549262264

You can read more about the difference here.

edited Oct 9, 2014 at 3:10

answered Oct 9, 2014 at 3:05

Aaron Hall♦

400k93 gold badges416 silver badges342 bronze badges

Collectives™ on Stack Overflow

Python numpy.var returning wrong values

2 Answers 2

1 Comment

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Related