Calculating arithmetic mean (one type of average) in Python [duplicate]

Question

Is there a built-in or standard library method in Python to calculate the arithmetic mean (one type of average) of a list of numbers?

Average is ambiguous - mode and median are also commonly-used averages — jtlz2
– jtlz2, Commented Jun 11, 2018 at 8:13
Mode and median are other measures of central tendency. They are not averages. The mode is the most common value seen in a data set and is not necessarily unique. The median is the value that represents the center of the data points. As the question implies, there are a few different types of averages, but all are different from median and mode calculations. purplemath.com/modules/meanmode.htm — Jarom
– Jarom, Commented Aug 1, 2018 at 4:48
@Jarom That link disagrees with you: 'Mean, median, and mode are three kinds of "averages"' — Marcelo Cantos
– Marcelo Cantos, Commented Feb 7, 2019 at 3:39

compie · Accepted Answer · 2016-08-09 07:27:14Z

288

I am not aware of anything in the standard library. However, you could use something like:

def mean(numbers):
    return float(sum(numbers)) / max(len(numbers), 1)

>>> mean([1,2,3,4])
2.5
>>> mean([])
0.0

In numpy, there's numpy.mean().

edited Aug 9, 2016 at 7:27

compie

10.6k15 gold badges59 silver badges79 bronze badges

answered Oct 10, 2011 at 17:22

NPE

503k114 gold badges970 silver badges1k bronze badges

Sign up to request clarification or add additional context in comments.

7 Comments

yo' Over a year ago

A common thing is to consider that the average of [] is 0, which can be done by float(sum(l))/max(len(l),1).

zondo Over a year ago

PEP 8 says that l is a bad variable name because it looks so much like 1. Also, I would use if l rather than if len(l) > 0. See here

1 -_- Over a year ago

Why have you called max?

Simon Fakir Over a year ago

See the question above: To avoid division by zero ( for [] )

Marcelo Cantos Over a year ago

Empty lists have no mean. Please don't pretend they do.

|

Bengt · Accepted Answer · 2012-12-13 22:12:28Z

196

NumPy has a numpy.mean which is an arithmetic mean. Usage is as simple as this:

>>> import numpy
>>> a = [1, 2, 4]
>>> numpy.mean(a)
2.3333333333333335

answered Dec 13, 2012 at 22:12

Bengt

14.6k7 gold badges53 silver badges67 bronze badges

9 Comments

vcarel Over a year ago

numpy is a nightmare to install in a virtualenv. You should really consider not using this lib

user227667 Over a year ago

@vcarel: "numpy is a nightmare to install in a virtualenv". I'm not sure why you say this. It used to be the case, but for the last year or more it's been very easy.

Juan Carlos Coto Over a year ago

I must second this comment. I'm currently using numpy in a virtualenv in OSX, and there is absolutely no problem (currently using CPython 3.5).

Akseli Palén Over a year ago

With continuous integration systems like Travis CI, installing numpy takes several extra minutes. If quick and light build is valuable to you, and you need only the mean, consider.

Bengt Over a year ago

@AkseliPalén virtual environments on Travis CI can use a numpy installed via apt-get using the system site packages. This may be quick enough to use even if one only needs a mean.

|

kirbyfan64sos · Accepted Answer · 2019-05-28 01:49:52Z

192

Use statistics.mean:

import statistics
print(statistics.mean([1,2,4])) # 2.3333333333333335

It's available since Python 3.4. For 3.1-3.3 users, an old version of the module is available on PyPI under the name stats. Just change statistics to stats.

edited May 28, 2019 at 1:49

user3064538

answered Dec 28, 2013 at 22:38

kirbyfan64sos

10.8k6 gold badges58 silver badges79 bronze badges

4 Comments

Eike P. Over a year ago

Note that this is extremely slow when compared to the other solutions. Compare timeit("numpy.mean(vec)), timeit("sum(vec)/len(vec)") and timeit("statistics.mean(vec)") - the latter is slower than the others by a huge factor (>100 in some cases on my PC). This appears to be due to a particularly precise implementation of the sum operator in statistics, see PEP and Code. Not sure about the reason for the large performance difference between statistics._sum and numpy.sum, though.

Antti Haapala Over a year ago

@jhin this is because the statistics.mean tries to be correct. It calculates correctly the mean of [1e50, 1, -1e50] * 1000.

PaulMcG Over a year ago

statistics.mean will also accept a generator expression of values, which all solutions that use len() for the divisor will choke on.

Mathieu Rollet Over a year ago

Since python 3.8, there is a faster statistics.fmean function

Bengt · Accepted Answer · 2015-11-08 15:42:27Z

55

You don't even need numpy or scipy...

>>> a = [1, 2, 3, 4, 5, 6]
>>> print(sum(a) / len(a))
3

edited Nov 8, 2015 at 15:42

Bengt

14.6k7 gold badges53 silver badges67 bronze badges

answered Aug 17, 2013 at 18:29

Mumon

6315 silver badges2 bronze badges

6 Comments

Jk041 Over a year ago

then mean([2,3]) would give 2. be careful with floats. Better use float(sum(l))/len(l). Better still, be careful to check if the list is empty.

yota Over a year ago

@jesusiniesta except in python3, where division does what it is intended to do : divide

spiffytech Over a year ago

And in Python 2.2+ if you from __future__ import division at the top of your program

obayhan Over a year ago

What about big numbers and overflow?

0 _ Over a year ago

What about a = list()? The proposed code results in ZeroDivisionError.

|

Lenka Pitonakova · Accepted Answer · 2016-05-07 01:56:37Z

8

Use scipy:

import scipy;
a=[1,2,4];
print(scipy.mean(a));

edited May 7, 2016 at 1:56

answered Nov 19, 2012 at 19:11

Lenka Pitonakova

1,12315 silver badges15 bronze badges

1 Comment

Bengt Over a year ago

scipy.stats.mean is deprecated; please update your code to use numpy.mean.

Vlad Bezden · Accepted Answer · 2019-12-15 22:03:54Z

7

Instead of casting to float you can do following

def mean(nums):
    return sum(nums, 0.0) / len(nums)

or using lambda

mean = lambda nums: sum(nums, 0.0) / len(nums)

UPDATES: 2019-12-15

Python 3.8 added function fmean to statistics module. Which is faster and always returns float.

Convert data to floats and compute the arithmetic mean.

This runs faster than the mean() function and it always returns a float. The data may be a sequence or iterable. If the input dataset is empty, raises a StatisticsError.

fmean([3.5, 4.0, 5.25])

4.25

New in version 3.8.

edited Dec 15, 2019 at 22:03

answered Apr 28, 2017 at 10:56

Vlad Bezden

90.7k27 gold badges261 silver badges190 bronze badges

Comments

fariborz najafi · Accepted Answer · 2018-10-02 16:56:34Z

3

from statistics import mean
avarage=mean(your_list)

for example

from statistics import mean

my_list=[5,2,3,2]
avarage=mean(my_list)
print(avarage)

and result is

3.0

answered Oct 2, 2018 at 16:56

fariborz najafi

691 gold badge2 silver badges8 bronze badges

Comments

Mathieu Rollet · Accepted Answer · 2020-12-30 22:20:06Z

2

If you're using python >= 3.8, you can use the fmean function introduced in the statistics module which is part of the standard library:

>>> from statistics import fmean
>>> fmean([0, 1, 2, 3])
1.5

It's faster than the statistics.mean function, but it converts its data points to float beforehand, so it can be less accurate in some specific cases.

You can see its implementation here

answered Dec 30, 2020 at 22:20

Mathieu Rollet

2,3942 gold badges24 silver badges35 bronze badges

Comments

jasonleonhard · Accepted Answer · 2017-09-10 20:29:41Z

1

def avg(l):
    """uses floating-point division."""
    return sum(l) / float(len(l))

Examples:

l1 = [3,5,14,2,5,36,4,3]
l2 = [0,0,0]

print(avg(l1)) # 9.0
print(avg(l2)) # 0.0

answered Sep 10, 2017 at 20:29

jasonleonhard

14.2k1 gold badge98 silver badges71 bronze badges

Comments

Muhoza yves · Accepted Answer · 2018-07-03 11:06:23Z

1

def list_mean(nums):
    sumof = 0
    num_of = len(nums)
    mean = 0
    for i in nums:
        sumof += i
    mean = sumof / num_of
    return float(mean)

edited Jul 3, 2018 at 11:06

user9598935

answered Aug 18, 2016 at 15:09

Muhoza yves

411 silver badge5 bronze badges

Comments

PaulMcG · Accepted Answer · 2018-08-29 14:05:21Z

1

The proper answer to your question is to use statistics.mean. But for fun, here is a version of mean that does not use the len() function, so it (like statistics.mean) can be used on generators, which do not support len():

from functools import reduce
from operator import truediv
def ave(seq):
    return truediv(*reduce(lambda a, b: (a[0] + b[1], b[0]), 
                           enumerate(seq, start=1), 
                           (0, 0)))

edited Aug 29, 2018 at 14:05

answered Aug 28, 2018 at 1:30

PaulMcG

64.1k16 gold badges98 silver badges135 bronze badges

Comments

n611x007 · Accepted Answer · 2015-11-02 11:46:50Z

0

I always supposed avg is omitted from the builtins/stdlib because it is as simple as

sum(L)/len(L) # L is some list

and any caveats would be addressed in caller code for local usage already.

Notable caveats:

non-float result: in python2, 9/4 is 2. to resolve, use float(sum(L))/len(L) or from __future__ import division

division by zero: the list may be empty. to resolve:

if not L:
    raise WhateverYouWantError("foo")
avg = float(sum(L))/len(L)

edited Nov 2, 2015 at 11:46

answered Nov 2, 2015 at 11:03

n611x007

9,35210 gold badges65 silver badges102 bronze badges

Comments

Hashmatullah Noorzai · Accepted Answer · 2017-09-11 01:53:03Z

-1

Others already posted very good answers, but some people might still be looking for a classic way to find Mean(avg), so here I post this (code tested in Python 3.6):

def meanmanual(listt):

mean = 0
lsum = 0
lenoflist = len(listt)

for i in listt:
    lsum += i

mean = lsum / lenoflist
return float(mean)

a = [1, 2, 3, 4, 5, 6]
meanmanual(a)

Answer: 3.5

answered Sep 11, 2017 at 1:53

Hashmatullah Noorzai

7903 gold badges13 silver badges36 bronze badges

Collectives™ on Stack Overflow

Calculating arithmetic mean (one type of average) in Python [duplicate]

13 Answers 13

7 Comments

9 Comments

4 Comments

6 Comments

1 Comment

Comments

Comments

Comments

Examples:

Comments

Comments

Comments

Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

13 Answers 13

7 Comments

9 Comments

4 Comments

6 Comments

1 Comment

Comments

Comments

Comments

Examples:

Comments

Comments

Comments

Comments

Comments

Linked

Related