Filter integers in numpy float array

Question

Is there any built in function to discard integer and keep only float number in numpy.

import numpy as np

input = np.array([0.0, 0.01, 1.0, 2.0, 2.001, 2.002])

desired_ouput = some_function(input)
# Expected ouput
# desired_output = np.array([0.01, 2.001, 2.002])

All values in that array are floats. "float" doesn't imply anything about the value being non-integer. — user2357112
– user2357112, Commented Aug 30, 2018 at 19:00

Joe Iddon · Accepted Answer · 2018-08-30 10:23:36Z

18

Mask with whether each element is equal to it as an integer.

arr = np.array([0.0, 0.01, 1.0, 2.0, 2.001, 2.002])
out = arr[arr != arr.astype(int)]
#np.array([0.01, 2.001, 2.002])

answered Aug 30, 2018 at 10:23

Joe Iddon

20.5k7 gold badges38 silver badges62 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

SpghttCd · Accepted Answer · 2018-08-30 10:21:45Z

18

I don't think so. My approach would be

import numpy as np
a = np.array([0.0, 0.01, 1.0, 2.0, 2.001, 2.002])
mask = np.isclose(a, a.astype(int))

print(a[~mask])
#[ 0.01   2.001  2.002]

answered Aug 30, 2018 at 10:21

SpghttCd

10.9k2 gold badges23 silver badges28 bronze badges

2 Comments

Daniel F Over a year ago

As a general rule always use isclose when comparing floats. I'm not sure it's necessary here, but upvoted for good practice.

Ruslan Over a year ago

It's not a good practice to blindly follow what is called "good practice". isclose, depending on the source of these numbers, may actually be actively harmful.

jpp · Accepted Answer · 2018-08-30 10:21:26Z

6

I know of no in-built function. But you can create one yourself:

import numpy as np

A = np.array([0.0, 0.01, 1.0, 2.0, 2.001, 2.002])

def remove_ints(arr):
    return arr[~(arr == arr.astype(int))]

res = remove_ints(A)

array([ 0.01 ,  2.001,  2.002])

Aside, you should not use a built-in class such as input as a variable name.

answered Aug 30, 2018 at 10:21

jpp

166k37 gold badges301 silver badges362 bronze badges

Comments

user3483203 · Accepted Answer · 2018-08-30 10:55:31Z

6

I've always used np.equal with np.mod:

>>> A[~np.equal(np.mod(A, 1), 0)]
array([0.01 , 2.001, 2.002])

edited Aug 30, 2018 at 10:55

answered Aug 30, 2018 at 10:40

user3483203

51.3k10 gold badges72 silver badges104 bronze badges

Comments

Dadep · Accepted Answer · 2018-09-04 07:04:39Z

4

If you do not have to much data (short list), maybe do not need numpy:

>>> i = [0.0, 0.01, 1.0, 2.0, 2.001, 2.002]
>>> a=[j for j in i if not j.is_integer()]
>>> a
['0.01', '2.001', '2.002']

Otherwise see Joe Iddon answer

edited Sep 4, 2018 at 7:04

answered Aug 30, 2018 at 10:23

Dadep

2,7805 gold badges30 silver badges40 bronze badges

1 Comment

Joe Iddon Over a year ago

The idea of scipy is to never use for for efficiency.

MarianD · Accepted Answer · 2018-08-30 10:29:02Z

2

I don't know any builtin for this but you can filter those floats using:

filter(lambda x: int(str(x).split('.')[1]) != 0, input)

The lambda expression here checks if the decimal places are zero which I interpret as the number being an int.

edited Aug 30, 2018 at 10:29

MarianD

14.4k12 gold badges50 silver badges61 bronze badges

answered Aug 30, 2018 at 10:22

meissner_

5416 silver badges10 bronze badges

4 Comments

Oli Over a year ago

precision is lost by converting to string. e.g 2.001 becomes 2.0009999999999999.

meissner_ Over a year ago

What? I just tried replicating this but [str(x) for x in input] returned: ['0.0', '0.01', '1.0', '2.0', '2.001', '2.002']? But even if i assume a loss of accuracy, it wouldn't change the results since '001' will not come out as 0 after the int cast, just like '00099999'.

Oli Over a year ago

It happend in my python. Actually these are indexes of pandas dataframe, so loss of precision mean, i can't get back values by apply .loc method.

meissner_ Over a year ago

Well, this would be very interesting indeed if you can find a way to replicate the effect...

Mad Physicist · Accepted Answer · 2021-12-31 20:42:39Z

0

I had a similar question a while back: Numpy: Check if float array contains whole numbers. The simplest way to mask fractions that I am currently aware of is

mask = ((input % 1) != 0)

You can then apply the mask directly with

output = input[mask]

It bothered me that there is no built-in function to determine the integerness of a float quickly, so I wrote a fast ufunc that provides the functionality of float.is_integer for numpy. You can download from github and compile if you're interested:

from is_integer_ufunc import is_integer

output = input[~is_integer(input)]

I'll see if the numpy community wants to consider adding something like that to the core library. The question seems to come up often enough to justify it.

edited Dec 31, 2021 at 20:42

answered Dec 31, 2021 at 0:39

Mad Physicist

116k29 gold badges202 silver badges292 bronze badges

Collectives™ on Stack Overflow

Filter integers in numpy float array

7 Answers 7

Comments

2 Comments

Comments

Comments

1 Comment

4 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

7 Answers 7

Comments

2 Comments

Comments

Comments

1 Comment

4 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related