How to delete numpy nan from a list of strings in Python?

Question

I have a list of strings

x = ['A', 'B', nan, 'D']

and want to remove the nan.

I tried:

x = x[~numpy.isnan(x)]

But that only works if it contains numbers. How do we solve this for strings in Python 3+?

@JoshLee The non object from numpy module which the OP is using. I change it to numpy so that the future askers can find the question easily. — Kasravnd
– Kasravnd, Commented Mar 23, 2017 at 18:14

Kasravnd · Accepted Answer · 2017-03-23 15:37:54Z

6

If you have a numpy array you can simply check the item is not the string nan, but if you have a list you can check the identity with is and np.nan since it's a singleton object.

In [25]: x = np.array(['A', 'B', np.nan, 'D'])

In [26]: x
Out[26]: 
array(['A', 'B', 'nan', 'D'], 
      dtype='<U3')

In [27]: x[x != 'nan']
Out[27]: 
array(['A', 'B', 'D'], 
      dtype='<U3')


In [28]: x = ['A', 'B', np.nan, 'D']

In [30]: [i for i in x if i is not np.nan]
Out[30]: ['A', 'B', 'D']

Or as a functional approach in case you have a python list:

In [34]: from operator import is_not

In [35]: from functools import partial

In [37]: f = partial(is_not, np.nan)

In [38]: x = ['A', 'B', np.nan, 'D']

In [39]: list(filter(f, x))
Out[39]: ['A', 'B', 'D']

edited Mar 23, 2017 at 15:37

answered Mar 23, 2017 at 15:32

Kasravnd

108k19 gold badges167 silver badges195 bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

Colonel Beauvel Over a year ago

aggregate things like: [i for i in x if not i in ['nan', np.nan]], +1 otherwise

Kasravnd Over a year ago

@ColonelBeauvel Yeah, that's a good idea if you don't know that kind of a data structure you're dealing with.

Josh Lee Over a year ago

NaN is not a singleton.

Kasravnd Over a year ago

@JoshLee Why? I think as far as you can't create different instances from a particular object it would be refer as singleton. Is there anything special about np.nan?

Josh Lee Over a year ago

np.nan is just some floating point constant. You wouldn't compare is math.pi either, for the same reason.

|

Horia Coman · Accepted Answer · 2017-03-23 15:28:24Z

3

You can use math.isnan and a good-old list comprehension.

Something like this would do the trick:

import math
x = [y for y in x if not math.isnan(y)]

answered Mar 23, 2017 at 15:28

Horia Coman

8,8112 gold badges25 silver badges25 bronze badges

1 Comment

hpaulj Over a year ago

Did you try math.isnan('A')? Test on the OP's x?

akuiper · Accepted Answer · 2017-03-23 15:30:47Z

1

You may want to avoid np.nan with strings, use None instead; but if you do have nan you could do this:

import numpy as np

[i for i in x if i is not np.nan]
# ['A', 'B', 'D']

answered Mar 23, 2017 at 15:30

akuiper

216k33 gold badges362 silver badges379 bronze badges

4 Comments

Josh Lee Over a year ago

NaN is not a singleton.

akuiper Over a year ago

@JoshLee I didn't say it's a singleton. I just said it might be better to use None instead of nan in string cases, which will be converted to a string nan but None stays as None.

Josh Lee Over a year ago

You're comparing with is. This will fail.

akuiper Over a year ago

@JoshLee I didn't get what you mean. It works for this case.

boot-scootin · Accepted Answer · 2017-03-23 15:32:54Z

1

You could also try this:

[s for s in x if str(s) != 'nan']

Or, convert everything to str at the beginning:

[s for s in map(str, x) if s != 'nan']

Both approaches yield ['A', 'B', 'D'].

answered Mar 23, 2017 at 15:32

boot-scootin

12.6k10 gold badges73 silver badges118 bronze badges

Collectives™ on Stack Overflow

How to delete numpy nan from a list of strings in Python?

4 Answers 4

6 Comments

1 Comment

4 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

6 Comments

1 Comment

4 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related