For example I have a sentence
"He is so .... cool!"
Then I remove all the punctuation and make it in a list.
["He", "is", "so", "", "cool"]
How do I remove or ignore the empty string?
For example I have a sentence
"He is so .... cool!"
Then I remove all the punctuation and make it in a list.
["He", "is", "so", "", "cool"]
How do I remove or ignore the empty string?
You can use filter, with None as the key function, which filters out all elements which are Falseish (including empty strings)
>>> lst = ["He", "is", "so", "", "cool"]
>>> filter(None, lst)
['He', 'is', 'so', 'cool']
Note however, that filter returns a list in Python 2, but a generator in Python 3. You will need to convert it into a list in Python 3, or use the list comprehension solution.
Falseish values include:
False
None
0
''
[]
()
# and all other empty containers
list in Python 2 and a generator in Python 3.['']?Trueish.False inside a True, ehI'll give you the answer to the question you should have asked -- how to avoid the empty string altogether. I assume you do something like this to get your list:
>>> "He is so .... cool!".replace(".", "").split(" ")
['He', 'is', 'so', '', 'cool!']
The point is that you use .split(" ") to split on space characters. However, if you leave out the argument to split, this happens:
>>> "He is so .... cool!".replace(".", "").split()
['He', 'is', 'so', 'cool!']
Quoth the docs:
If sep is not specified or is None, a different splitting algorithm is applied: runs of consecutive whitespace are regarded as a single separator, and the result will contain no empty strings at the start or end if the string has leading or trailing whitespace.
So you really don't need to bother with the other answers (except Blender's, which is a totally different approach), because split can do the job for you!
>>> from string import punctuation
>>> text = "He is so .... cool!"
>>> [w.strip(punctuation) for w in text.split() if w.strip(punctuation)]
['He', 'is', 'so', 'cool']
w.strip(punctuation) twice, l = [w.strip(punctuation) for w in text.split()] and then [w for w in l if w] wuld be preferable...