How to read a text file into separate lists python

Question

Say I have a text file formatted like this:

100 20 the birds are flying

and I wanted to read the int(s) into their own lists and the string into its own list...how would I go about this in python. I tried

data.append(map(int, line.split()))

that didn't work...any help?

map(int, line.split()) applies int to the entire line. What caused you to think this would separate numbers from words? — S.Lott
– S.Lott, Commented Jan 21, 2012 at 3:02

Michael0x2a · Accepted Answer · 2013-04-20 06:52:37Z

4

Essentially, I'm reading the file line by line, and splitting them. I first check to see if I can turn them into an integer, and if I fail, treat them as strings.

def separate(filename):
    all_integers = []
    all_strings = []
    with open(filename) as myfile:
        for line in myfile:
            for item in line.split(' '):
                try:
                    # Try converting the item to an integer
                    value = int(item, 10)
                    all_integers.append(value)
                except ValueError:
                    # if it fails, it's a string.
                    all_strings.append(item)
    return all_integers, all_strings

Then, given the file ('mytext.txt')

100 20 the birds are flying
200 3 banana
hello 4

...doing the following on the command line returns...

>>> myints, mystrings = separate(r'myfile.txt')
>>> print myints
[100, 20, 200, 3, 4]
>>> print mystrings
['the', 'birds', 'are', 'flying', 'banana', 'hello']

edited Apr 20, 2013 at 6:52

answered Jan 21, 2012 at 2:45

Michael0x2a

65.6k32 gold badges190 silver badges241 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

S.Lott Over a year ago

+1. This use of an exception is perfect. Remove the "I'm not happy..." business. This is good.

Michael0x2a Over a year ago

Yeah, I read somewhere that exceptions should only be used for exceptional behavior, which isn't really the case here. I removed it.

S.Lott Over a year ago

"exceptions should only be used for exceptional behavior" Not terribly true in Python. True in some languages, but not Python.

dave · Accepted Answer · 2012-01-21 02:43:18Z

3

If i understand your question correctly:

import re

def splitList(list):
    ints = []
    words = []
    for item in list:
        if re.match('^\d+$', item):
           ints.append(int(item))
        else:
           words.append(item)
    return ints, words

intList, wordList = splitList(line.split())

Will give you two lists: [100, 20] and ['the', 'birds', 'are', 'flying']

answered Jan 21, 2012 at 2:43

dave

12.9k10 gold badges45 silver badges60 bronze badges

Comments

rodion · Accepted Answer · 2012-01-21 03:10:02Z

2

Here's a simple solution. Note it might not be as efficient as others for very large files, because it iterates over word two times for each line.

words = line.split()
intList = [int(x) for x in words if x.isdigit()]
strList = [x for x in words if not x.isdigit()]

answered Jan 21, 2012 at 3:10

rodion

15.1k4 gold badges57 silver badges55 bronze badges

Comments

Rob Wouters · Accepted Answer · 2012-01-21 02:43:36Z

0

pop removes the element from the list and returns it:

words = line.split()
first = int(words.pop(0))
second = int(words.pop(0))

This is of course assuming your format is always int int word word word ....

And then join the rest of the string:

words = ' '.join(words)

And in Python 3 you can even do this:

first, second, *words = line.split()

Which is pretty neat. Although you would still have to convert first and second to int's.

edited Jan 21, 2012 at 2:43

answered Jan 21, 2012 at 2:38

Rob Wouters

16.4k3 gold badges44 silver badges36 bronze badges

3 Comments

RanRag Over a year ago

your answer is ok. But what about a more generic solution.The case were we dont't know the occurrence of string and integer in file. Like for eg hello 1 2 check in this case your solution will not work

Rob Wouters Over a year ago

@RanRag, true. I edited in that assumption after reading other answers, the question isn't clear in that regard. If it is always in this format I think the Python 3 one-liner is the way to go though. Assuming he's using Python 3 of course.

RanRag Over a year ago

Yeah, in that case the one-liner is the best solution.

Collectives™ on Stack Overflow

How to read a text file into separate lists python

4 Answers 4

3 Comments

Comments

Comments

3 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

3 Comments

Comments

Comments

3 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related