nested for loop in python not working

Question

We basically have a large xcel file and what im trying to do is create a list that has the maximum and minimum values of each column. there are 13 columns which is why the while loop should stop once it hits 14. the problem is once the counter is increased it does not seem to iterate through the for loop once. Or more explicitly,the while loop only goes through the for loop once yet it does seem to loop in that it increases the counter by 1 and stops at 14. it should be noted that the rows in the input file are strings of numbers which is why I convert them to tuples and than check to see if the value in the given position is greater than the column_max or smaller than the column_min. if so I reassign either column_max or column_min.Once this is completed the column_max and column_min are appended to a list( l ) andthe counter,(position), is increased to repeat the next column. Any help will be appreciated.

input_file = open('names.csv','r')
l= []  
column_max = 0
column_min = 0
counter = 0
while counter<14:
    for row in input_file:
        row = row.strip()
        row = row.split(',')
        row = tuple(row)
        if (float(row[counter]))>column_max:
            column_max = float(row[counter])  
        elif (float(row[counter]))<column_min:
            column_min = float(row[counter])    
        else:
            column_min=column_min
            column_max = column_max
    l.append((column_max,column_min))
    counter = counter + 1

Use for i in range(14) instead of a while loop. Also, you might want to use csvreader instead of splitting by ,: csvreader will handle strings containing commas. — Waleed Khan
– Waleed Khan, Commented Oct 22, 2012 at 3:31
If there were thirteen columns, you would use 13 as your bounding value, not 14. — Chris Morgan
– Chris Morgan, Commented Oct 22, 2012 at 3:41
Instead of column_max = 0 and column_min = 0, I'd use column_max = float('-inf') and column_min = float('inf'). That way you know the maxima and minima will be correct. — Chris Morgan
– Chris Morgan, Commented Oct 22, 2012 at 3:42

Steven Rumbalski · Accepted Answer · 2012-10-22 04:03:08Z

3

I think you want to switch the order of your for and while loops.

Note that there is a slightly better way to do this:

with open('yourfile') as infile:
    #read first row.  Set column min and max to values in first row
    data = [float(x) for x in infile.readline().split(',')]
    column_maxs = data[:]
    column_mins = data[:]
    #read subsequent rows getting new min/max
    for line in infile:
        data = [float(x) for x in line.split(',')]
        for i,d in enumerate(data):
            column_maxs[i] = max(d,column_maxs[i])
            column_mins[i] = min(d,column_mins[i])

If you have enough memory to hold the file in memory at once, this becomes even easier:

with open('yourfile') as infile:
    data = [map(float,line.split(',')) for line in infile]
    data_transpose = zip(*data)
    col_mins = [min(x) for x in data_transpose]
    col_maxs = [max(x) for x in data_transpose]

edited Oct 22, 2012 at 4:03

Steven Rumbalski

45.7k10 gold badges96 silver badges125 bronze badges

answered Oct 22, 2012 at 3:31

mgilson

312k70 gold badges656 silver badges722 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

Chris Morgan Over a year ago

Merely swapping the two loops would not be enough; the row = lines would need to be taken out of the inner loop, also. But you are quite right: min and max are the correct way to do it.

Brian L Over a year ago

This takes the maximum per row, but the code in the question is designed to calculate the maximum per column. Otherwise +1.

mgilson Over a year ago

@ChrisMorgan -- Yes, the associated logic should also be moved. I assumed that was obvious enough ...

mgilson Over a year ago

@BrianL -- Good catch. Updated accordingly.

Chris Morgan Over a year ago

"I think you want to switch the order of your for and while loops": no, he doesn't... that's a relic of the transposed interpretation of the question.

Chris Morgan · Accepted Answer · 2012-10-22 03:58:16Z

1

Once you have consumed the file, it has been consumed. Thus iterating over it again won't produce anything.

>>> for row in input_file:
...     print row
1,2,3,...
4,5,6,...
etc.
>>> for row in input_file:
...     print row
>>> # Nothing gets printed, the file is consumed

That is the reason why your code is not working.

You then have three main approaches:

Read the file each time (inefficient in I/O operations);
Load it into a list (inefficient for large files, as it stores the whole file in memory);
Rework the logic to operate line by line (quite feasible and efficient, though not as brief in code as loading it all into a two-dimensional structure and transposing it and using min and max may be).

Here is my technique for the third approach:

maxima = [float('-inf')] * 13
minima = [float('inf')] * 13
with open('names.csv') as input_file:
    for row in input_file:
        for col, value in row.split(','):
            value = float(value)
            maxima[col] = max(maxima[col], value)
            minima[col] = min(minima[col], value)

# This gets the value you called ``l``
combined_max_and_min = zip(maxima, minima)

edited Oct 22, 2012 at 3:58

answered Oct 22, 2012 at 3:51

Chris Morgan

91.6k28 gold badges217 silver badges220 bronze badges

2 Comments

mgilson Over a year ago

My only beef with this is that you essentially hard-code the number of columns whereas my version of this implementation doesn't hard-code the number of columns. (also, no need to row.strip() as float doesn't care about whitespace).

Chris Morgan Over a year ago

@mgilson: it could be handled without that hardcoding, but it's uglier. For reference, that would be something like float('-inf') if col > len(maxima) else maxima[col]. Either that or duplicating the row-reading code. All in all, I think hard-coding the value will typically (though not always) be more suitable. As for the row.strip() business, I thought about it but left it in through laziness. You have now prompted me to remove it.

Collectives™ on Stack Overflow

nested for loop in python not working

2 Answers 2

5 Comments

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

5 Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related