'int' object is not subscriptable. Pandas

Question

I have dataset df. within this dataset I have column Gross I am completely new to Python,

I am trying to convert this column to float and display sum()

dollarGross = lambda x: float(x[1:-1])
df.Gross = df.Gross.apply(dollarGross)
df.Gross.sum()

But I am getting this error:

<ipython-input-294-a9010792122a> in <lambda>(x)
----> 1 dollarGross = lambda x: float(x[1:-1])
      2 df.Gross = df.Gross.apply(dollarGross)
      3 df.Gross.sum()

TypeError: 'int' object is not subscriptable

What am I missing?

what is x[1:-1] supposed to do in your lambda function? It looks to me like you're trying to do string operations on an integer column ... If that's the case, then you can probably do df.Gross.sum() directly. — mgilson
– mgilson, Commented May 26, 2017 at 16:26
I thought since I am accessing csv file all columns are strings — Serdia
– Serdia, Commented May 26, 2017 at 16:29

piRSquared · Accepted Answer · 2017-05-26 17:20:42Z

Your error starts here:

df.Gross.apply(dollarGross)

df.Gross is a pandas.Series and when you use the apply method, pandas iterates through each member of the series and passes that member to the "callable" (also known as a function, more on this in a bit) named dollarGross. The critical thing to understand is what the members of the pandas.Series are. In this case, they are integers. So each integer in the series gets passed to dollarGross and gets called like this:

dollarGross(184)

This in turn looks like this:

float(184[1:-1])

Which makes no sense. You are trying to use [1:-1] which is subscripting/slicing syntax on an integer. And that is what the error is telling you: Hey, you can't subscript an integer!

That is why it's good to tell us what you are trying to do. Because now we can help you do that. Remember I said you can pass a "callable" to apply. Well, float is the name of the class of float objects... It's also a "callable" because we can do this float(184). So....

df.Gross.apply(float)

Should get things done. However, it's still probably better to do this

df.Gross.astype(float)

Or, if some of the members of df.Gross cannot be interpreted as a float value, it's probable better to use @MaxU's answer.

MaxU - stand with Ukraine · Accepted Answer · 2017-05-26 16:35:33Z

3

AFAIK pd.to_numeric() method provides us the most idiomatic way to convert strings to numerical values:

df['Gross'] = pd.to_numeric(df['Gross'], errors='coerce')
print(df['Gross'].sum())

answered May 26, 2017 at 16:35

MaxU - stand with Ukraine

212k37 gold badges402 silver badges436 bronze badges

Comments

knurzl · Accepted Answer · 2017-05-26 16:31:55Z

2

I think you just have to write dollarGross = lambda x: float(x). If you use square brackets you try to access an array.

answered May 26, 2017 at 16:31

knurzl

3533 silver badges12 bronze badges

Comments

Dest · Accepted Answer · 2017-05-26 16:34:08Z

0

I think you should separate the columns using

dollarGross = df['Gross'] #I defined a new array to store the Gross Values 
print(dollarGross.sum())

answered May 26, 2017 at 16:34

Dest

437 bronze badges

Collectives™ on Stack Overflow

'int' object is not subscriptable. Pandas

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related