is invalid key Pandas Python

Question

I have a dataframe in Pandas with 729278 rows and 190 columns:

df1:

+----------+----------+----------+---+---+-----+---------+
| RULE_1_2 | RULE_2_2 | RULE_3_2 | … | … | smt | default |
+----------+----------+----------+---+---+-----+---------+
| 0        | 0        | 0        | … | … | 2   | 0       |
| 0        | 2        | 3        | … | … | 3   | 0       |
| 1        | 3        | 0        | … | … | 4   | 1       |
| …        | …        | …        | … | … | …   | …       |
+----------+----------+----------+---+---+-----+---------+

Trying to exctract all columns containing RULE and column 'default'.

Code:

df2 = df1[df1.filter(regex='RULE'), df1["default"]]

But Python says:

[729278 rows x 1 columns])' is an invalid key

All columns contain int64 type, which confirmed by df1.dtypes

What's wrong with 1 column 'default'? It doesn't appear in datamrame 'df2'. How to fix it?

jezrael · Accepted Answer · 2020-04-30 08:40:47Z

3

Idea is add another part of regex joined by | for regex or, also ^ is for start of string and $ for end of string for prevent selecting strings like some data default:

df2 = df1.filter(regex='RULE|^default$')

answered Apr 30, 2020 at 8:40

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

is invalid key Pandas Python

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related