Create a dataframe from another data frame based on column containing string (2 defined conditions)

Question

I am trying to copy some of the columns in df1 into a new dataframe, df2.

A column should be selected if either of the following conditions holds:

the word 'COORDINATES' is in the column name
the word 'ID' is in the column name

Here is the code I tried:

import pandas as pd

df1 = pd.read_csv(csvpath)  # table as below
cols = [col for col in df1.columns if 'Coordinates' and 'ID' in col]
df2 = df1[cols]

However, the condition is only being applied to the last item in the test (in this case it extracts only the ID columns and ignores the Coordinates columns).

How do I edit the code above to include both the Coordinates and ID columns? (I could just drop the unwanted columns, but the dataset I'm dealing with is large, so I need an approach similar to the one above.)

Your help on this is much appreciated.

Original Table (df1)

Required Output(df2)

The Oracle · Accepted Answer · 2020-05-07 21:09:42Z

1

I think this should work:

cols = [col for col in df1.columns if 'Coordinates' in col or 'ID' in col]
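To see why the original test picked up only the ID columns: Python parses `'Coordinates' and 'ID' in col` as `'Coordinates' and ('ID' in col)`, and a non-empty string is always truthy, so only the `'ID'` test decides the result. Here is a minimal sketch with made-up column names (the real df1 is not shown in text form, so these are assumptions). The `DataFrame.filter(regex=...)` line is an alternative I am suggesting, not part of the question's code.

```python
import pandas as pd

# Hypothetical column names standing in for the real df1
df1 = pd.DataFrame(columns=["Name", "Site ID", "Coordinates X", "Coordinates Y", "Depth"])

# Original condition: evaluated as 'Coordinates' and ('ID' in col).
# The literal 'Coordinates' is truthy, so only the 'ID' test matters.
cols_and = [col for col in df1.columns if 'Coordinates' and 'ID' in col]

# Each substring test written out in full and joined with `or`
cols_or = [col for col in df1.columns if 'Coordinates' in col or 'ID' in col]

# Alternative: let pandas match the column labels against a regex
df2 = df1.filter(regex="Coordinates|ID")

print(cols_and)
print(cols_or)
print(list(df2.columns))
```

Both the `or` version and the `filter` call are case-sensitive, so a column named 'COORDINATES' would not match 'Coordinates'.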

edited May 7, 2020 at 21:09

The Oracle


answered May 7, 2020 at 18:41

Simon A



2 Comments

The Oracle Over a year ago

I tried that, but it takes the whole df. Any other suggestions?

Simon A Over a year ago

Are you sure? I just tried it on my computer using the same column names as you have, and it worked fine.
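One way to settle a disagreement like this is to list which keyword each column name actually matches. The sketch below uses made-up column names, since the real table is not shown in text form; it does not claim to explain why the whole dataframe was returned. It also shows how a looser, case-insensitive test can pull in columns that were not intended.

```python
import pandas as pd

# Hypothetical data standing in for the real df1
df1 = pd.DataFrame({"Name": ["a"], "Site ID": [1], "COORDINATES_X": [0.5], "Width": [2]})

keywords = ["Coordinates", "ID"]

# Case-sensitive: which keywords appear in each column name?
matches = {col: [k for k in keywords if k in col] for col in df1.columns}

# Case-insensitive: 'id' also occurs inside 'Width', so that column matches too
matches_ci = {
    col: [k for k in keywords if k.lower() in col.lower()]
    for col in df1.columns
}

for col in df1.columns:
    print(col, matches[col], matches_ci[col])
```

If every column shows a non-empty list, the condition itself is working and the keywords are simply too broad for the column names.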
