string sorting csv row

Question

import pandas as pd

rawDF = pd.read_csv('D:\Project\python\Grade\GradeDataRaw.csv',names=['GradeCol'])

filteredDF = rawDF[rawDF['GradeCol'].str.contains('EVCS:|BVCS:|LOW POINT STA')]
print(filteredDF)

filename = 'GradeOut.csv'

filteredDF.to_csv(filename,index=False, encoding='utf-8')

Output in CSV file is

GradeCol

EVCS: 210+080.907

BVCS: 210+080.907

LOW POINT STA =208+108.133\PLOW POINT ELEV = 66.849\PPVI STA = 209+126.315\PPVI ELEV = 66.762\PA.D = 1.413%\PK

LOW POINT STA =208+108.133\PLOW POINT ELEV = 66.849\PPVI STA = 209+126.000\PPVI ELEV = 66.762\PA.D = 1.413%\PK

Would like to have only "PPVI STA = 209+126.315" in data frame row where there is this string available, other rows with EVCS & BVCS to remain intact, numerical part can vary in every row. With the extract method getting NaN values in the rows where the is no match , that is not the intention.

What is your desired output? do you want to order all the rows? — Kelvin
– Kelvin, Commented Jul 15, 2017 at 15:47
"info \GPK HEK = 209+126.315\info ends here" - is it the whole string/row or just one column in the row? — MaxU - stand with Ukraine
– MaxU - stand with Ukraine, Commented Jul 15, 2017 at 15:52
hello guys , hope the above edit with more information helps to clarify the expected output. — Dagdoba
– Dagdoba, Commented Jul 15, 2017 at 20:16
Welcome to the site: you may want to read help center, How to Ask and minimal reproducible example, and re-word your question accordingly. — boardrider
– boardrider, Commented Jul 17, 2017 at 14:35

MaxU - stand with Ukraine · Accepted Answer · 2017-07-15 16:03:24Z

1

IIUC:

Sample DF:

In [99]: df
Out[99]:
                                                 txt
0         info \GPK HEK = 209+126.315\info ends here
1  blah-blah-blah GPK HEK = 1 + 2.33333end of string

Solution:

In [100]: df['txt'].str.extract(r'(GPK HEK\s*=\s*\d+\s*\+\s*\d+\.\d+)', expand=False)
Out[100]:
0    GPK HEK = 209+126.315
1    GPK HEK = 1 + 2.33333
Name: txt, dtype: object

answered Jul 15, 2017 at 16:03

MaxU - stand with Ukraine

212k37 gold badges402 silver badges436 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Dagdoba Over a year ago

hello MaxU, kindly have a look at the new edited information.

Banach Tarski · Accepted Answer · 2017-07-15 15:54:09Z

0

This should do the job.

def parse(string):
    start = string.find('\\') + 1
    end   = string.find('.')

    while string[end] != '\\':
        end += 1

    return string[start : end]

answered Jul 15, 2017 at 15:54

Banach Tarski

1,82917 silver badges36 bronze badges

Collectives™ on Stack Overflow

string sorting csv row

2 Answers 2

1 Comment

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Related