getting type error : expected string or bytes-like object

Question

I am facing a challenge where I am trying to clean a column in my dataset using the regular expression in python. The column is of object type and when I am using the below code I am getting this error: expected string or bytes-like object

import re 
def clean_str(string):
    """
    Tokenization/string cleaning for dataset
    Every dataset is lower cased except
    """
    string = re.sub(r"\n", "", string)    
    string = re.sub(r"\r", "", string) 
    string = re.sub(r"[0-9]", "digit", string)
    string = re.sub(r"\'", "", string)   
    string = re.sub(r"\"", "", string)    
    return string.strip().lower()
X = []
for i in range(df.shape[0]):
    X.append(clean_str(df.iloc[i][1])) #0,1,2,3
y = np.array(df["Standardpositionsname"])

Please indent your code correctly. As it stands that code is unreadable. — Mihai Chelaru
– Mihai Chelaru, Commented Aug 5, 2019 at 15:26
Oh gosh no. It was better the other way. Is there a line number in the error message? — Martin
– Martin, Commented Aug 5, 2019 at 15:32

Mohammad Ansari · Accepted Answer · 2019-08-05 15:36:21Z

2

I Think in X.append(clean_str(df.iloc[i][1])) you must convert parameter to string type like this

X.append(clean_str(str(df.iloc[i][1])))

answered Aug 5, 2019 at 15:36

Mohammad Ansari

1,5591 gold badge16 silver badges25 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

getting type error : expected string or bytes-like object

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related