I tried to read my dataset in text file format using pandas. However, some characters are not encoded correctly. I got ??? for apostrophe.
What should I do to encode my file correctly? I've tried
encoding = "utf8"but I gotUnicodeDecodeError: 'utf8' codec can't decode byte 0xc3 in position 2044: unexpected end of data.encoding = "latin1"but this gave me a lot of ???encoding = "ISO-8859-1" or "ISO-8859-2"but this also gave me just like no encoding...
When I open my data in sublime, I got this character ’.
UPDATED: But when I access the entry using loc I got something like \u0102\u02d8\xe2\x82\u0179\xc2\u015, \u0102\u02d8\xe2\x82\u0179\xe2\x84\u02d8