TypeError: coercing to Unicode: need string or buffer, file found in python

Question

I want to stem words, so I imported PorterStemmer from nltk, but an error occurred at run time.

The error is:

TypeError: coercing to Unicode: need string or buffer, file found

My Python code is

  import nltk;     
  from nltk.stem import PorterStemmer  
  stemmer=PorterStemmer()  
  file = open('C:/Python26/test.txt','r')  
  f=open("root.txt",'w')  
  with open(file,'r',-1) as rf:  
    lines = rf.readlines()  
    for word in lines:  
        root = stemmer.stem(word)  
        f.write(root+"\n")  
    f.close()

Yes, I tried it and got an error which I couldn't understand. The output was:

  1.6.2
  Traceback (most recent call last):
    File "C:\Python26\check.py", line 10, in <module>
      with open(file,'r',-1) as rf:
  UnicodeDecodeError: 'ascii' codec can't decode byte 0xf8 in position 6: ordinal not in range(128)

My code after your recommended change is:

import nltk;
import numpy;
import numpy as np
from StringIO import StringIO
print numpy.__version__
from nltk.stem import PorterStemmer  
stemmer=PorterStemmer()  
file = np.genfromtxt('C:/Python26/test.txt', delimiter=" ")  
f=open("root.txt",'w')  
with open(file,'r',-1) as rf:
    lines = rf.readlines()  
    for word in lines:  
        root = stemmer.stem(word)  
        f.write(root+"\n")  
    f.close()

and my dummy file is like this:

walking
talked
oranges
books
Src
Src
mAB

Eli Korvigo · Accepted Answer · 2015-03-12 07:50:59Z

2

You have already opened the file, and you are then passing the resulting file object, rather than a path, to with open(...). Remove the file = open('C:/... line.
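To illustrate the point, here is a minimal sketch (not the asker's script) showing that open() expects a path and rejects an already-open file object. The temporary file and the names `path`, `handle` and `error_message` are made up for the demonstration, and the exact wording of the exception differs between Python versions.

```python
import os
import tempfile

# Create a throwaway file so the example does not depend on C:/Python26/test.txt.
fd, path = tempfile.mkstemp(suffix=".txt")
os.close(fd)

handle = open(path, "r")      # first open(): returns a file object
error_message = None
try:
    open(handle, "r")         # second open(): a file object is not a path
except TypeError as exc:
    error_message = str(exc)
finally:
    handle.close()
    os.remove(path)

print(error_message)
```

Passing the path string to a single open() call avoids the error.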

P.S. You will be iterating over lines, not words.
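As a rough sketch of what that means in practice: each item from readlines() still carries its trailing newline, and a line may hold several words, so the stemmer likely never sees a clean word. The helper `stem_lines` and the stand-in `toy_stem` below are hypothetical names invented for this example; `toy_stem` only exists so the snippet runs without nltk.

```python
def stem_lines(lines, stem):
    """Return one stemmed word per whitespace-separated token in lines."""
    roots = []
    for line in lines:
        # split() with no arguments drops the trailing newline and
        # breaks a line such as "talked books\n" into separate words.
        for word in line.split():
            roots.append(stem(word))
    return roots


def toy_stem(word):
    # Hypothetical stand-in for PorterStemmer().stem so the sketch runs
    # without nltk; it only strips a trailing "ing".
    return word[:-3] if word.endswith("ing") else word


lines = ["walking\n", "talked books\n"]
print(stem_lines(lines, toy_stem))
```

With nltk installed, passing PorterStemmer().stem as the `stem` argument should apply the real stemmer to each word.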

edited Mar 12, 2015 at 7:50

answered Mar 12, 2015 at 6:43


ca_san · Accepted Answer · 2015-03-12 06:45:35Z

1

It seems that the problem is with the parameters passed to a function, and I'm guessing it's in the line root = stemmer.stem(word).

Try using the function genfromtxt instead of open():

>>> import numpy as np
>>> from StringIO import StringIO
>>> np.genfromtxt('C:/Python26/test.txt', delimiter=",") #Whatever delimiter your file has.

That should fix the problem.

answered Mar 12, 2015 at 6:45


7 Comments

Shaheen Gul Over a year ago

I tried Anthon's code. It did not raise any error, but it did not stem the words.

ca_san Over a year ago

@ShaheenGul did you try mine?

Shaheen Gul Over a year ago

Yes, I tried it and got an error which I couldn't understand, and the error was

ca_san Over a year ago

Have you declared the coding at the start of your document? Put this at the start of your file and see if the error persists: # -*- coding: UTF-8 -*-

Shaheen Gul Over a year ago

What does this mean? I can't understand it. Anyhow, I also did this, but I get the same result when I add # -*- coding: UTF-8 -*- (as # is used for commenting out text), but when I remove this, the error was encountered.
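For context, the `# -*- coding: UTF-8 -*-` line is a source-encoding declaration: it tells Python how to decode the script file itself and does not change how data files are read. Below is a minimal sketch of reading a text file with an explicit encoding. It assumes the data is UTF-8, which the thread does not confirm, and the temporary file and its contents are invented for the demo.

```python
# -*- coding: utf-8 -*-
import io
import os
import tempfile

# Throwaway file standing in for the real data file.
fd, path = tempfile.mkstemp(suffix=".txt")
os.close(fd)

# Write a word containing a non-ASCII character (U+00F8), encoded as UTF-8.
with io.open(path, "w", encoding="utf-8") as out:
    out.write(u"s\u00f8ster\n")

# io.open decodes bytes to text using the encoding given here,
# independently of the coding comment at the top of the script.
with io.open(path, "r", encoding="utf-8") as src:
    words = [line.strip() for line in src]

os.remove(path)
print(words == [u"s\u00f8ster"])
```

If the real file uses a different encoding, that name would go in the `encoding` argument instead.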


Anthon · Accepted Answer · 2015-03-12 09:03:42Z

1

You open the file in line 4 and then use the resulting file object as the filename for another open() in line 6. Just do:

import nltk;     
from nltk.stem import PorterStemmer  
stemmer=PorterStemmer()  
with open("root.txt",'w') as f:
    with open('C:/Python26/test.txt','r',-1) as rf:  
      lines = rf.readlines()  
      for word in lines:  
          root = stemmer.stem(word)  
          f.write(root+"\n")

edited Mar 12, 2015 at 9:03

answered Mar 12, 2015 at 6:42

