74

What is the method to convert a Python list of strings to a pd.Series object?

(pandas Series objects can be converted to list using tolist() method--but how to do the reverse conversion?)

2
  • @smci this is really embarrassing for me, a question from the time when I was a newbie. You have edited it already, looks good now. Commented Apr 25, 2018 at 10:53
  • 4
    HypotheticalNinja: it's a perfectly good canonical question on an important basic topic. Commented Apr 25, 2018 at 22:27

4 Answers 4

55

I understand that your list is in fact a list of lists

import pandas as pd

thelist = [ ['sentence 1'], ['sentence 2'], ['sentence 3'] ]
df = pd.Series( (v[0] for v in thelist) )
Sign up to request clarification or add additional context in comments.

3 Comments

from your edits and comments, I understand that the list you're talking about is a list of lists. you have to make it 1D to make the Series. I edited my post to show how to do that with a generator.
it was simple.. df = pd.Series(data) .. automatically converted the whole text into a dataframe object.. thanks.. you can edit your post and include this too,for others to benefit.. :)
ok, still not sure about what a sentence is in your case, but I'm happy I could help anyway :-) - cheers
43

To convert the list myList to a Pandas series use:

mySeries = pd.Series(myList) 

This is also one of the basic ways for creating a series from a list in Pandas.

Example:

myList = ['string1', 'string2', 'string3']                                                                                                                
mySeries = pd.Series(myList)                                                                                                                             
mySeries                                                                                                                                                 
# Out: 
# 0    string1
# 1    string2
# 2    string3
# dtype: object

Note that Pandas will guess the data type of the elements of the list because a series doesn't admit mixed types (contrary to Python lists). In the example above the inferred datatype was object (the Python string) because it's the most general and can accommodate all other data types (see data types).

It's possible to specify a data type when creating a series:

myList= [1, 2, 3] 

# inferred data type is integer
pd.Series(myList).dtype                                                                                                                        
# Out:
# dtype('int64')

myList= ['1', 2, 3]                                                                                                                                     

# data type is object  
pd.Series(myList).dtype                                                                                                                                                                                                                                                                
# Out: 
# dtype('O')

One can specify dtype as integer:

myList= ['1', 2.2, '3']
mySeries = pd.Series(myList, dtype='int')  
mySeries.dtype                                                                                                                                 
# Out:
# dtype('int64')

But this will work only if all elements in the list can be casted to the desired data type.

Comments

11
import pandas as pd
sentence_list = ['sentence 1', 'sentence 2', 'sentence 3', 'sentence 4']
print("List of Sentences: \n", sentence_list)
sentence_series = pd.Series(sentence_list)
print("Series of Sentences: \n", sentence_series)

Documentation

Even if sentence_list is a list of list, this code still converts a list to Pandas Series object.

Comments

2

pd.Series(l) actually works on almost any type of list and it returns Series object:

import pandas as pd
l = [ ['sentence 1'], ['sentence 2'], ['sentence 3'] ] #works
l = ['sentence 1', 'sentence 2', 'sentence 3'] #works
l = numpy.array(['sentance 1', 'sentance2', 'sentance3'], dtype='object') #works

print(l, type(l))
ds = pd.Series(l)
print(ds, type(ds))

0    sentence 1
1    sentence 2
2    sentence 3
dtype: object <class 'pandas.core.series.Series'>

Comments

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.