Pandas dataframe - creating datetime index from separate columns

Question

I read a csv with separate date and time columns. I generated an index from them in the following way:

data = p.read_csv(fileName,usecols=["date","time","price"])
data.set_index(["date","time"],inplace=True)

This however isn't very useful when I want to get the difference in days or hours between rows. How do I generate a single datetime index from the separate date and time columns?

You should join them together and call pd.to_datetime.

cs95
– cs95

2017-10-24 06:26:29 +00:00
Commented Oct 24, 2017 at 6:26 — cs95
– cs95, Commented Oct 24, 2017 at 6:26

jezrael · Accepted Answer · 2017-10-24 06:45:25Z

1

I think you need parameter parse_dates with nested list with both columns and parameter index_col with new column created by concanecated columns names separated by _:

data = p.read_csv(fileName,
                  usecols=["date","time","price"], 
                  parse_dates=[["date","time"]], 
                  index_col=['date_time'])

Sample:

from pandas.compat import StringIO

temp=u"""date,time,price
2015-01-01,14:00:10,7
2014-01-01,10:20:10,1"""
#after testing replace 'StringIO(temp)' to 'filename.csv'
df = pd.read_csv(StringIO(temp), 
                 usecols=["date","time","price"], 
                 parse_dates=[["date","time"]],
                 index_col=['date_time'])

print (df)
                     price
date_time                 
2015-01-01 14:00:10      7
2014-01-01 10:20:10      1

print (df.index)
DatetimeIndex(['2015-01-01 14:00:10', '2014-01-01 10:20:10'], 
               dtype='datetime64[ns]', 
               name='date_time', freq=None)

edited Oct 24, 2017 at 6:45

answered Oct 24, 2017 at 6:27

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

IamTheWalrus Over a year ago

Thanks, that works for me. Using parse_dates that way seem to significantly increase the processing time though. I have about 75,000 rows so I guess it should be expected.

Collectives™ on Stack Overflow

Pandas dataframe - creating datetime index from separate columns

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related