pandas datetime to unix timestamp seconds

Question

The official documentation of pandas.to_datetime says:

unit : string, default ‘ns’

unit of the arg (D,s,ms,us,ns) denote the unit, which is an integer or float number. This will be based off the origin. Example, with unit=’ms’ and origin=’unix’ (the default), this would calculate the number of milliseconds to the unix epoch start.

So when I try it this way:

import pandas as pd
df = pd.DataFrame({'time': [pd.to_datetime('2019-01-15 13:25:43')]})
df_unix_sec = pd.to_datetime(df['time'], unit='ms', origin='unix')
print(df)
print(df_unix_sec)

                 time
0   2019-01-15 13:25:43
0   2019-01-15 13:25:43
Name: time, dtype: datetime64[ns]

The output does not change for the second one: it still shows the datetime value, not the number of milliseconds since the start of the Unix epoch. Why is that? Am I missing something?

cs95 · Accepted Answer · 2022-02-10 04:32:00Z

147

I think you misunderstood what the argument is for. The purpose of origin='unix' is to convert an integer timestamp to datetime, not the other way.

pd.to_datetime(1.547559e+09, unit='s', origin='unix') 
# Timestamp('2019-01-15 13:30:00')
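
To illustrate the direction of the conversion, here is a minimal sketch (the variable names are only for illustration). An integer count of seconds is turned into a Timestamp, while input that is already datetime64 comes back unchanged, which matches what the question observed.

```python
import pandas as pd

# Integer seconds since the epoch -> datetime (this is what unit/origin are for).
ts = pd.to_datetime(1547559000, unit='s', origin='unix')
print(ts)  # 2019-01-15 13:30:00

# Input that is already datetime64 is returned as datetimes; unit is not applied.
already = pd.Series(pd.to_datetime(['2019-01-15 13:25:43']))
roundtrip = pd.to_datetime(already, unit='ms', origin='unix')
print(roundtrip.equals(already))
```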

Here are some options:

Option 1: integer division

Conversely, you can get the timestamp by converting to a 64-bit integer (which gives nanoseconds since the epoch) and dividing by 10⁹.

pd.to_datetime(['2019-01-15 13:30:00']).astype('int64') / 10**9
# Float64Index([1547559000.0], dtype='float64')

Pros:

super fast

Cons:

makes assumptions about how pandas internally stores dates
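
If relying on nanosecond storage is a concern, one possible variant (not one of the options listed in this answer) is to let NumPy cast to second resolution before taking the integer values. This is only a sketch, and it assumes timezone-naive input that represents UTC.

```python
import pandas as pd

dates = pd.to_datetime(['2019-01-15 13:30:00'])

# Cast the underlying NumPy array to second resolution first, then to int64.
# The unit cast drops any sub-second part instead of rounding it.
unix_sec = dates.to_numpy().astype('datetime64[s]').astype('int64')
print(unix_sec)  # [1547559000]
```

Because the unit is stated explicitly in the cast, the result does not depend on which resolution pandas happened to use for the index.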

Option 2: recommended by pandas

Pandas docs recommend using the following method:

# create test data
dates = pd.to_datetime(['2019-01-15 13:30:00'])

# calculate unix datetime
(dates - pd.Timestamp("1970-01-01")) // pd.Timedelta('1s')

[out]:
Int64Index([1547559000], dtype='int64')

Pros:

"idiomatic", recommended by the library

Cons:

unwieldy
not as performant as integer division
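
Applied to the DataFrame from the question, the same subtraction could look like the sketch below. The column names unix_sec and unix_ms are made up for illustration, and the datetimes are assumed to be timezone-naive UTC.

```python
import pandas as pd

df = pd.DataFrame({'time': [pd.to_datetime('2019-01-15 13:25:43')]})

# Subtract the epoch to get a timedelta column, then floor-divide by the unit you want.
epoch = pd.Timestamp('1970-01-01')
df['unix_sec'] = (df['time'] - epoch) // pd.Timedelta('1s')
df['unix_ms'] = (df['time'] - epoch) // pd.Timedelta('1ms')

print(df['unix_sec'].iloc[0])  # 1547558743
print(df['unix_ms'].iloc[0])   # 1547558743000
```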

Option 3: `pd.Timestamp`

If you have a single date string, you can use pd.Timestamp as shown in the other answer:

pd.Timestamp('2019-01-15 13:30:00').timestamp()
# 1547559000.0

If you have to coerce multiple datetimes (where pd.to_datetime is your only option), you can initialize and map:

pd.to_datetime(['2019-01-15 13:30:00']).map(pd.Timestamp.timestamp)
# Float64Index([1547559000.0], dtype='float64')

Pros:

best method for a single datetime string
easy to remember

Cons:

not as performant as integer division

edited Feb 10, 2022 at 4:32

answered Jan 22, 2019 at 17:26

cs95


4 Comments

AstroFloyd Over a year ago

Note that pandas now recommends .view() instead of .astype() for method 1. This method works with a (timezone-aware) DatetimeIndex as well, unless daylight saving time starts or ends in that time span. In that case, I get TypeError: Cannot change data-type for object array. Converting to UTC fixes this.

Wesley Cheek Over a year ago

In option 1 you may need to cast with astype("int64"). With astype(int) I get TypeError: Converting from datetime64[ns] to int32 is not supported. Do obj.astype('int64').astype(dtype) instead

Matthias Luh Over a year ago

Regarding option 2: If you want to convert a timezone-aware datetime to a Unix timestamp, you will get an error "TypeError: Cannot subtract tz-naive and tz-aware datetime-like objects". The solution is to use (dates - pd.Timestamp("1970-01-01", tz='UTC')) // pd.Timedelta("1s")

ciaran haines May 19 at 14:21

... or for auto handling if you don't know the time zone or naivety: (dates - pd.Timestamp("1970-01-01", tz=dates.dt.tz)) // pd.Timedelta("1s") where the Series.dt.tz returns None if it's a naive series
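
The timezone handling discussed in these comments can be sketched as a small helper. The name to_unix_seconds is hypothetical, and this version deliberately uses a UTC epoch for any timezone-aware input, so the result is seconds since 1970-01-01 00:00:00 UTC rather than since local midnight. Naive input is assumed to already be in UTC.

```python
from datetime import timedelta, timezone

import pandas as pd


def to_unix_seconds(s: pd.Series) -> pd.Series:
    """Hypothetical helper: whole seconds since the Unix epoch for a datetime Series."""
    # A tz-aware Series needs a tz-aware epoch; a naive Series needs a naive one.
    epoch = pd.Timestamp("1970-01-01", tz="UTC" if s.dt.tz is not None else None)
    return (s - epoch) // pd.Timedelta("1s")


naive = pd.Series(pd.to_datetime(["2019-01-15 13:30:00"]))
# Same wall-clock time, but tagged with a fixed UTC+01:00 offset.
aware = naive.dt.tz_localize(timezone(timedelta(hours=1)))

print(to_unix_seconds(naive).iloc[0])  # 1547559000
print(to_unix_seconds(aware).iloc[0])  # 1547555400
```

The aware result is 3600 seconds smaller because 13:30 at UTC+01:00 is 12:30 UTC.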

taranarmo · Accepted Answer · 2021-05-18 10:23:11Z

30

You can use timestamp() method which returns POSIX timestamp as float:

pd.Timestamp('2021-04-01').timestamp()

[Out]:
1617235200.0

pd.Timestamp('2021-04-01 00:02:35.234').timestamp()

[Out]:
1617235355.234

edited May 18, 2021 at 10:23

answered Apr 29, 2021 at 11:57

taranarmo


karan rajagopalan · Accepted Answer · 2021-03-24 08:25:59Z

3

The value attribute of a pandas Timestamp holds the time elapsed since the Unix epoch, in nanoseconds. So you can convert to µs, ms or s by dividing by 1e3, 1e6 or 1e9 respectively. Check the code below.

import pandas as pd
date_1 = pd.to_datetime('2020-07-18 18:50:00')
print(date_1.value)
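
As a rough sketch of the unit conversions, integer floor division keeps the results as exact integers. The variable name ns is only for illustration.

```python
import pandas as pd

date_1 = pd.to_datetime('2020-07-18 18:50:00')
ns = date_1.value      # nanoseconds since the Unix epoch

print(ns // 10**3)     # microseconds
print(ns // 10**6)     # milliseconds
print(ns // 10**9)     # seconds: 1595098200
```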

answered Mar 24, 2021 at 8:25

karan rajagopalan


cottontail · Accepted Answer · 2022-10-19 22:14:17Z

When you calculate the difference between two datetimes, the dtype of the difference is timedelta64[ns] by default (ns in brackets). By changing [ns] into [ms], [s], [m] etc as you cast the output to a new timedelta64 object, you can convert the difference into milliseconds, seconds, minutes etc.

For example, to find the number of seconds passed since Unix epoch, subtract datetimes and change dtype.

df_unix_sec = (df['time'] - pd.Timestamp('1970-01-01')).astype('timedelta64[s]')

N.B. Oftentimes, the differences are very large numbers, so if you want them as integers, use astype('int64') (NOT astype(int)).

df_unix_sec = (df['time'] - pd.Timestamp('1970-01-01')).astype('timedelta64[s]').astype('int64')

For OP's example, this would yield,

0    1547558743
Name: time, dtype: int64

Red · Accepted Answer · 2020-07-25 19:51:53Z

-3

In case you are accessing a particular datetime64 object from the dataframe, chances are that pandas will return a Timestamp object which is essentially how pandas stores datetime64 objects.

You can use pd.Timestamp.to_datetime64() method of the pd.Timestamp object to convert it to numpy.datetime64 object with ns precision.

edited Jul 25, 2020 at 19:51

Red


answered Jul 25, 2020 at 18:55

sakgak


1 Comment

above_c_level Over a year ago

Welcome to SO! Thank you for your time in answering this question. Please read the question of the OP carefully. Does your solution answer the question better/different than the accepted answer?
