如何从一个简单的字符串构造一个 timedelta 对象

我正在编写一个函数,它需要将字符串解析为 timedelta。用户必须输入类似于 "32m""2h32m",甚至 "4:13""5hr34m56s"... 是否有一个库或者已经实现了这种类型的东西?

117657 次浏览

对于第一种格式(5hr34m56s) ,应该使用正则表达式进行解析

以下是重新设计的解决方案:

import re
from datetime import timedelta




regex = re.compile(r'((?P<hours>\d+?)hr)?((?P<minutes>\d+?)m)?((?P<seconds>\d+?)s)?')




def parse_time(time_str):
parts = regex.match(time_str)
if not parts:
return
parts = parts.groupdict()
time_params = {}
for name, param in parts.items():
if param:
time_params[name] = int(param)
return timedelta(**time_params)




>>> from parse_time import parse_time
>>> parse_time('12hr')
datetime.timedelta(0, 43200)
>>> parse_time('12hr5m10s')
datetime.timedelta(0, 43510)
>>> parse_time('12hr10s')
datetime.timedelta(0, 43210)
>>> parse_time('10s')
datetime.timedelta(0, 10)
>>>

对我来说,最优雅的解决方案是使用 约会时间强大的 strptime字符串解析方法,而不必求助于诸如 约会之类的外部库或手动解析输入。

from datetime import datetime, timedelta
# we specify the input and the format...
t = datetime.strptime("05:20:25","%H:%M:%S")
# ...and use datetime's hour, min and sec properties to build a timedelta
delta = timedelta(hours=t.hour, minutes=t.minute, seconds=t.second)

在这之后,你可以像平常一样使用你的 timedelta 对象,把它转换成秒,以确保我们做了正确的事情等等。

print(delta)
assert(5*60*60+20*60+25 == delta.total_seconds())

昨天我有一点时间,所以我把 @ virhilo回答开发成了 Python 模块,添加了一些时间表达式格式,包括 @ priestc请求的所有格式。

源代码在 github (MIT License)上,任何人都可以使用,它也在 PyPI 上:

pip install pytimeparse

以秒数的形式返回时间:

>>> from pytimeparse.timeparse import timeparse
>>> timeparse('32m')
1920
>>> timeparse('2h32m')
9120
>>> timeparse('4:13')
253
>>> timeparse('5hr34m56s')
20096
>>> timeparse('1.2 minutes')
72

如果你使用 Python 3,那么这里是 Hari Shankar 的解决方案的更新版本,我使用的是:

from datetime import timedelta
import re


regex = re.compile(r'(?P<hours>\d+?)/'
r'(?P<minutes>\d+?)/'
r'(?P<seconds>\d+?)$')


def parse_time(time_str):
parts = regex.match(time_str)
if not parts:
return
parts = parts.groupdict()
print(parts)
time_params = {}
for name, param in parts.items():
if param:
time_params[name] = int(param)
return timedelta(**time_params)

我想输入一个时间,然后把它添加到不同的日期,所以这对我很有用:

from datetime import datetime as dtt


time_only = dtt.strptime('15:30', "%H:%M") - dtt.strptime("00:00", "%H:%M")

我对 Virhilo 的回答很好进行了一些升级:

  • 添加了一个断言,说明该字符串是一个有效的时间字符串
  • 把「人力资源」时间表改为「人力资源」时间表
  • 考虑到“ d”天指标
  • 允许非整数次数(例如 3m0.25s为3分0.25秒)

.

import re
from datetime import timedelta




regex = re.compile(r'^((?P<days>[\.\d]+?)d)?((?P<hours>[\.\d]+?)h)?((?P<minutes>[\.\d]+?)m)?((?P<seconds>[\.\d]+?)s)?$')




def parse_time(time_str):
"""
Parse a time string e.g. (2h13m) into a timedelta object.


Modified from virhilo's answer at https://stackoverflow.com/a/4628148/851699


:param time_str: A string identifying a duration.  (eg. 2h13m)
:return datetime.timedelta: A datetime.timedelta object
"""
parts = regex.match(time_str)
assert parts is not None, "Could not parse any time information from '{}'.  Examples of valid strings: '8h', '2d8h5m20s', '2m4s'".format(time_str)
time_params = {name: float(param) for name, param in parts.groupdict().items() if param}
return timedelta(**time_params)

Django 带有实用功能 parse_duration():

解析字符串并返回 datetime.timedelta

期望数据采用 "DD HH:MM:SS.uuuuuu"格式或 ISO 8601指定的格式(例如 P4DT1H15M20S等同于 4 1:15:20)或 PostgreSQL 的日间间隔格式(例如 3 days 04:05:06)。

使用 等离子体库解析 ISO 8601持续时间字符串,例如:

isodate.parse_duration('PT1H5M26S')

也请参阅 有没有一个简单的方法来转换 ISO 8601持续时间到时间三角洲?

如果你想使用: 作为分隔符,我使用这个函数:

import re
from datetime import timedelta


def timedelta_parse(value):
"""
convert input string to timedelta
"""
value = re.sub(r"[^0-9:.]", "", value)
if not value:
return


return timedelta(**{key:float(val)
for val, key in zip(value.split(":")[::-1],
("seconds", "minutes", "hours", "days"))
})

例子:

In [4]: timedelta_parse("1:0:0:1")
Out[4]: datetime.timedelta(days=1, seconds=1)


In [5]: timedelta_parse("123.5")
Out[5]: datetime.timedelta(seconds=123, microseconds=500000)


In [6]: timedelta_parse("1:6:34:9.983")
Out[6]: datetime.timedelta(days=1, seconds=23649, microseconds=983000)


In [8]: timedelta_parse("23:45:00")
Out[8]: datetime.timedelta(seconds=85500)

考虑试试 Parse _ timedelta

$ pip-run 'tempora>=4.1.1'
Collecting tempora>=4.1.1
Downloading tempora-4.1.1-py3-none-any.whl (15 kB)
Collecting jaraco.functools>=1.20
Using cached jaraco.functools-3.3.0-py3-none-any.whl (6.8 kB)
Collecting pytz
Using cached pytz-2021.1-py2.py3-none-any.whl (510 kB)
Collecting more-itertools
Using cached more_itertools-8.8.0-py3-none-any.whl (48 kB)
Installing collected packages: more-itertools, pytz, jaraco.functools, tempora
Successfully installed jaraco.functools-3.3.0 more-itertools-8.8.0 pytz-2021.1 tempora-4.1.1
Python 3.9.2 (v3.9.2:1a79785e3e, Feb 19 2021, 09:06:10)
[Clang 6.0 (clang-600.0.57)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from tempora import parse_timedelta
>>> parse_timedelta("32m")
datetime.timedelta(seconds=1920)
>>> parse_timedelta("2h32m")
datetime.timedelta(seconds=9120)
>>> parse_timedelta("4:13")
datetime.timedelta(seconds=15180)
>>> parse_timedelta("5hr34m56s")
datetime.timedelta(seconds=20096)

如果熊猫已经成为你的依赖对象,那么它在这方面做得很好:

>>> import pandas as pd
>>> pd.Timedelta('5hr34m56s')
Timedelta('0 days 05:34:56')


>>> pd.Timedelta('2h32m')
Timedelta('0 days 02:32:00')


>>> pd.Timedelta('5hr34m56s')
Timedelta('0 days 05:34:56')


>>> # It is pretty forgiving:
>>> pd.Timedelta('2 days 24:30:00 10 sec')
Timedelta('3 days 00:30:10')

如果您喜欢转换为 datetime.timedelta类型:

>>> pd.Timedelta('1 days').to_pytimedelta()
datetime.timedelta(1)

不幸的是,这并不奏效:

>>> pd.Timedelta('4:13')
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "pandas\_libs\tslibs\timedeltas.pyx", line 1217, in
pandas._libs.tslibs.timedeltas.Timedelta.__new__
File "pandas\_libs\tslibs\timedeltas.pyx", line 454, in
pandas._libs.tslibs.timedeltas.parse_timedelta_string
ValueError: expected hh:mm:ss format

熊猫实际上有相当广泛的日期和时间工具,即使这不是它的主要目的。

安装熊猫:

# If you use pip
pip install pandas


# If you use conda
conda install pandas