将字符串日期分割成多个日期列表

问题描述 投票:2回答:4

我有以下列表,我正在尝试将元素相应地拆分为多个“日期”,我想编写一个函数来执行此操作,我不确定正则表达式是否可以使用datetime

x=["2- 7 MAY, 2020, 10-12 JUN, 2014","7 February, 2020, 6 February, 2020, 26 October, 2018","16 JUN, 2020, 24 JUL, 2020, 28 FEB, 2020, 15 SEPT, 2020, 8-11 MAY, 2023, 22 OCT, 2020","14 JUN, 2020"]

for i in x:
    temp=my_func(i)
    if len(temp)==1:
        date1=temp[0]
        date2=""
    elif len(temp)>=2:
        date1=temp[0]
        date2=temp[1]
    else:
        continue
    #rest of my code

这里是my_func的预期输出

#my_func(x[0])=["2- 7 MAY, 2020", "10-12 JUN, 2014"]
#my_func([x[1]])=["7 February, 2020", "6 February, 2020", "26 October, 2018"]
#my_func(x[-1])=["14 JUN, 2020"]
python
4个回答
1
投票

根据您的示例,

import re
for i in x:
    temp =re.findall('\d.*?\d{4}',i)
#output
['2- 7 MAY, 2020', '10-12 JUN, 2014']
['7 February, 2020', '6 February, 2020', '26 October, 2018']
['16 JUN, 2020', '24 JUL, 2020', '28 FEB, 2020', '15 SEPT, 2020', '8-11 MAY, 2023', '22 OCT, 2020']
['14 JUN, 2020']

1
投票

','分割字符串。零件总有偶数。 2个连续的部分构成一个日期。因此,只需将两个部分连接起来以形成日期字符串即可。

re很好,但这也应该:

>>> x = ["2- 7 MAY, 2020, 10-12 JUN, 2014","7 February, 2020, 6 February, 2020, 26 October, 2018","16 JUN, 2020, 24 JUL, 2020, 28 FEB, 2020, 15 SEPT, 2020, 8-11 MAY, 2023, 22 OCT, 2020","14 JUN, 2020"]
>>> result = []
>>> for s in x:
        parts = s.split(',')
        result.append([','.join(parts[i:i+2]).strip() for i in range(0,len(parts),2)])
>>> result
[['2- 7 MAY, 2020', '10-12 JUN, 2014'], 
 ['7 February, 2020', '6 February, 2020', '26 October, 2018'], 
 ['16 JUN, 2020', '24 JUL, 2020', '28 FEB, 2020', '15 SEPT, 2020', '8-11 MAY, 2023', '22 OCT, 2020'], 
 ['14 JUN, 2020']
]

您的my_func将简单为:

>>> def my_func(s):
        parts = s.split(',')
        return [','.join(parts[i:i+2]).strip() for i in range(0,len(parts),2)]

0
投票

使用了全局Varibale,但是输出如您所愿。该程序取决于日期格式的方式。但是使用这种格式可以正常工作。

dates=["2- 7 MAY, 2020, 10-12 JUN, 2014","7 February, 2020, 6 February, 2020, 26 October, 2018","16 JUN, 2020, 24 JUL, 2020, 28 FEB, 2020, 15 SEPT, 2020, 8-11 MAY, 2023, 22 OCT, 2020","14 JUN, 2020"]

def splitdates(date):
    if type(date) is int:
        tosplit = str(dates[date])
    else:
        tosplit = date
    month = ["J" , "j" , "f" , "F" , "m" , "M" , "A" , "a" , "s" , "S" ,"n" , "N" , "o" , "O" , "D" , "d"]
    for item, character in enumerate(tosplit):
        if character in month:
            for item2, character in enumerate(tosplit[item+1:]):
                if character.startswith(","):
                    for item3, character in enumerate(tosplit[item+item2+2:]):
                        if character.startswith(","):
                            global newdate
                            newdate.append(tosplit[:item + item2 + item3 + 3])
                            nextPart = tosplit[item + item2 + item3 + 3:]

                            if nextPart.endswith(";"):
                                newPart = nextPart
                                splitdates(newPart)

                            else:
                                newdate.append(tosplit[item+item2+item3+3:])
                                return newdate

                    newdate.append(tosplit[:item+item2+item3])
                    return newdate


for x in range(len(dates)):
    newdate = []
    print("Date: ",splitdates(x)[enter image description here][1])

输出为:

Date:  ['2- 7 MAY, 2020,', ' 10-12 JUN, 2014']
Date:  ['7 February, 2020,', ' 6 February, 2020, 26 October, 2018']
Date:  ['16 JUN, 2020,', ' 24 JUL, 2020, 28 FEB, 2020, 15 SEPT, 2020, 8-11 MAY, 2023, 22 OCT, 2020']
Date:  ['14 JUN, 2']

-2
投票
x=["2- 7 MAY, 2020, 10-12 JUN, 2014","7 February, 2020, 6 February, 2020, 26 October, 
  2018","16 JUN, 2020, 24 JUL, 2020, 28 FEB, 2020, 15 SEPT, 2020, 8-11 MAY, 2023, 22 
  OCT, 2020","14 JUN, 2020"]
result = []
for i in range(0,len(x)):
  result.append([x[i]])
© www.soinside.com 2019 - 2024. All rights reserved.