从熊猫df中提取数据到列表中

问题描述 投票:1回答:2

我有一个Pandas数据框,其标题和行包含冗余数据,并希望从中提取数据。例如,我有一个看起来像这样的df:

df = pd.DataFrame({'Your availability: Wednesday, December 25th, 2019 5:00AM-6:00AM': ['Wednesday, December 25th, 2019 5:00AM-6:00AM', pd.NaN, pd.NaN, 'Wednesday, December 25th, 2019 5:00AM-6:00AM'],
                'Your availability: Tuesday, December 10th 2019 8:00AM-5:00PM': [pd.NaN, 'Tuesday, December 10th 2019 8:00AM-5:00PM', pd.NaN, pd.NaN]})

...我想提取日期并将其放入字典中以供参考:

datetimes = {'P1': "Wednesday, December 25th, 2019 5:00AM-6:00AM", 'P2' : "Tuesday, December 10th 2019 8:00AM-5:00PM", 'P3': NaN, 'P4': "Wednesday, December 25th, 2019 5:00AM-6:00AM}
python-3.x pandas list datetime text-extraction
2个回答
0
投票

IIUC,尝试这个

df.ffill(1).iloc[:,-1].rename(lambda x: f'P{x+1}').to_dict()

Out[1159]:
{'P1': 'Wednesday, December 25th, 2019 5:00AM-6:00AM',
 'P2': 'Tuesday, December 10th 2019 8:00AM-5:00PM',
 'P3': nan,
 'P4': 'Wednesday, December 25th, 2019 5:00AM-6:00AM'}

0
投票

是您想要的吗?

df.drop_duplicates().stack().to_list()

输出:

['Wednesday, December 25th, 2019 5:00AM-6:00AM',
 'Tuesday, December 10th 2019 8:00AM-5:00PM']
© www.soinside.com 2019 - 2024. All rights reserved.