我正在尝试对df中的时间戳重新采样以每0.1秒显示一次。它正在重新采样,但相对于那些时间戳,我正在丢失数据。
d = ({
'Time' : ['2010-07-27 09:25:31.1','2010-07-27 09:25:31.2','2010-07-27 09:25:31.4','2010-07-27 09:25:31.5','2010-07-27 09:25:31.6','2010-07-27 09:25:31.9','2010-07-27 09:25:32.0'],
'Area' : ['A','A','A','A','A','A','A',],
})
df = pd.DataFrame(data=d)
df = df.set_index('Time').asfreq('0.1S').reset_index()
输出:
Time Area
0 2010-07-27 09:25:31.100 NaN
1 2010-07-27 09:25:31.200 NaN
2 2010-07-27 09:25:31.300 NaN
3 2010-07-27 09:25:31.400 NaN
4 2010-07-27 09:25:31.500 NaN
5 2010-07-27 09:25:31.600 NaN
6 2010-07-27 09:25:31.700 NaN
7 2010-07-27 09:25:31.800 NaN
8 2010-07-27 09:25:31.900 NaN
9 2010-07-27 09:25:32.000 NaN
目标:
0 2010-07-27 09:25:31.100 A
1 2010-07-27 09:25:31.200 A
2 2010-07-27 09:25:31.300 NaN
3 2010-07-27 09:25:31.400 A
4 2010-07-27 09:25:31.500 A
5 2010-07-27 09:25:31.600 A
6 2010-07-27 09:25:31.700 Nan
7 2010-07-27 09:25:31.800 NaN
8 2010-07-27 09:25:31.900 A
9 2010-07-27 09:25:32.000 A
您可以执行以下操作。首先,您需要确保您的列在datetime64[ns]
中,然后在该列上使用resample
df['Time']=pd.to_datetime(df['Time'])
df.set_index('Time').resample('0.1S').last().fillna(0).reset_index()
输出
Time Area
0 2010-07-27 09:25:31.100 A
1 2010-07-27 09:25:31.200 A
2 2010-07-27 09:25:31.300 0
3 2010-07-27 09:25:31.400 A
4 2010-07-27 09:25:31.500 A
5 2010-07-27 09:25:31.600 A
6 2010-07-27 09:25:31.700 0
7 2010-07-27 09:25:31.800 0
8 2010-07-27 09:25:31.900 A
9 2010-07-27 09:25:32.000 A