将df重新采样到微秒-熊猫

问题描述 投票:0回答:1

我正在尝试对df中的时间戳重新采样以每0.1秒显示一次。它正在重新采样,但相对于那些时间戳,我正在丢失数据。

d = ({   
    'Time' : ['2010-07-27 09:25:31.1','2010-07-27 09:25:31.2','2010-07-27 09:25:31.4','2010-07-27 09:25:31.5','2010-07-27 09:25:31.6','2010-07-27 09:25:31.9','2010-07-27 09:25:32.0'],
    'Area' : ['A','A','A','A','A','A','A',],
   })

df = pd.DataFrame(data=d)

df = df.set_index('Time').asfreq('0.1S').reset_index()

输出:

                     Time Area
0 2010-07-27 09:25:31.100  NaN
1 2010-07-27 09:25:31.200  NaN
2 2010-07-27 09:25:31.300  NaN
3 2010-07-27 09:25:31.400  NaN
4 2010-07-27 09:25:31.500  NaN
5 2010-07-27 09:25:31.600  NaN
6 2010-07-27 09:25:31.700  NaN
7 2010-07-27 09:25:31.800  NaN
8 2010-07-27 09:25:31.900  NaN
9 2010-07-27 09:25:32.000  NaN

目标:

0 2010-07-27 09:25:31.100  A
1 2010-07-27 09:25:31.200  A
2 2010-07-27 09:25:31.300  NaN
3 2010-07-27 09:25:31.400  A
4 2010-07-27 09:25:31.500  A
5 2010-07-27 09:25:31.600  A
6 2010-07-27 09:25:31.700  Nan
7 2010-07-27 09:25:31.800  NaN
8 2010-07-27 09:25:31.900  A
9 2010-07-27 09:25:32.000  A
python pandas
1个回答
1
投票

您可以执行以下操作。首先,您需要确保您的列在datetime64[ns]中,然后在该列上使用resample

df['Time']=pd.to_datetime(df['Time'])
df.set_index('Time').resample('0.1S').last().fillna(0).reset_index()

输出

                       Time     Area
0   2010-07-27 09:25:31.100     A
1   2010-07-27 09:25:31.200     A
2   2010-07-27 09:25:31.300     0
3   2010-07-27 09:25:31.400     A
4   2010-07-27 09:25:31.500     A
5   2010-07-27 09:25:31.600     A
6   2010-07-27 09:25:31.700     0
7   2010-07-27 09:25:31.800     0
8   2010-07-27 09:25:31.900     A
9   2010-07-27 09:25:32.000     A
© www.soinside.com 2019 - 2024. All rights reserved.