I have two DataFrames:
df1 = [1, 2, 3, 4, 5]
df2 = [1, 2, 3, 7, 9]
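I want a new DataFrame containing only [4, 5]. (I used numbers here, but the real data is two lists of email addresses.) I will then save that DataFrame to a CSV file.
How can I do this?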
df1 = [1, 2, 3, 4, 5]
df2 = [1, 2, 3, 7, 9]
[x for x in df1 if x not in df2]
These look like plain lists, so we can also use set:
set(df1)-set(df2)
Out[398]: {4, 5}
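Note that a set does not keep the original order. If the order of the emails matters, a minimal sketch that combines both ideas above (a list comprehension for order, a set for fast lookups) could look like this. The helper name `ordered_difference` is made up for illustration:

```python
def ordered_difference(left, right):
    """Return the items of `left` that are not in `right`, in their original order."""
    right_set = set(right)  # set membership checks are O(1) on average
    return [x for x in left if x not in right_set]

df1 = [1, 2, 3, 4, 5]
df2 = [1, 2, 3, 7, 9]
result = ordered_difference(df1, df2)
print(result)
```

Unlike `set(df1) - set(df2)`, this keeps duplicates that occur in the first list.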
Difference of pandas DataFrames:
import pandas as pd
df1 = pd.DataFrame([1, 2, 3, 4, 5])
df2 = pd.DataFrame([1, 2, 3, 7, 9])
df3 = df1.merge(df2, indicator=True, how='outer')
df3[df3['_merge'] == 'left_only']
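Since the question also asks about emails and CSV output, here is a rough end-to-end sketch of the same merge approach. The column name `email`, the sample addresses, and the helper `rows_only_in_left` are assumptions for illustration, not part of the original question:

```python
import pandas as pd

def rows_only_in_left(left, right):
    """Keep rows of `left` that have no match in `right` (an anti-join)."""
    # With no `on` argument, merge joins on the columns the two frames share.
    merged = left.merge(right, how="outer", indicator=True)
    only_left = merged[merged["_merge"] == "left_only"]
    # Drop the helper column added by indicator=True.
    return only_left.drop(columns="_merge").reset_index(drop=True)

# Hypothetical sample data standing in for the two email lists.
df1 = pd.DataFrame({"email": ["a@example.com", "b@example.com", "c@example.com"]})
df2 = pd.DataFrame({"email": ["a@example.com", "d@example.com"]})

result = rows_only_in_left(df1, df2)

# to_csv returns the CSV text when no path is given;
# pass a file name instead (e.g. result.to_csv("out.csv", index=False)) to write a file.
csv_text = result.to_csv(index=False)
print(csv_text)
```

`index=False` keeps the row index out of the CSV, which is usually what you want for a plain list of emails.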
Concatenating pandas DataFrames (this keeps the rows that appear in only one of the two frames):
df4 = pd.concat([df1, df2]).drop_duplicates(keep=False)
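Be aware that `drop_duplicates(keep=False)` on the concatenated frame returns rows unique to either frame, so for the data in the question it would give 4, 5, 7, 9 rather than just 4, 5. If only the rows from the first frame are wanted, one possible alternative is an `isin` filter. The sketch below assumes a single column named `email` and made-up addresses:

```python
import pandas as pd

# Hypothetical sample data standing in for the two email lists.
df1 = pd.DataFrame({"email": ["a@example.com", "b@example.com", "c@example.com"]})
df2 = pd.DataFrame({"email": ["a@example.com", "d@example.com"]})

# Symmetric difference: rows that occur in exactly one of the frames.
either = pd.concat([df1, df2]).drop_duplicates(keep=False)

# One-sided difference: rows of df1 whose email is not present in df2.
only_df1 = df1[~df1["email"].isin(df2["email"])]

print(either["email"].tolist())
print(only_df1["email"].tolist())
```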