如何对一个热编码数据帧进行关联规则分析(先验)

问题描述 投票:0回答:1

我得到了一个模拟购物清单的数据框:

import pandas as pd

data = {'Produce':  ['Brocolli', 'Spinach','Spinach','Lettuce','Brocolli','Lettuce','Lettuce',],
        'Dairy': ['Milk', '','Milk','Cheese','Milk','Yogurt','Yogurt',],
        'Beverage': ['', '','Orange Juice','Soda','Soda','Orange juice','',],
        'Fruit': ['Brocolli', 'Spinach','Spinach','Lettuce','Brocolli','Lettuce','Lettuce',],
        'Poultry': ['Chicken Tender', 'Chicken Breasts','Chicken Tender','Chicken Thigh','Chicken Breasts','','Chicken Breasts',],
        'Deli': ['Turkey Breasts', 'Ham','Ham','','','Turkey Breasts','',],
       }

df = pd.DataFrame (data, columns = ['Produce','Dairy','Beverage','Fruit','Deli'])

df

我如何执行一次热编码以转换此数据帧,以便可以在其上运行先验(据我所知,基本上所有独特的值都被列标签所取代,并且值被布尔值所取代)?

python one-hot-encoding apriori
1个回答
0
投票

您可以尝试:

pd.get_dummies(df)
© www.soinside.com 2019 - 2024. All rights reserved.