I want to check whether a DataFrame column contains substrings taken from a column of another DataFrame.
In my example, in DF2['Column y'] I want 'manager' for 'Software Developer Manager', 'executive' for 'Online Bidding Executive', and so on.
DF1:
   unique_values  counts  Rank  Stop_Word
0  manager        9322    1.0   False
1  for            8463    2.0   True
2  developer      7323    3.0   False
3  executive      5864    4.0   False
4  engineer       5669    5.0   False
5  sales          4492    6.0   False
DF2:
   ColumnX                                         Column y
0  Digital Media Planner                           Nan
1  Online Bidding Executive                        Executive
2  Software Developer Manager                      Manager
3  Technical Support                               Nan
4  Software Test Engineer -hyderabad               engineer
5  Opening For Adobe Analytics Specialist          Nan
6  Sales- Fresher-for Leading Property Consultant  Nan
7  Opportunity For Azure Devops Architect          Nan
8  BDE                                             Nan
9  Technical Support/ Product Support              Nan
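I want DF2['Column y'] as the output.
Also, if a title contains more than one of the substrings, the one with the smallest Rank must be chosen, as at index 2 of DF2, where 'manager' is ranked above 'developer'.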
I would use apply. apply is basically just a map that runs a function on each row or column, and its result can be assigned to a column of its own.
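A minimal sketch of that idea, rebuilt from the sample data in the question. Several things here are my assumptions rather than something the question states: rows flagged Stop_Word are skipped, matching is a case-insensitive substring test, the matched keyword is returned in the lower-case form stored in DF1, and the helper name `lowest_rank_match` is made up for illustration.

```python
import pandas as pd

# Sample data rebuilt from the question.
df1 = pd.DataFrame({
    "unique_values": ["manager", "for", "developer", "executive", "engineer", "sales"],
    "counts": [9322, 8463, 7323, 5864, 5669, 4492],
    "Rank": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "Stop_Word": [False, True, False, False, False, False],
})
df2 = pd.DataFrame({
    "ColumnX": [
        "Digital Media Planner",
        "Online Bidding Executive",
        "Software Developer Manager",
        "Technical Support",
        "Software Test Engineer -hyderabad",
        "Opening For Adobe Analytics Specialist",
        "Sales- Fresher-for Leading Property Consultant",
        "Opportunity For Azure Devops Architect",
        "BDE",
        "Technical Support/ Product Support",
    ]
})

# Keywords ordered by Rank (smallest first); stop words dropped (assumption).
keywords = (
    df1.loc[~df1["Stop_Word"]]
    .sort_values("Rank")["unique_values"]
    .tolist()
)

def lowest_rank_match(title):
    """Return the first (lowest-Rank) keyword found in title, else None."""
    lowered = title.lower()
    for keyword in keywords:
        if keyword in lowered:  # plain substring test, case-insensitive
            return keyword
    return None

# apply calls the function once per value of ColumnX.
df2["Column y"] = df2["ColumnX"].apply(lowest_rank_match)
print(df2)
```

Because the keywords are walked in Rank order, 'Software Developer Manager' gets 'manager' and not 'developer'. Note that under these assumptions row 6 matches 'sales', which differs from the Nan shown in the expected output of the question; if that row really should be empty, the matching rule needs an extra condition that the question does not spell out.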