从python列中删除某些长度的字符

问题描述 投票:-3回答:1

我正在尝试从CSV的特定列('sys_tags')中删除一系列字符。

n = 20 dataframe['sys_tags'].value_counts()[:20].index.tolist()

输出:

['IAM~3rd',
 'IAM~3rd, IAM~KI-000',
 'IAM~1st',
 'IAM~KI-000',
 'IAM~3rd, IAM~KI-057',
 'iam~2nd, IAM~KI-000',
 'IAM~3rd, IAM~KI-050',
 'IAM~3rd, IAM~KI-049',
 'iam~2nd',
 'IAM~1st, IAM~3rd, IAM~KI-000',
 'IAM~1st, IAM~3rd',
 'iam~2nd, IAM~KI-049',
 'IAM~3rd, IAM~KI-047',
 'IAM~KI-069',
 'IAM~3rd, IAM~KI-042',
 'IAM~KI-049',
 'IAM~3rd, IAM~KI-000, IAM~KI-057',
 'IAM~1st, IAM~3rd, IAM~KI-050',
 'IAM~KI-016',
 'IAM~KI-031']

我正在尝试删除所有单词(例如:IAM〜3rd)对进一步分类没有用的那些。

我正在尝试使用以下方法将其删除,但是以错误结尾,有人可以告诉我一种更好的方法,该方法适用于多个单词,而不论大小写如何?

dataframe['sys_tags'] = dataframe['sys_tags'].str.strip('IAM~2nd,' , 'IAM~3rd,' )

TypeErrorstrip() takes from 1 to 2 positional arguments but 3 were given

python regex pandas text-mining text-processing
1个回答
0
投票

据我所知,您想从sys_tags列的每一行中删除某些“单词”。

© www.soinside.com 2019 - 2024. All rights reserved.