我正在尝试从CSV的特定列('sys_tags')中删除一系列字符。
n = 20
dataframe['sys_tags'].value_counts()[:20].index.tolist()
输出:
['IAM~3rd',
'IAM~3rd, IAM~KI-000',
'IAM~1st',
'IAM~KI-000',
'IAM~3rd, IAM~KI-057',
'iam~2nd, IAM~KI-000',
'IAM~3rd, IAM~KI-050',
'IAM~3rd, IAM~KI-049',
'iam~2nd',
'IAM~1st, IAM~3rd, IAM~KI-000',
'IAM~1st, IAM~3rd',
'iam~2nd, IAM~KI-049',
'IAM~3rd, IAM~KI-047',
'IAM~KI-069',
'IAM~3rd, IAM~KI-042',
'IAM~KI-049',
'IAM~3rd, IAM~KI-000, IAM~KI-057',
'IAM~1st, IAM~3rd, IAM~KI-050',
'IAM~KI-016',
'IAM~KI-031']
我正在尝试删除所有单词(例如:IAM〜3rd)对进一步分类没有用的那些。
我正在尝试使用以下方法将其删除,但是以错误结尾,有人可以告诉我一种更好的方法,该方法适用于多个单词,而不论大小写如何?
dataframe['sys_tags'] = dataframe['sys_tags'].str.strip('IAM~2nd,' , 'IAM~3rd,' )
TypeErrorstrip() takes from 1 to 2 positional arguments but 3 were given
据我所知,您想从sys_tags
列的每一行中删除某些“单词”。