我有这个数据框。
In [0]: df
Out[0]:
A
0 1258GA 25/15 TABLE 090626 038272
1 GOODIES 762088 A714816
2 TABLE AA88547 734963 GOODIES
3 WATER 02/450 FROM TOMORROW 48246
4 02H12 ALSCA 00548246B GOODIES
而且我想在下面得到结果。
In [1]: df
Out[1]:
A Category
0 1258GA 25/15 TABLE 090626 038272 TABLE
1 GOODIES 762088 A714816 GOODIES
2 TABLE AA88547 734963 GOODIES TABLE GOODIES
3 WATER 02/450 FROM TOMORROW 48246 WATER
4 02H12 ALSCA 00548246B GOODIES GOODIES
我尝试了很多事情,但是没有得到结果
也许这会有所帮助:
df['A'].str.findall(r'\b[A-Z]+\b')
0 [TABLE]
1 [GOODIES]
2 [TABLE, GOODIES]
3 [WATER, FROM, TOMORROW]
4 [ALSCA, GOODIES]