在我正在使用的数据中,化妆短语(即粉底,唇/唇膏,遮瑕膏,古铜色等)与以前的短语捆绑在一起。如何基于多个分隔符(分隔符本身就是组成短语)对这些构成短语进行分区(因为我想保留分隔符而不要拆分)?具体来说,我在列出分隔符短语时遇到麻烦,因此可以考虑所有分隔符并将其保留在分区中。
示例短语
‘vamplipstick @’
‘208bronzer :’
‘jadefoundation :’
‘nc45blush @’
‘eyeseyeliner @’
‘kikomilanolips :’
‘235concealer @’
理想输出
‘vamp lipstick @’
‘208 bronzer:’
‘jade foundation:’
‘nc45 blush @’
‘eyes eyeliner @’
‘kikomilano lips:’
‘235 concealer @’
到目前为止的代码
makeup = r"\w+\s+[@:]"
separators = ["foundation", "bronzer", "lips", "lipstick", "concealer", "blush", "eyeliner"]
[makeup.partition(<?list_multiple_separators?>) for makeup in df]