如何使用python删除单词中的空格?

问题描述 投票:1回答:2

这是给定John plays chess and l u d o.的输入我希望输出采用这种格式(如下所示)

John plays chess and ludo.

我已经尝试使用正则表达式来删除空格,但对我来说不起作用。

import re
sentence='John plays chess and l u d o'
sentence = re.sub(r"\s+", "", sentence, flags=re.UNICODE)

print(sentence)

我期待输出John plays chess and ludo.。 但我得到的输出是Johnplayschessandludo

python python-3.x text-mining spacy removing-whitespace
2个回答
2
投票

这应该工作!本质上,解决方案从句子中提取单个字符,使其成为单词并将其连接回剩余的句子。

s = 'John plays chess and l u d o'

chars = []
idx = 0

#Get the word which is divided into single characters
while idx < len(s)-1:

    #This will get the single characters around single spaces
    if s[idx-1] == ' ' and s[idx].isalpha() and s[idx+1] == ' ':
        chars.append(s[idx])

    idx+=1

#This is get the single character if it is present as the last item
if s[len(s)-2] == ' ' and s[len(s)-1].isalpha():
    chars.append(s[len(s)-1])

#Create the word out of single character
join_word = ''.join(chars)

#Get the other words
old_words = [item for item in s.split() if len(item) > 1]

#Form the final string
res = ' '.join(old_words + [join_word])

print(res)

然后输出看起来像

John plays chess and ludo

0
投票

上面的代码在解决问题时不会保持单词序列。例如,尝试输入这句话“John play s h s s and ludo”

如果您在任何位置的文本中都有单个空格,请尝试使用此选项:

sentence = "John plays c h e s s and ludo"
sentence_list = sentence.split()
index = [index for index, item in enumerate(sentence_list) if len(item) == 1]
join_word = "".join([item for item in sentence_list if len(item) == 1])
if index != []:
    list(map(lambda x: sentence_list.pop(index[0]), index[:-1]))
    sentence_list[index[0]] = join_word
    sentence = " ".join(sentence_list)
else:
    sentence
© www.soinside.com 2019 - 2024. All rights reserved.