如何在字符串赋值中修复错误的utf字符?

问题描述 投票:0回答:1

我有一个python3.6循环执行以下操作:

childrenList = []

output["text"] = path

for item in sorted(os.listdir(path)):
    contentFullPath = os.path.join(path, item)
    children = {}

    if os.path.isfile(contentFullPath):
        print("item = "+str(item))
        children["text"] = item
        print("children = "+str(children))
        children["id"] = contentFullPath
        childrenList.append(children)

我得到以下输出:

web_1     | item = Obamaééé.mp4
web_1     | children = {'icon': 'jstree-file', 'text': 
            'Obamae\udccc\udc81e\udccc\udc81e\udccc\udc81.mp4'}

为什么名为children的变量中的文本未正确显示?

python python-3.x utf
1个回答
-1
投票
import unidecode

text = 'Obamaééé.mp4'
strip_text = unidecode.unidecode(text.decode('utf-8'))
print(strip_text)

[output] : Obamaeee.mp4

希望这会对你有所帮助。

© www.soinside.com 2019 - 2024. All rights reserved.