BS4 getText函数产生意外的输出

问题描述 投票:1回答:1

以下HTML示例根据文本样式格式产生不同的结果这是一行中的示例

card = """
<ul class="wrapper--inline-block float--left margin-top--15 padding-left--20 font--weight-300"><li><span class="font--weight-500">Minimum Qualification:</span> Bachelor</li><li><span class="font--weight-500">Experience Level:</span> Graduate trainee</li><li><span class="font--weight-500">Experience Length:</span> 1 year</li></ul>
"""

输出:

Minimum Qualification: BachelorExperience Level: Graduate traineeExperience Length: 1 year

并且格式化html示例时

card = """
<ul class="wrapper--inline-block float--left margin-top--15 padding-left--20 font--weight-300">
<li><span class="font--weight-500">Minimum Qualification:</span> Bachelor</li>
<li><span class="font--weight-500">Experience Level:</span> Graduate trainee</li>
<li><span class="font--weight-500">Experience Length:</span> 1 year</li>
</ul>
"""

输出

Minimum Qualification: Bachelor
Experience Level: Graduate trainee
Experience Length: 1 year

问题是,如何使第一种情况像第二种情况一样产生所需的输出。这是我当前的代码

qualifications=  BeautifulSoup(card, "html.parser")
print(qualifications.getText())
python beautifulsoup
1个回答
1
投票

使用separator="\n"获得所需的输出,

qualifications.getText(separator="\n")
© www.soinside.com 2019 - 2024. All rights reserved.