我正在使用BeautifulSoup按类“reply_to”查找href。
<div class="message">
<div class="reply_to details">
In reply to <a href="#go_to_message18" onclick="return
GoToMessage(18)">this message</a>
</div>
</div>
现在,下面的代码可以找到reply_to元素,但接下来如何找到并从中检索href
soup = BeautifulSoup(parsed.input[0], "html.parser")
alldefaultmesssages = soup.select(".message.default")
alldefaultmesssages.select_one(".reply_to")
你需要到anchor
元素的孩子reply_to
标签:
import requests
from bs4 import BeautifulSoup
html = """<div class="message">
<div class="reply_to details">
In reply to <a href="#go_to_message18" onclick="return
GoToMessage(18)">this message</a>
</div>
</div>"""
soup = BeautifulSoup(html, 'html.parser')
print(soup.find("div", class_="reply_to").a.get('href'))
#>>> #go_to_message18