如何在python中使用beautifulsoup提取href内容?

问题描述 投票:1回答:2
import requests
from bs4 import BeautifulSoup

page = requests.get('http://espn.go.com/nba/team/roster/_/name/atl/atlanta-hawks')
soup = BeautifulSoup(page.content, "html.parser")
player_list = soup.find_all(class_="Image__Wrapper")
#player_list = soup.find_all("tr")
print(player_list[1])

我得到的输出是

<div class="Image__Wrapper aspect-ratio--child"><img alt="https://a.espncdn.com/i/headshots/nba/players/full/3062667.png" class="" data-mptype="image" src="data:image/gif;base64,R0lGODlhAQABAIAAAAAAAP///yH5BAEAAAAALAAAAAABAAEAAAIBRAA7" title="DeAndre' Bembry"/></div>

我只对得到DeAndre' Bembry感兴趣,如何提取。我也有点困惑,如何获得所有球员的名字列表。

python list beautifulsoup href
2个回答
0
投票

你可以尝试

import requests
from bs4 import BeautifulSoup

page = requests.get('http://espn.go.com/nba/team/roster/_/name/atl/atlanta-hawks')
soup = BeautifulSoup(page.content, "html.parser")
player_list = soup.find_all(class_="Image__Wrapper")
#player_list = soup.find_all("tr")
print(player_list[1].img["title"])

产量

 DeAndre' Bembry

并打印所有球员

print([i.img["title"] for i in player_list if 0 < i.img["title"].count(" ") <= 3])

产量

["DeAndre' Bembry", 'Charlie Brown Jr.', 'Clint Capela', 'Vince Carter', 'John Collins', 'Dewayne Dedmon', 'Bruno Fernando', 'Brandon Goodwin', 'Treveon Graham', 'Kevin Huerter', "De'Andre Hunter", 'Damian Jones', 'Skal Labissiere', 'Cam Reddish', 'Jeff Teague', 'Trae Young']

0
投票
player_list[1].find_next('img').get('title')  # "DeAndre' Bembry"
© www.soinside.com 2019 - 2024. All rights reserved.