美丽的汤和碎片 - 获取href和src属性。

问题描述 投票:0回答:1

这就是代码。

url = f'https://www.premierleague.com/players'

# Initiate a splinter instance of the URL
browser.visit(url)

browser.find_by_tag('div[class="table playerIndex"]')
soup = BeautifulSoup(browser.html, 'html.parser')
for el in soup:
    td =  el.findAll('td')
    for each_td in td:
        url = each_td.find('a', href=True)
        print (url)

打击目标物品,但后面的代码是 None:

<a class="playerName" href="/players/19970/Max-Aarons/overview"><img alt="" class="img" data-player="p232980" data-script="pl_player-image" data-size="40x40" data-widget="player-image" src="//platform-static-files.s3.amazonaws.com/premierleague/photos/players/40x40/Photo-Missing.png"/>Max Aarons</a>
None
None
<a class="playerName" href="/players/13279/Abdul-Rahman-Baba/overview"><img alt="" class="img" data-player="p118335" data-script="pl_player-image" data-size="40x40" data-widget="player-image" src="//platform-static-files.s3.amazonaws.com/premierleague/photos/players/40x40/Photo-Missing.png"/>Abdul Rahman Baba</a>
None
None

我如何获得 hrefsrc 值?

python beautifulsoup splinter
1个回答
0
投票

你可以以字典的形式访问一个元素的属性和特性。

for el in soup: 
    td = el.findAll('td') 
    for each_td in td: 
        link = each_td.find('a', href=True)
        if link:
            print(link['href'])
        image = each_td.find('img')
        if image:
            print(image['src'])
© www.soinside.com 2019 - 2024. All rights reserved.