Scraping a table returns only the "table", not the table's contents

Question (votes: 0, answers: 1)

Scraping the table returns only the "table" itself rather than the table's contents. Here is my code:

from urllib.request import urlopen
from bs4 import BeautifulSoup

url = "http://data.eastmoney.com/gdhs/detail/600798.html"
html = urlopen(url)

soup = BeautifulSoup(html, 'lxml')
table = soup.find_all('table')
print(table)

python beautifulsoup urlopen

1 Answer

0 votes

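Your code is finding the table just fine. Because a table is made up of multiple elements (tr / td), you have to iterate over those elements to get the inner text of the table cells.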

# find_all('table')[0] is the first table on the page. For the second table, use soup.find_all('table')[1], and so on.
table = soup.find_all('table')[0]

# The [1:] slice skips the header row. To include it, use table('tr')[0:],
# but note that header cells are often <th>, so row('td') may be empty there.
for row in table('tr')[1:]:
    # table('tr') is shorthand for table.find_all('tr'); this prints the first cell of each row.
    print(row('td')[0].getText().strip())
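
The loop above prints only the first cell of each row. As a minimal sketch of the same idea extended to every cell, the example below collects each row into a list of strings. The SAMPLE_HTML markup and the table_to_rows helper are hypothetical and only stand in for the downloaded page, and the built-in 'html.parser' is used here instead of 'lxml' so that nothing beyond bs4 is required.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the downloaded page.
SAMPLE_HTML = """
<table>
  <tr><th>Date</th><th>Holders</th></tr>
  <tr><td>2019-03-31</td><td> 41,250 </td></tr>
  <tr><td>2018-12-31</td><td> 43,100 </td></tr>
</table>
"""


def table_to_rows(table):
    """Return the table as a list of rows, each a list of cell strings."""
    rows = []
    for tr in table.find_all('tr'):
        # Header cells are <th> and data cells are <td>, so match both.
        cells = tr.find_all(['th', 'td'])
        rows.append([cell.get_text(strip=True) for cell in cells])
    return rows


soup = BeautifulSoup(SAMPLE_HTML, 'html.parser')
rows = table_to_rows(soup.find('table'))
for row in rows:
    print(row)
```

With the real page, the same helper could be called as table_to_rows(soup.find_all('table')[0]), assuming the table is present in the HTML that urlopen returns.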

© www.soinside.com 2019 - 2024. All rights reserved.