beautifulsoup 相关问题

从这段Python代码中， ... resp = logout_session.get(logout_url, headers=headers, verify=False, allowed_redirects=False) soup = BeautifulSoup(resp.content, "html.parser") 打印（汤.prettif...

python beautifulsoup urlparse

回答 2 投票 0

Aws Glue 在使用 BeautifulSoup 运行 python 脚本时抛出错误

我有一个Python代码，可以使用Beautifulsoup从网站上抓取数据，并且在Jupyter.im中运行良好，尝试在awsglue中运行相同的脚本，并在glue中添加以下作业参数...

python amazon-web-services beautifulsoup aws-glue

回答 1 投票 0

美丽的汤不在外跨内定位内跨

我正在尝试为 Udemy 课程构建一个价格跟踪器，就像一个个人项目一样，因为我经常检查该网站是否有我想购买的课程的销售情况。我正在尝试使用美丽汤来抢夺...

python html web-scraping beautifulsoup

回答 1 投票 0

从雅虎财经废弃大量股票数据时出现问题

我想取消雅虎财经的“关键统计”选项卡。 HTML 页面包含我使用 Beautiful Soup 废弃的多个表。每个表仅包含 2 列，而我设法...

html web-scraping beautifulsoup yahoo-finance

回答 1 投票 0

BeatuifulSoup 迭代超过 10,000 个页面并获取数据，解析：欧洲志愿服务：一个从 EU-Site 收集机会的小型抓取工具

我正在寻找欧洲志愿服务的公开列表：我不需要完整的地址 - 但需要名称和网站。我想到数据... XML、CSV ... 具有这些字段：名称、国家/地区 - ...

python pandas dataframe web-scraping beautifulsoup

回答 1 投票 0

如何从维基百科抓取列表？

我面临着与如何从维基百科中抓取列表并传输到数据框提出的问题类似的问题。我想从列表“现代战争少于 25...

web-scraping beautifulsoup

回答 1 投票 0

使用Python和Beautiful Soup修改Confluence表

你好，我尝试在每次运行 python 代码时使用 python 自动修改汇合表（追加新行）。我能够连接到 Confluence API 并获取 Confluence 的主体...

python beautifulsoup confluence

回答 1 投票 0

我想抓取一个名字，但得到的输出是NONE

我正在抓取一个网站，想要提取名称和价格，但输出结果为“无”。我不知道我在这里做错了什么，因为我期待价格和名称的推出。

python web-scraping beautifulsoup

回答 1 投票 0

如何正确使用 Beautifulsoup 以免在 VSCode 中生成类型检查警报

页面源示例：从 bs4 导入 BeautifulSoup、标签、结果集从重新导入编译页源=“”“ 页面来源示例： from bs4 import BeautifulSoup, Tag, ResultSet from re import compile page_source = """ <html> <body> <div class="block_general_statistics"> <table> <tbody> <tr> <th>Header 1</th> <td class="total">Data 1</td> </tr> </tbody> </table> </div> </body> </html> """ 最初用于减少行数和字符数，但会生成类型检查警报，并且还要注意find | text | strip在列表理解中所有这些字体颜色都是白色的，因为缺乏必要的组合： soup = BeautifulSoup(page_source, 'html.parser') table_stats = soup.find('div', class_=compile('block_general_statistics')).find('table') table_stats_body = table_stats.find('tbody').find_all('tr') thead = [th.find('th').text.strip() for th in table_stats_body] tbody = [th.find('td', class_='total').text.strip() for th in table_stats_body] 凭借我的基础知识，我能够解决所有警报并修复所有正确着色的字体，而不会因“缺乏功能”而变成白色： soup = BeautifulSoup(page_source, 'html.parser') table_stats = soup.find('div', class_=compile('block_general_statistics')) if type(table_stats) == Tag: table_stats = table_stats.find('table') if type(table_stats) == Tag: table_stats_body = table_stats.find('tbody') if type(table_stats_body) == Tag: table_stats_body = table_stats_body.find_all('tr') if type(table_stats_body) == ResultSet: thead = [] for th in table_stats_body: if type(th) == Tag: th = th.find('th') if type(th) == Tag: thead.append(th.text.strip()) tbody = [] for th in table_stats_body: if type(th) == Tag: th = th.find('td', class_='total') if type(th) == Tag: tbody.append(th.text.strip()) 是否有任何更智能的方法可以解决警报，但又不会使简单、简短的代码变得如此庞大、详细甚至将来难以进行更改？将以下设置添加到settings.json： "python.analysis.diagnosticSeverityOverrides": { "reportAttributeAccessIssue": "none", "reportOptionalMemberAccess": "none" }, 这仅适用于那些不想修改代码而只是阻止错误的人。

python visual-studio-code beautifulsoup typechecking

回答 1 投票 0

JavascriptException：消息：javascript 错误：无法读取 null 的属性（读取“点击”）

我目前正在使用 Python Selenium WebDriver 从 HTML 网站提取信息。但是，当我访问某个网页时，该网站会显示一条消息，要求“请启用 Java...

javascript python html selenium-webdriver beautifulsoup

回答 1 投票 0

beautifulsoup 相关问题

最新问题