python bs4 从每个谷歌搜索结果中提取信息

问题描述 投票:0回答:0

我在python中使用bs4搜索新闻文章,想提取文章、链接、发布日期和发布者

看html,看到类了,但是拉不出来

到目前为止我有什么:

from bs4 import BeautifulSoup
import pandas as pd
import requests
url = 'https://www.google.com/search?q=deloitte&rlz=1C1ONGR_en-GBGB998GB998&biw=1280&bih=577&tbs=sbd%3A1&tbm=nws&ei=NlE-ZMPXA4aNgQb25IiIDg&ved=0ahUKEwjDyryV_rL-AhWGRsAKHXYyAuEQ4dUDCA0&uact=5&oq=bc+partners&gs_lcp=Cgxnd3Mtd2l6LW5ld3MQAzIHCAAQigUQQzIICAAQgAQQsQMyBQgAEIAEMgUIABCABDIFCAAQgAQyBQgAEIAEMgUIABCABDIFCAAQgAQyBQgAEIAEMgUIABCABDoGCAAQFhAeOggIABCKBRCGA1CmBljbC2CzLWgAcAB4AIABOYgBqgKSAQE2mAEAoAEBwAEB&sclient=gws-wiz-news'
cookies = {"CONSENT": "YES+cb.20210720-07-p0.en+FX+410"}
result = requests.get(url,  cookies=cookies)
soup = BeautifulSoup(result.text, 'lxml')
soup
for g in soup.find_all('h3'):
    print(g.text) # returns article title
soup.find_all('SoaBEf') # returns empty list

关于我可能做错了什么的任何建议。

python beautifulsoup google-search
© www.soinside.com 2019 - 2024. All rights reserved.