如何使用python网页抓取方法提取每个产品的标题

Question

这里是链接：https://www.118100.se/sok/foretag/?q=brf&loc=&ob=rel&p=0

def get_index_data(soup):
try:
    links = soup.find_all('div','a',id=False).get('href')
except:
    links = []
print(links)

Answer 1

查找全部div，其名称为className（class =“ Name”）]。这将为您提供所有标题名称。如果要href，则遍历所有titles并找到具有a的title标签是title.text的文本。

import requests
import bs4 as bs

url = 'https://www.118100.se/sok/foretag/?q=brf&loc=&ob=rel&p=0'

response = requests.get(url)
# print('Response:', response.status_code)

soup = bs.BeautifulSoup(response.text, 'lxml')
titles = soup.find_all('div',  {'class': 'Name'})

# a = soup.find_all('a')
# print(a)

for title in titles:
    link = soup.find('a',  {'title': title.text}).get('href')
    print('https://www.118100.se' + link)

如何使用python网页抓取方法提取每个产品的标题

问题描述投票：0回答：1

1个回答

最新问题

如何使用python网页抓取方法提取每个产品的标题

问题描述 投票：0回答：1

1个回答

最新问题

问题描述投票：0回答：1