如何获取JavaScript生成的内容

问题描述 投票:2回答:1

我目前在此(https://aca3.accela.com/MILARA/GeneralProperty/PropertyLookUp.aspx)网页上获取javascript生成的内容的HTML时遇到问题。它会在页面本身上生成javascript。我想知道自己在做什么错。我正在使用的代码是这样的:

import time
from selenium import webdriver
from selenium.webdriver.support.ui import Select
from selenium.webdriver import ActionChains
from selenium.webdriver.common.keys import Keys
import selenium.webdriver.support.ui as ui
from bs4 import BeautifulSoup
driver = webdriver.Chrome(executable_path="/Users/MrPete/Downloads/chromedriver_win32/chromedriver")
driver.get('https://aca3.accela.com/MILARA/GeneralProperty/PropertyLookUp.aspx')

profession = Select(driver.find_element_by_xpath('//*[@id="ctl00_PlaceHolderMain_refLicenseeSearchForm_ddlLicenseType"]'))
profession.select_by_value("Pharmacist")
time.sleep(5) # Let the user actually see something!
lName = driver.find_element_by_xpath('//*[@id="ctl00_PlaceHolderMain_refLicenseeSearchForm_txtLastName"]')
lName.send_keys('roy')
search = driver.find_element_by_xpath('//*[@id="ctl00_PlaceHolderMain_btnNewSearch"]')
search.click()
time.sleep(5)
html = driver.execute_script("return document.getElementsByTagName('table')[38].innerHTML")
print(html)

现在,我没有输出,我得到的输出是:

<tbody><tr>
                                        <td style="white-space:nowrap;"><span class="ACA_SmLabel ACA_SmLabel_FontSize"> Showing 1-13 of 13 </span></td>
                                </tr>
                        </tbody>

[这是我要获取的表的标题。我想要的输出是整个表的HTML(张贴了由javascript生成的表的图片。我现在得到的是图片顶部的小标题,“显示1-13” 13',而我想要的是整个桌子。enter image description here

python-3.x selenium selenium-webdriver web-scraping beautifulsoup
1个回答
1
投票

尝试更改

html = driver.execute_script("return document.getElementsByTagName('table')[38].innerHTML")
print(html)

收件人:

target = driver.find_element_by_xpath('//table[@class="ACA_GridView ACA_Grid_Caption"]')
print(target.text)

输出:

Showing 1-13 of 13
License Type
License Number
First Name
Middle Initial
Last Name
Organization Name
DBA/Trade Name
License Status
License Expiration Date
Pharmacist
5302017621
Arthur
James

© www.soinside.com 2019 - 2024. All rights reserved.