我正在尝试在我的虚拟 ubuntu shell 上安装和使用 Selenium chromedriver,我一步步按照各种教程进行操作,但似乎仍然出现问题......经过对这个问题的多次研究,我找不到任何答案。
这是我尝试运行的小代码:
from selenium import webdriver
from selenium.webdriver import Chrome
from selenium.webdriver.chrome.options import Options
chrome_options = Options()
chrome_options.add_argument('--no-sandbox')
chrome_options.add_argument('--disable-extensions')
driver = webdriver.Chrome(chrome_options=chrome_options)
driver.get('https://google.com')
由于我在各个主题上找到的建议,我添加了 chrome_options。但我不太明白为什么有必要?
不幸的是,我的程序向我发送了以下错误,我被困住了,我不知道该怎么办:
/home/lclis/anaconda3/lib/python3.7/site-packages/ipykernel_launcher.py:7: DeprecationWarning: use options instead of chrome_options
import sys
---------------------------------------------------------------------------
WebDriverException Traceback (most recent call last)
<ipython-input-4-259faa721232> in <module>
5 chrome_options.add_argument('--no-sandbox')
6 chrome_options.add_argument('--disable-extensions')
----> 7 driver = webdriver.Chrome(chrome_options=chrome_options)
8 driver.get('https://google.com')
~/anaconda3/lib/python3.7/site-packages/selenium/webdriver/chrome/webdriver.py in __init__(self, executable_path, port, options, service_args, desired_capabilities, service_log_path, chrome_options, keep_alive)
79 remote_server_addr=self.service.service_url,
80 keep_alive=keep_alive),
---> 81 desired_capabilities=desired_capabilities)
82 except Exception:
83 self.quit()
~/anaconda3/lib/python3.7/site-packages/selenium/webdriver/remote/webdriver.py in __init__(self, command_executor, desired_capabilities, browser_profile, proxy, keep_alive, file_detector, options)
155 warnings.warn("Please use FirefoxOptions to set browser profile",
156 DeprecationWarning, stacklevel=2)
--> 157 self.start_session(capabilities, browser_profile)
158 self._switch_to = SwitchTo(self)
159 self._mobile = Mobile(self)
~/anaconda3/lib/python3.7/site-packages/selenium/webdriver/remote/webdriver.py in start_session(self, capabilities, browser_profile)
250 parameters = {"capabilities": w3c_caps,
251 "desiredCapabilities": capabilities}
--> 252 response = self.execute(Command.NEW_SESSION, parameters)
253 if 'sessionId' not in response:
254 response = response['value']
~/anaconda3/lib/python3.7/site-packages/selenium/webdriver/remote/webdriver.py in execute(self, driver_command, params)
319 response = self.command_executor.execute(driver_command, params)
320 if response:
--> 321 self.error_handler.check_response(response)
322 response['value'] = self._unwrap_value(
323 response.get('value', None))
~/anaconda3/lib/python3.7/site-packages/selenium/webdriver/remote/errorhandler.py in check_response(self, response)
240 alert_text = value['alert'].get('text')
241 raise exception_class(message, screen, stacktrace, alert_text)
--> 242 raise exception_class(message, screen, stacktrace)
243
244 def _value_or_default(self, obj, key, default):
WebDriverException: Message: unknown error: Chrome failed to start: exited abnormally
(unknown error: DevToolsActivePort file doesn't exist)
(The process started from chrome location /usr/bin/chromium-browser is no longer running, so ChromeDriver is assuming that Chrome has crashed.)
有人已经遇到过这个问题或者有什么建议吗?
Chrome 在启动期间崩溃的一个常见原因是在 Linux 上以
用户 (root
) 身份运行 Chrome。虽然可以通过在创建 WebDriver 会话时传递administrator
标志来解决此问题,但此类配置不受支持且强烈建议不要这样做。您需要配置环境才能以普通用户身份运行 Chrome。--no-sandbox
第一个错误消息...
/home/lclis/anaconda3/lib/python3.7/site-packages/ipykernel_launcher.py:7: DeprecationWarning: use options instead of chrome_options
import sys
---------------------------------------------------------------------------
WebDriverException Traceback (most recent call last)
<ipython-input-4-259faa721232> in <module>
5 chrome_options.add_argument('--no-sandbox')
6 chrome_options.add_argument('--disable-extensions')
----> 7 driver = webdriver.Chrome(chrome_options=chrome_options)
...意味着 ChromeDriver 无法启动/生成新的 浏览上下文,即 Chrome 浏览器 会话。
根据 Selenium 的 python 客户端 v3.8.0 chrome_options
的
发行说明已弃用:
浏览器选项参数现已在驱动程序之间标准化为
。options
、firefox_options
和chrome_options
现已弃用。ie_options
在初始化 ChromeDriver/Chrome 会话时,您需要使用
options
而不是 chrome_options
。因此,您的代码块实际上将是:
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
chrome_options = Options()
chrome_options.add_argument('--no-sandbox')
chrome_options.add_argument('--disable-extensions')
driver = webdriver.Chrome(executable_path='/path/to/chromedriver', options=chrome_options)
driver.get('https://google.com')
您可以使用以下代码片段在服务器上运行selenium爬虫(ubuntu,无GUI,框中没有X窗口可用):
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
chrome_options = Options()
chrome_options.add_argument('--no-sandbox')
chrome_options.add_argument('--headless')
driver = webdriver.Chrome(executable_path='/usr/lib/chromium-browser/chromedriver', options=chrome_options)
driver.get('https://google.com')
我在谷歌上搜索红宝石错误,但发现了这个。对于某些人来说可能会有帮助。
我错误地设置了水豚会话(它本身依赖于selenium)
# WRONG
session = Capybara::Session.new(:selenium_chrome)
# CORRECT, see headless added
session = Capybara::Session.new(:selenium_chrome_headless)
错误消息明确指出启动 Chromium 时发生错误。在这种情况下,请尝试在系统上手动启动 Chromium/Chrome。当我尝试自己启动 Chromium 时,我发现
--no-sandbox
和 --headless=new
参数是必要的。
尝试改为
--headless
或 --headless=new