我有一个脚本,该脚本从一个站点下载pdf文件,该站点每个月都在更新,我想使它自动化。它可以工作,但我无法让它发挥得淋漓尽致,我认为这是因为它无法正确处理下载。它似乎以无头的方式启动了chrome,并且我的导航命令似乎可以正常运行,但是当下载时什么也没发生。
#!/usr/bin/env ruby
#
require 'capybara'
require 'rb-inotify'
require 'webdrivers/chromedriver'
def initialise
Capybara.register_driver :chrome do |app|
Capybara::Selenium::Driver.new(app, :browser => :chrome, options: chrome_options)
end
@session = Capybara::Session.new(:chrome)
end
# Settings and profile for the Chrome Browser
# NOTE: still cannot get headless working
def chrome_options
opts = Selenium::WebDriver::Chrome::Options.new
opts.add_argument('--headless') unless ENV['UI']
opts.add_argument('--no-sandbox')
opts.add_argument('--disable-gpu')
opts.add_argument('--disable-dev-shm-usage')
opts.add_argument('--window-size=1920,1080')
opts.add_preference(:download,
directory_upgrade: true,
prompt_for_download: false,
default_directory: "~/Downloads")
opts.add_preference(:plugins,
plugins_disabled: ["Chrome PDF Viewer"])
opts.add_preference(:browser, set_download_behavior: { behavior: 'allow' })
opts
end
在不同版本的Chrome和selenium-webdriver中,更改/增长下载所需的设置。似乎您缺少其中之一。
opts.add_preference('download.default_directory', '~/Downloads')
根据版本,您还可以做的另一件事是
def initialise
Capybara.register_driver :chrome do |app|
Capybara::Selenium::Driver.new(app, :browser => :chrome, options: chrome_options).tap do |driver|
driver.browser.download_path = '~/Downloads'
end
@session = Capybara::Session.new(:chrome)
end