代码之家 › 专栏 › 技术社区 › BigBoy1337

Selenium文件下载完成回调?

chromium selenium python

BigBoy1337 · 技术社区 · 5 年前

from selenium import webdriver
from selenium.webdriver.chrome.options import Options
import os

# function to take care of downloading file
def enable_download_headless(browser,download_dir):
    browser.command_executor._commands["send_command"] = ("POST", '/session/$sessionId/chromium/send_command')
    params = {'cmd':'Page.setDownloadBehavior', 'params': {'behavior': 'allow', 'downloadPath': download_dir}}
    browser.execute("send_command", params)

# instantiate a chrome options object so you can set the size and headless preference
# some of these chrome options might be uncessary but I just used a boilerplate
# change the <path_to_download_default_directory> to whatever your default download folder is located
chrome_options = Options()
chrome_options.add_argument("--headless")
chrome_options.add_argument("--window-size=1920x1080")
chrome_options.add_argument("--disable-notifications")
chrome_options.add_argument('--no-sandbox')
chrome_options.add_argument('--verbose')
chrome_options.add_experimental_option("prefs", {
        "download.default_directory": ".",
        "download.prompt_for_download": False,
        "download.directory_upgrade": True,
        "safebrowsing_for_trusted_sources_enabled": False,
        "safebrowsing.enabled": False
})
chrome_options.add_argument('--disable-gpu')
#chrome_options.add_argument('--disable-software-rasterizer')

# initialize driver object and change the <path_to_chrome_driver> depending on your directory where your chromedriver should be
driver = webdriver.Chrome(options=chrome_options)

# change the <path_to_place_downloaded_file> to your directory where you would like to place the downloaded file
download_dir = "."

# function to handle setting up headless download
enable_download_headless(driver, download_dir)

# get request to target the site selenium is active on
driver.get("https://www.thinkbroadband.com/download")

# initialize an object to the location on the html page and click on it to download
search_input = driver.find_element_by_css_selector('#main-col > div > div > div:nth-child(8) > p:nth-child(1) > a > img')
search_input.click()

#this won't work because the file isn't here yet
with zipfile.ZipFile('my_downloaded_file.zip', 'r') as zip_ref:
    zip_ref.extractall('.')
    file = open('my_downloaded_file')
    for line in file:
        print(line)
    file.close()

我想用我刚下载的文件。一切都在底部按钮。单击()只是一个示例—例如打印出内容。如果我不必按名称查找文件会更好—如果我可以传递一个空对象或其他东西以及一个函数,以便在下载完成时调用,在那里我可以使用该对象。我怎么能做这样的事?

0 回复 | 直到 5 年前

john heroi 4 年前

from selenium import webdriver
from selenium.webdriver.chrome.options import Options

options = Options()
options.headless = True

with webdriver.Chrome(options=options) as b:
    # your code here
    b.execute_script("window.open('');")

    
    b.switch_to.window(b.window_handles[1])
    b.get('chrome://downloads/')
    x = b.page_source

    if "Show in folder" in x:
        print("Yes")
    else:
        print("No")

推荐文章

Google User · Django管理员在`list_display中未显示`creation_date`字段`

7 月前

user29747013 · 如何创建一个新的数据框架,其中包含原始数据框架中列的聚合列?

7 月前

ÎÎÎ½Î· ÎÎ®Î¹Î½Î¿Ï · Python lxml.html语法错误:使用lxml find时XPATH的谓词无效

7 月前

user29715306 · from_users=和chats=电视节目中的差异

7 月前

Redshoe · 当执行numpy.genfromtxt()时,python是否会读取文件的所有行?

7 月前

RASEL MAHMUD · 为什么以及如何在is_even()函数内的IF条件中递归X变量在满足0后递增?[副本]

7 月前

prayner · 更新嵌套字典包含列表中的项

7 月前

Bringo Jr · 我可以在O(n)中解决这个问题吗?

7 月前

Dave · 如何在for循环中修改列表值

7 月前

Shukurullox Komiljonov · 从记录中获得相互和解。使用SQL

7 月前