如何在网站完成加载动态内容之前延迟fetch()

问题描述 投票:1回答:2

我有一个chrome扩展名。每当用户点击扩展程序的按钮时,它将下载以下网址的来源:“smmry.com/(用户当前活动标签的网址)”

我正在使用以下javascript代码以html文件的形式下载URL的来源。当用户单击我的扩展程序按钮时,此代码当前运行(变量URL是我的扩展程序可以下载的假设URL。在这种情况下,用户实际上将浏览cnn.com/(path_to_news_article),但扩展名将被下载:smmry.com/https://www.cnn.com/(path_to_news_article)):

let URL = 'https://smmry.com/https://www.cnn.com/2018/04/01/politics/ronald-kessler-jake-tapper-interview/index.html#&SM_LENGTH=7'
    fetch(URL)
        .then((resp) => resp.text())
        .then(responseText => {
           download("website_source.html", responseText)
        })

function download(filename, text) {

    var element = document.createElement('a');
    element.setAttribute('href', 'data:text/plain;charset=utf-8,' + encodeURIComponent(text));
    element.setAttribute('download', filename);

    element.style.display = 'none';
    document.body.appendChild(element);

    element.click();

    document.body.removeChild(element);
}

这是网页的来源:https://smmry.com/https://www.cnn.com/2018/04/01/politics/ronald-kessler-jake-tapper-interview/index.html#&SM_LENGTH=7

但是,正如您可以看到的那样,如果您访问该网页,有时网页会花费很少的时间(最多几秒钟)来总结文章。这篇文章不太引人注意 - 但通常粉红色的加载栏会在粉红色框中上下移动,直到摘要创建并显示在网站上。

我相信我的代码在完成文章总结之前下载了网站的源代码,因此我的程序下载的HTML文件不包含文章的摘要。

我怎样才能确保fetch()请求只在网站https://smmry.com完成文章https://www.cnn.com/2018/04/01/politics/ronald-kessler-jake-tapper-interview/index.html后才下载网站内容。

编辑:我的manifest.json文件。

{
"manifest_version": 2,
"name": "Summarizer",
"version": "1.0",

"description": "Summarizes webpages",

"permissions": [
    "tabs",
    "downloads",
    "*://*.smmry.com/*"
],

"icons": {
    "48": "icons/border-48.png"
},

"browser_action": {
    "browser_style": true,
    "default_popup": "popup/choose_page.html",
    "default_icon": {
        "16": "icons/summarizer-icon-16.png",
        "32": "icons/summarizer-icon-32.png"
    }
}
}
javascript google-chrome-extension web-scraping fetch-api dynamic-content
2个回答
0
投票

我想你正在寻找document.onload

也许你需要做这样的事情:

document.onload = () => { 
    let URL = 'https://smmry.com/https://www.cnn.com/2018/04/01/politics/ronald-kessler-jake-tapper-interview/index.html#&SM_LENGTH=7'

    fetch(URL)
    .then((resp) => resp.text())
    .then(responseText => {
       download("website_source.html", responseText)
    })

    const download = (filename, text) => {

    const element = document.createElement('a');
    element.setAttribute('href', 'data:text/plain;charset=utf-8,' + encodeURIComponent(text));
    element.setAttribute('download', filename);

    element.style.display = 'none';
    document.body.appendChild(element);

    element.click();

     document.body.removeChild(element);
    }
};

onload将等待页面,然后您可以进行提取


0
投票

使用addEventListener并清理代码:

function main(){
  const URL = 'https://smmry.com/https://www.cnn.com/2018/04/01/politics/ronald-kessler-jake-tapper-interview/index.html#&SM_LENGTH=7'
  fetch(URL)
    .then(resp => resp.text())
    .then(responseText => download("website_source.html", responseText));

  function download(filename, text) {
    const element = document.createElement('a');
    element.href = 'data:text/plain;charset=utf-8,' + encodeURIComponent(text);
    element.setAttribute('download', filename);
    element.style.display = 'none';
    document.body.appendChild(element);
    element.click();
    element.remove();
  }
}
document.addEventListener('DOMContentLoaded', main);
© www.soinside.com 2019 - 2024. All rights reserved.