如何通过在检查器中操作Java脚本来刮取表?该页面仅显示当天的数据,但我想及时返回并抓取

问题描述 投票:0回答:1

我想做什么:

http://www.nea.gov.sg/anti-pollution-radiation-protection/air-pollution-control/psi/pollutant-concentrations/type/pm25-1hr/time/0100#pollutant

我指的是本页的第二和第三表。不幸的是,它仅显示您正在查看页面的“一天”的数据。我希望能够回到9月1日,然后将这些数据抓取到Google表格中。我只需要第二张表中的1小时PM2.5浓度水平和第三张表中的3小时PSI读数-但是如果我能得到全部,那就更好了。

问题

除非能以某种方式操纵访问日期,否则我看不到任何访问数据的方法。这是一个政府机构,我想他们正在用庞大的电子表格运行它,但抓取该数据的机会几乎是不可能的。

代码

<script type="text/javascript">
    var typebaseUrl = '/anti-pollution-radiation-protection/air-pollution-control/psi/pollutant-concentrations/time/0100';

    function changetype(ddl) {
        var strTime = ddl.options[ddl.selectedIndex].value;

        if (strTime != null) {
            var npage = typebaseUrl + "/type/" + strTime + "#pollutant";
            window.location = npage;
        }
    }
</script>
<h1 id="pollutant">
    1-hr PM<sub>2.5</sub> (µg/m<sup>3</sup>)<sup>b</sup> Readings on 28 Oct 2015
</h1>
<p>
    View reading for:
    <select name="ctl00$ContentPlaceHolderTitle$C005$DDLType" id="ContentPlaceHolderTitle_C005_DDLType" class="default" onchange="changetype(this);">
	<option value="SulphurDioxide">24-hr Sulphur Dioxide</option>
	<option value="PM10">24-hr PM10</option>
	<option value="PM25">24-hr PM2.5</option>
	<option value="Ozone">8-hr Ozone</option>
	<option value="CarbonMonoxide">8-hr Carbon Monoxide</option>
	<option selected="selected" value="PM25-1Hr">1-hr PM2.5</option>
	<option value="NitrogenDioxide">1-hr Nitrogen Dioxide</option>

</select>
</p>
<div>
    
        
        <table width="100%" border="0" cellpadding="4" cellspacing="1" class="noalter">
            <tr class="even">
                <td width="16%" align="center">
                    <strong>Time</strong>
                </td>

                <td width="7%" align="center"><strong>1am</strong>
                </td>
                <td width="7%" align="center"><strong>2am</strong>
                </td>
                <td width="7%" align="center"><strong>3am</strong>
                </td>
                <td width="7%" align="center"><strong>4am</strong>
                </td>
                <td width="7%" align="center"><strong>5am</strong>
                </td>
                <td width="7%" align="center"><strong>6am</strong>
                </td>
                <td width="7%" align="center"><strong>7am</strong>
                </td>
                <td width="7%" align="center"><strong>8am</strong>
                </td>
                <td width="7%" align="center"><strong>9am</strong>
                </td>
                <td width="7%" align="center"><strong>10am</strong>
                </td>
                <td width="7%" align="center"><strong>11am</strong>
                </td>
                <td width="7%" align="center"><strong>12pm</strong>
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>North</strong>
                </td>
                <td align="center">
                    87
                </td>
                <td align="center">
                    52
                </td>
                <td align="center">
                    49
                </td>
                <td align="center">
                    70
                </td>
                <td align="center">
                    61
                </td>
                <td align="center">
                    62
                </td>
                <td align="center">
                    16
                </td>
                <td align="center">
                    5
                </td>
                <td align="center">
                    7
                </td>
                <td align="center">
                    8
                </td>
                <td align="center">
                    12
                </td>
                <td align="center">
                    17
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>South</strong>
                </td>

                <td align="center">
                    66
                </td>
                <td align="center">
                    63
                </td>
                <td align="center">
                    53
                </td>
                <td align="center">
                    63
                </td>
                <td align="center">
                    72
                </td>
                <td align="center">
                    51
                </td>
                <td align="center">
                    5
                </td>
                <td align="center">
                    3
                </td>
                <td align="center">
                    6
                </td>
                <td align="center">
                    23
                </td>
                <td align="center">
                    30
                </td>
                <td align="center">
                    27
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>East</strong>
                </td>
                <td align="center">
                    55
                </td>
                <td align="center">
                    68
                </td>
                <td align="center">
                    70
                </td>
                <td align="center">
                    47
                </td>
                <td align="center">
                    60
                </td>
                <td align="center">
                    37
                </td>
                <td align="center">
                    4
                </td>
                <td align="center">
                    5
                </td>
                <td align="center">
                    7
                </td>
                <td align="center">
                    8
                </td>
                <td align="center">
                    18
                </td>
                <td align="center">
                    28
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>West</strong>
                </td>

                <td align="center">
                    49
                </td>
                <td align="center">
                    40
                </td>
                <td align="center">
                    48
                </td>
                <td align="center">
                    58
                </td>
                <td align="center">
                    51
                </td>
                <td align="center">
                    46
                </td>
                <td align="center">
                    12
                </td>
                <td align="center">
                    4
                </td>
                <td align="center">
                    2
                </td>
                <td align="center">
                    10
                </td>
                <td align="center">
                    9
                </td>
                <td align="center">
                    24
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>Central</strong>
                </td>

                <td align="center">
                    52
                </td>
                <td align="center">
                    64
                </td>
                <td align="center">
                    47
                </td>
                <td align="center">
                    44
                </td>
                <td align="center">
                    54
                </td>
                <td align="center">
                    42
                </td>
                <td align="center">
                    9
                </td>
                <td align="center">
                    6
                </td>
                <td align="center">
                    2
                </td>
                <td align="center">
                    9
                </td>
                <td align="center">
                    18
                </td>
                <td align="center">
                    18
                </td>
            </tr>
            
            <tr class="even">
                <td width="16%" align="center">
                    <strong>Time</strong>
                </td>

                <td width="7%" align="center"><strong>1pm</strong>
                </td>
                <td width="7%" align="center"><strong>2pm</strong>
                </td>
                <td width="7%" align="center"><strong>3pm</strong>
                </td>
                <td width="7%" align="center"><strong>4pm</strong>
                </td>
                <td width="7%" align="center"><strong>5pm</strong>
                </td>
                <td width="7%" align="center"><strong>6pm</strong>
                </td>
                <td width="7%" align="center"><strong>7pm</strong>
                </td>
                <td width="7%" align="center"><strong>8pm</strong>
                </td>
                <td width="7%" align="center"><strong>9pm</strong>
                </td>
                <td width="7%" align="center"><strong>10pm</strong>
                </td>
                <td width="7%" align="center"><strong>11pm</strong>
                </td>
                <td width="7%" align="center"><strong>12am</strong>
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>North</strong>
                </td>

                <td align="center">
                    <strong style="font-size:14px;">33</strong>
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>South</strong>
                </td>

                <td align="center">
                    <strong style="font-size:14px;">35</strong>
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>East</strong>
                </td>

                <td align="center">
                    <strong style="font-size:14px;">55</strong>
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>West</strong>
                </td>

                <td align="center">
                    <strong style="font-size:14px;">32</strong>
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
            </tr>
            <tr>
                <td align="center">
                    <strong>Central</strong>
                </td>

                <td align="center">
                    <strong style="font-size:14px;">33</strong>
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
                <td align="center">
                    -
                </td>
            </tr>
            
        </table>
javascript html web-scraping
1个回答
1
投票

我想回到9月1日,然后将这些数据抓取到Google表格中。

有一些方法可以让您获取“ 回到未来” /过去以获取信息。

  1. 请询问网站所有者(政府机构)以发布过时的数据。可以启用API访问。
  2. 每天抓取并将数据保存到数据库中,过一会儿您将积累过去的数据以供需要。

    在检查器中操作JavaScript

  3. 您无需“在[浏览器]检查器*中操纵JS即可快速抓取。Scraper,Google Chrome插件非常适合将浏览器信息快速抓取到Google表格中。

© www.soinside.com 2019 - 2024. All rights reserved.