我想做什么:
我指的是本页的第二和第三表。不幸的是,它仅显示您正在查看页面的“一天”的数据。我希望能够回到9月1日,然后将这些数据抓取到Google表格中。我只需要第二张表中的1小时PM2.5浓度水平和第三张表中的3小时PSI读数-但是如果我能得到全部,那就更好了。
问题
除非能以某种方式操纵访问日期,否则我看不到任何访问数据的方法。这是一个政府机构,我想他们正在用庞大的电子表格运行它,但抓取该数据的机会几乎是不可能的。
代码
<script type="text/javascript">
var typebaseUrl = '/anti-pollution-radiation-protection/air-pollution-control/psi/pollutant-concentrations/time/0100';
function changetype(ddl) {
var strTime = ddl.options[ddl.selectedIndex].value;
if (strTime != null) {
var npage = typebaseUrl + "/type/" + strTime + "#pollutant";
window.location = npage;
}
}
</script>
<h1 id="pollutant">
1-hr PM<sub>2.5</sub> (µg/m<sup>3</sup>)<sup>b</sup> Readings on 28 Oct 2015
</h1>
<p>
View reading for:
<select name="ctl00$ContentPlaceHolderTitle$C005$DDLType" id="ContentPlaceHolderTitle_C005_DDLType" class="default" onchange="changetype(this);">
<option value="SulphurDioxide">24-hr Sulphur Dioxide</option>
<option value="PM10">24-hr PM10</option>
<option value="PM25">24-hr PM2.5</option>
<option value="Ozone">8-hr Ozone</option>
<option value="CarbonMonoxide">8-hr Carbon Monoxide</option>
<option selected="selected" value="PM25-1Hr">1-hr PM2.5</option>
<option value="NitrogenDioxide">1-hr Nitrogen Dioxide</option>
</select>
</p>
<div>
<table width="100%" border="0" cellpadding="4" cellspacing="1" class="noalter">
<tr class="even">
<td width="16%" align="center">
<strong>Time</strong>
</td>
<td width="7%" align="center"><strong>1am</strong>
</td>
<td width="7%" align="center"><strong>2am</strong>
</td>
<td width="7%" align="center"><strong>3am</strong>
</td>
<td width="7%" align="center"><strong>4am</strong>
</td>
<td width="7%" align="center"><strong>5am</strong>
</td>
<td width="7%" align="center"><strong>6am</strong>
</td>
<td width="7%" align="center"><strong>7am</strong>
</td>
<td width="7%" align="center"><strong>8am</strong>
</td>
<td width="7%" align="center"><strong>9am</strong>
</td>
<td width="7%" align="center"><strong>10am</strong>
</td>
<td width="7%" align="center"><strong>11am</strong>
</td>
<td width="7%" align="center"><strong>12pm</strong>
</td>
</tr>
<tr>
<td align="center">
<strong>North</strong>
</td>
<td align="center">
87
</td>
<td align="center">
52
</td>
<td align="center">
49
</td>
<td align="center">
70
</td>
<td align="center">
61
</td>
<td align="center">
62
</td>
<td align="center">
16
</td>
<td align="center">
5
</td>
<td align="center">
7
</td>
<td align="center">
8
</td>
<td align="center">
12
</td>
<td align="center">
17
</td>
</tr>
<tr>
<td align="center">
<strong>South</strong>
</td>
<td align="center">
66
</td>
<td align="center">
63
</td>
<td align="center">
53
</td>
<td align="center">
63
</td>
<td align="center">
72
</td>
<td align="center">
51
</td>
<td align="center">
5
</td>
<td align="center">
3
</td>
<td align="center">
6
</td>
<td align="center">
23
</td>
<td align="center">
30
</td>
<td align="center">
27
</td>
</tr>
<tr>
<td align="center">
<strong>East</strong>
</td>
<td align="center">
55
</td>
<td align="center">
68
</td>
<td align="center">
70
</td>
<td align="center">
47
</td>
<td align="center">
60
</td>
<td align="center">
37
</td>
<td align="center">
4
</td>
<td align="center">
5
</td>
<td align="center">
7
</td>
<td align="center">
8
</td>
<td align="center">
18
</td>
<td align="center">
28
</td>
</tr>
<tr>
<td align="center">
<strong>West</strong>
</td>
<td align="center">
49
</td>
<td align="center">
40
</td>
<td align="center">
48
</td>
<td align="center">
58
</td>
<td align="center">
51
</td>
<td align="center">
46
</td>
<td align="center">
12
</td>
<td align="center">
4
</td>
<td align="center">
2
</td>
<td align="center">
10
</td>
<td align="center">
9
</td>
<td align="center">
24
</td>
</tr>
<tr>
<td align="center">
<strong>Central</strong>
</td>
<td align="center">
52
</td>
<td align="center">
64
</td>
<td align="center">
47
</td>
<td align="center">
44
</td>
<td align="center">
54
</td>
<td align="center">
42
</td>
<td align="center">
9
</td>
<td align="center">
6
</td>
<td align="center">
2
</td>
<td align="center">
9
</td>
<td align="center">
18
</td>
<td align="center">
18
</td>
</tr>
<tr class="even">
<td width="16%" align="center">
<strong>Time</strong>
</td>
<td width="7%" align="center"><strong>1pm</strong>
</td>
<td width="7%" align="center"><strong>2pm</strong>
</td>
<td width="7%" align="center"><strong>3pm</strong>
</td>
<td width="7%" align="center"><strong>4pm</strong>
</td>
<td width="7%" align="center"><strong>5pm</strong>
</td>
<td width="7%" align="center"><strong>6pm</strong>
</td>
<td width="7%" align="center"><strong>7pm</strong>
</td>
<td width="7%" align="center"><strong>8pm</strong>
</td>
<td width="7%" align="center"><strong>9pm</strong>
</td>
<td width="7%" align="center"><strong>10pm</strong>
</td>
<td width="7%" align="center"><strong>11pm</strong>
</td>
<td width="7%" align="center"><strong>12am</strong>
</td>
</tr>
<tr>
<td align="center">
<strong>North</strong>
</td>
<td align="center">
<strong style="font-size:14px;">33</strong>
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
</tr>
<tr>
<td align="center">
<strong>South</strong>
</td>
<td align="center">
<strong style="font-size:14px;">35</strong>
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
</tr>
<tr>
<td align="center">
<strong>East</strong>
</td>
<td align="center">
<strong style="font-size:14px;">55</strong>
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
</tr>
<tr>
<td align="center">
<strong>West</strong>
</td>
<td align="center">
<strong style="font-size:14px;">32</strong>
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
</tr>
<tr>
<td align="center">
<strong>Central</strong>
</td>
<td align="center">
<strong style="font-size:14px;">33</strong>
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
<td align="center">
-
</td>
</tr>
</table>
我想回到9月1日,然后将这些数据抓取到Google表格中。
有一些方法可以让您获取“ 回到未来” /过去以获取信息。
每天抓取并将数据保存到数据库中,过一会儿您将积累过去的数据以供需要。
在检查器中操作JavaScript
您无需“在[浏览器]检查器*中操纵JS即可快速抓取。Scraper,Google Chrome插件非常适合将浏览器信息快速抓取到Google表格中。