使用 R 抓取篮球参考页面上的多个表格

问题描述 投票:0回答:1

我正在尝试在篮球参考球员网页上抓取多个表格(使用 R 中的 rvest 包)但遇到了一些问题。我们以这个网页为例:https://www.basketball-reference.com/players/w/whitede01/gamelog/2023。我想为“2022-23 常规赛”和“2023 年季后赛”争取积分。出于某种原因,相同类型的 rvest 代码适用于第一个但不适用于第二个。这是显示此内容的代码:

# First load packages
library(tidyverse)
library(rvest)

# This code works
"https://www.basketball-reference.com/players/w/whitede01/gamelog/2023" %>%
  read_html() %>%
  html_nodes(xpath = '//*[@id="content"]/div[5]') %>%
  html_table(fill = TRUE, header = FALSE) %>%
  .[[1]]

# This code doesn't work
"https://www.basketball-reference.com/players/w/whitede01/gamelog/2023" %>%
  read_html() %>%
  html_nodes(xpath = '//*[@id="content"]/div[6]') %>%
  html_table(fill = TRUE, header = FALSE) %>%
  .[[1]] 

# And this xpath doesn't make it work eitehr:
'//*[@id="all_pgl_basic_playoffs"]'

有什么方法可以使第二个(季后赛)表的刮擦起作用吗?感谢您的帮助。

r rvest
1个回答
0
投票

@margusl 的评论是正确的。

理想情况下,您可以使用代码中的 xml 路径直接访问具有 id

all_pgl_basic_playoffs
.

的表来访问该表

但是,在执行 JavaScript 函数之前,该表不存在。如果您使用移动设备,您会看到表格不会自动填充到您的浏览器中。您必须先展开表格。必须发生的是,一旦您告诉表展开,JavaScript 函数就会执行,然后用 id

all_pgl_basic_playoffs
.

填充该表

谢天谢地,表的内容存储为

div<id="placeholder"></div>
下方的注释。评论以
<!--
开始,以
-->
结束。

对于这个特定的表,这里是评论:

<!--

<div class="table_container" id="div_pgl_basic_playoffs">
    
    <table class="row_summable sortable stats_table" id="pgl_basic_playoffs" data-cols-to-freeze=",3">
    <caption>2023 Playoffs Table</caption>
    

   <colgroup><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col><col></colgroup>
   <thead>      
      <tr>
         <th aria-label="Rk" data-stat="ranker" scope="col" class="ranker poptip sort_default_asc show_partial_when_sorting center" data-tip="Rank" >Rk</th>
         <th aria-label="G" data-stat="game_season" scope="col" class=" poptip center" data-tip="Season Game" >G</th>
         <th aria-label="Date" data-stat="date_game" scope="col" class=" poptip sort_default_asc center" >Date</th>
         <th aria-label="Age" data-stat="age" scope="col" class=" poptip sort_default_asc center" data-tip="Player's age on February 1 of the season" >Age</th>
         <th aria-label="Tm" data-stat="team_id" scope="col" class=" poptip sort_default_asc center" data-tip="Team" >Tm</th>
         <th aria-label="&nbsp;" data-stat="game_location" scope="col" class=" poptip center" >&nbsp;</th>
         <th aria-label="Opp" data-stat="opp_id" scope="col" class=" poptip sort_default_asc center" data-tip="Opponent" >Opp</th>
         <th aria-label="&nbsp;" data-stat="game_result" scope="col" class=" poptip center" >&nbsp;</th>
         <th aria-label="Games Started" data-stat="gs" scope="col" class=" poptip center" data-tip="Games Started" >GS</th>
         <th aria-label="Minutes Played" data-stat="mp" scope="col" class=" poptip center" data-tip="Minutes Played" >MP</th>
         <th aria-label="Field Goals" data-stat="fg" scope="col" class=" poptip center" data-tip="Field Goals" >FG</th>
         <th aria-label="FGA" data-stat="fga" scope="col" class=" poptip center" data-tip="Field Goal Attempts" >FGA</th>
         <th aria-label="Field Goal Percentage" data-stat="fg_pct" scope="col" class=" poptip center" data-tip="Field Goal Percentage" >FG%</th>
         <th aria-label="3-Point Field Goals" data-stat="fg3" scope="col" class=" poptip center" data-tip="3-Point Field Goals" >3P</th>
         <th aria-label="3-Point Field Goal Attempts" data-stat="fg3a" scope="col" class=" poptip center" data-tip="3-Point Field Goal Attempts" >3PA</th>
         <th aria-label="3-Point Field Goal Percentage" data-stat="fg3_pct" scope="col" class=" poptip center" data-tip="3-Point Field Goal Percentage" >3P%</th>
         <th aria-label="Free Throws" data-stat="ft" scope="col" class=" poptip center" data-tip="Free Throws" >FT</th>
         <th aria-label="Free Throw Attempts" data-stat="fta" scope="col" class=" poptip center" data-tip="Free Throw Attempts" >FTA</th>
         <th aria-label="Free Throw Percentage" data-stat="ft_pct" scope="col" class=" poptip center" data-tip="Free Throw Percentage" >FT%</th>
         <th aria-label="Offensive Rebounds" data-stat="orb" scope="col" class=" poptip center" data-tip="Offensive Rebounds" >ORB</th>
         <th aria-label="Defensive Rebounds" data-stat="drb" scope="col" class=" poptip center" data-tip="Defensive Rebounds" >DRB</th>
         <th aria-label="Total Rebounds" data-stat="trb" scope="col" class=" poptip center" data-tip="Total Rebounds" >TRB</th>
         <th aria-label="Assists" data-stat="ast" scope="col" class=" poptip center" data-tip="Assists" >AST</th>
         <th aria-label="Steals" data-stat="stl" scope="col" class=" poptip center" data-tip="Steals" >STL</th>
         <th aria-label="Blocks" data-stat="blk" scope="col" class=" poptip center" data-tip="Blocks" >BLK</th>
         <th aria-label="Turnovers" data-stat="tov" scope="col" class=" poptip center" data-tip="Turnovers" >TOV</th>
         <th aria-label="Personal Fouls" data-stat="pf" scope="col" class=" poptip center" data-tip="Personal Fouls" >PF</th>
         <th aria-label="Points" data-stat="pts" scope="col" class=" poptip center" data-tip="Points" >PTS</th>
         <th aria-label="GmSc" data-stat="game_score" scope="col" class=" poptip center" data-tip="Game Score" >GmSc</th>
         <th aria-label="Plus/Minus" data-stat="plus_minus" scope="col" class=" poptip right" data-tip="Plus/Minus" >+/-</th>
      </tr>
      </thead>
<tbody><tr id="pgl_basic_playoffs.34" ><th scope="row" class="right " data-stat="ranker" csk="1" >1</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-04-15&is_playoff_game=Y" data-stat="game_season" ><strong>1</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202304150BOS.html">2023-04-15</a></td><td class="right " data-stat="age" >28-287</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/ATL/2023.html">ATL</a></td><td class="center " data-stat="game_result" csk="13" >W (+13)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="2310" >38:30</td><td class="right " data-stat="fg" >7</td><td class="right " data-stat="fga" >13</td><td class="right " data-stat="fg_pct" >.538</td><td class="right " data-stat="fg3" >4</td><td class="right " data-stat="fg3a" >7</td><td class="right " data-stat="fg3_pct" >.571</td><td class="right " data-stat="ft" >6</td><td class="right " data-stat="fta" >7</td><td class="right " data-stat="ft_pct" >.857</td><td class="right " data-stat="orb" >1</td><td class="right " data-stat="drb" >4</td><td class="right " data-stat="trb" >5</td><td class="right " data-stat="ast" >7</td><td class="right iz" data-stat="stl" >0</td><td class="right " data-stat="blk" >2</td><td class="right " data-stat="tov" >2</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >24</td><td class="right " data-stat="game_score" >23.1</td><td class="right " data-stat="plus_minus" >+13</td></tr>
<tr id="pgl_basic_playoffs.35" ><th scope="row" class="right " data-stat="ranker" csk="2" >2</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-04-18&is_playoff_game=Y" data-stat="game_season" ><strong>2</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202304180BOS.html">2023-04-18</a></td><td class="right " data-stat="age" >28-290</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/ATL/2023.html">ATL</a></td><td class="center " data-stat="game_result" csk="13" >W (+13)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="2025" >33:45</td><td class="right " data-stat="fg" >11</td><td class="right " data-stat="fga" >16</td><td class="right " data-stat="fg_pct" >.688</td><td class="right " data-stat="fg3" >2</td><td class="right " data-stat="fg3a" >5</td><td class="right " data-stat="fg3_pct" >.400</td><td class="right " data-stat="ft" >2</td><td class="right " data-stat="fta" >2</td><td class="right " data-stat="ft_pct" >1.000</td><td class="right iz" data-stat="orb" >0</td><td class="right " data-stat="drb" >7</td><td class="right " data-stat="trb" >7</td><td class="right " data-stat="ast" >2</td><td class="right " data-stat="stl" >1</td><td class="right " data-stat="blk" >3</td><td class="right " data-stat="tov" >3</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >26</td><td class="right " data-stat="game_score" >22.4</td><td class="right " data-stat="plus_minus" >+2</td></tr>
<tr id="pgl_basic_playoffs.36" ><th scope="row" class="right " data-stat="ranker" csk="3" >3</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-04-21&is_playoff_game=Y" data-stat="game_season" ><strong>3</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202304210ATL.html">2023-04-21</a></td><td class="right " data-stat="age" >28-293</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center " data-stat="game_location" >@</td><td class="left " data-stat="opp_id" ><a href="/teams/ATL/2023.html">ATL</a></td><td class="center " data-stat="game_result" csk="-8" >L (-8)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1808" >30:08</td><td class="right " data-stat="fg" >4</td><td class="right " data-stat="fga" >7</td><td class="right " data-stat="fg_pct" >.571</td><td class="right " data-stat="fg3" >3</td><td class="right " data-stat="fg3a" >4</td><td class="right " data-stat="fg3_pct" >.750</td><td class="right iz" data-stat="ft" >0</td><td class="right iz" data-stat="fta" >0</td><td class="right iz" data-stat="ft_pct" ></td><td class="right iz" data-stat="orb" >0</td><td class="right " data-stat="drb" >2</td><td class="right " data-stat="trb" >2</td><td class="right " data-stat="ast" >4</td><td class="right " data-stat="stl" >1</td><td class="right iz" data-stat="blk" >0</td><td class="right iz" data-stat="tov" >0</td><td class="right " data-stat="pf" >5</td><td class="right " data-stat="pts" >11</td><td class="right " data-stat="game_score" >10.1</td><td class="right " data-stat="plus_minus" >-14</td></tr>
<tr id="pgl_basic_playoffs.37" ><th scope="row" class="right " data-stat="ranker" csk="4" >4</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-04-23&is_playoff_game=Y" data-stat="game_season" ><strong>4</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202304230ATL.html">2023-04-23</a></td><td class="right " data-stat="age" >28-295</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center " data-stat="game_location" >@</td><td class="left " data-stat="opp_id" ><a href="/teams/ATL/2023.html">ATL</a></td><td class="center " data-stat="game_result" csk="8" >W (+8)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1891" >31:31</td><td class="right " data-stat="fg" >7</td><td class="right " data-stat="fga" >14</td><td class="right " data-stat="fg_pct" >.500</td><td class="right " data-stat="fg3" >4</td><td class="right " data-stat="fg3a" >9</td><td class="right " data-stat="fg3_pct" >.444</td><td class="right iz" data-stat="ft" >0</td><td class="right iz" data-stat="fta" >0</td><td class="right iz" data-stat="ft_pct" ></td><td class="right iz" data-stat="orb" >0</td><td class="right " data-stat="drb" >2</td><td class="right " data-stat="trb" >2</td><td class="right " data-stat="ast" >2</td><td class="right iz" data-stat="stl" >0</td><td class="right iz" data-stat="blk" >0</td><td class="right " data-stat="tov" >2</td><td class="right " data-stat="pf" >3</td><td class="right " data-stat="pts" >18</td><td class="right " data-stat="game_score" >9.8</td><td class="right " data-stat="plus_minus" >+7</td></tr>
<tr id="pgl_basic_playoffs.38" ><th scope="row" class="right " data-stat="ranker" csk="5" >5</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-04-25&is_playoff_game=Y" data-stat="game_season" ><strong>5</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202304250BOS.html">2023-04-25</a></td><td class="right " data-stat="age" >28-297</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/ATL/2023.html">ATL</a></td><td class="center " data-stat="game_result" csk="-2" >L (-2)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1882" >31:22</td><td class="right " data-stat="fg" >7</td><td class="right " data-stat="fga" >11</td><td class="right " data-stat="fg_pct" >.636</td><td class="right " data-stat="fg3" >2</td><td class="right " data-stat="fg3a" >5</td><td class="right " data-stat="fg3_pct" >.400</td><td class="right " data-stat="ft" >2</td><td class="right " data-stat="fta" >2</td><td class="right " data-stat="ft_pct" >1.000</td><td class="right iz" data-stat="orb" >0</td><td class="right " data-stat="drb" >5</td><td class="right " data-stat="trb" >5</td><td class="right " data-stat="ast" >4</td><td class="right iz" data-stat="stl" >0</td><td class="right " data-stat="blk" >2</td><td class="right iz" data-stat="tov" >0</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >18</td><td class="right " data-stat="game_score" >18.4</td><td class="right " data-stat="plus_minus" >+9</td></tr>
<tr id="pgl_basic_playoffs.39" ><th scope="row" class="right " data-stat="ranker" csk="6" >6</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-04-27&is_playoff_game=Y" data-stat="game_season" ><strong>6</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202304270ATL.html">2023-04-27</a></td><td class="right " data-stat="age" >28-299</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center " data-stat="game_location" >@</td><td class="left " data-stat="opp_id" ><a href="/teams/ATL/2023.html">ATL</a></td><td class="center " data-stat="game_result" csk="8" >W (+8)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1786" >29:46</td><td class="right " data-stat="fg" >3</td><td class="right " data-stat="fga" >9</td><td class="right " data-stat="fg_pct" >.333</td><td class="right " data-stat="fg3" >1</td><td class="right " data-stat="fg3a" >5</td><td class="right " data-stat="fg3_pct" >.200</td><td class="right iz" data-stat="ft" >0</td><td class="right iz" data-stat="fta" >0</td><td class="right iz" data-stat="ft_pct" ></td><td class="right " data-stat="orb" >1</td><td class="right " data-stat="drb" >1</td><td class="right " data-stat="trb" >2</td><td class="right " data-stat="ast" >2</td><td class="right iz" data-stat="stl" >0</td><td class="right " data-stat="blk" >1</td><td class="right " data-stat="tov" >1</td><td class="right " data-stat="pf" >3</td><td class="right " data-stat="pts" >7</td><td class="right " data-stat="game_score" >2.8</td><td class="right " data-stat="plus_minus" >-1</td></tr>
<tr id="pgl_basic_playoffs.40" ><th scope="row" class="right " data-stat="ranker" csk="7" >7</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-01&is_playoff_game=Y" data-stat="game_season" ><strong>7</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305010BOS.html">2023-05-01</a></td><td class="right " data-stat="age" >28-303</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="-4" >L (-4)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1628" >27:08</td><td class="right " data-stat="fg" >1</td><td class="right " data-stat="fga" >5</td><td class="right " data-stat="fg_pct" >.200</td><td class="right iz" data-stat="fg3" >0</td><td class="right " data-stat="fg3a" >4</td><td class="right iz" data-stat="fg3_pct" >.000</td><td class="right " data-stat="ft" >2</td><td class="right " data-stat="fta" >2</td><td class="right " data-stat="ft_pct" >1.000</td><td class="right iz" data-stat="orb" >0</td><td class="right " data-stat="drb" >2</td><td class="right " data-stat="trb" >2</td><td class="right " data-stat="ast" >3</td><td class="right " data-stat="stl" >1</td><td class="right iz" data-stat="blk" >0</td><td class="right " data-stat="tov" >2</td><td class="right " data-stat="pf" >2</td><td class="right " data-stat="pts" >4</td><td class="right " data-stat="game_score" >1.8</td><td class="right " data-stat="plus_minus" >-15</td></tr>
<tr id="pgl_basic_playoffs.41" ><th scope="row" class="right " data-stat="ranker" csk="8" >8</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-03&is_playoff_game=Y" data-stat="game_season" ><strong>8</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305030BOS.html">2023-05-03</a></td><td class="right " data-stat="age" >28-305</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="34" >W (+34)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1793" >29:53</td><td class="right " data-stat="fg" >5</td><td class="right " data-stat="fga" >9</td><td class="right " data-stat="fg_pct" >.556</td><td class="right " data-stat="fg3" >3</td><td class="right " data-stat="fg3a" >6</td><td class="right " data-stat="fg3_pct" >.500</td><td class="right " data-stat="ft" >2</td><td class="right " data-stat="fta" >2</td><td class="right " data-stat="ft_pct" >1.000</td><td class="right " data-stat="orb" >1</td><td class="right iz" data-stat="drb" >0</td><td class="right " data-stat="trb" >1</td><td class="right " data-stat="ast" >1</td><td class="right iz" data-stat="stl" >0</td><td class="right iz" data-stat="blk" >0</td><td class="right iz" data-stat="tov" >0</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >15</td><td class="right " data-stat="game_score" >11.7</td><td class="right " data-stat="plus_minus" >+23</td></tr>
<tr id="pgl_basic_playoffs.42" ><th scope="row" class="right " data-stat="ranker" csk="9" >9</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-05&is_playoff_game=Y" data-stat="game_season" ><strong>9</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305050PHI.html">2023-05-05</a></td><td class="right " data-stat="age" >28-307</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center " data-stat="game_location" >@</td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="12" >W (+12)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1948" >32:28</td><td class="right " data-stat="fg" >4</td><td class="right " data-stat="fga" >9</td><td class="right " data-stat="fg_pct" >.444</td><td class="right " data-stat="fg3" >2</td><td class="right " data-stat="fg3a" >5</td><td class="right " data-stat="fg3_pct" >.400</td><td class="right " data-stat="ft" >3</td><td class="right " data-stat="fta" >3</td><td class="right " data-stat="ft_pct" >1.000</td><td class="right " data-stat="orb" >1</td><td class="right " data-stat="drb" >5</td><td class="right " data-stat="trb" >6</td><td class="right " data-stat="ast" >2</td><td class="right iz" data-stat="stl" >0</td><td class="right iz" data-stat="blk" >0</td><td class="right " data-stat="tov" >1</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >13</td><td class="right " data-stat="game_score" >10.5</td><td class="right " data-stat="plus_minus" >+5</td></tr>
<tr id="pgl_basic_playoffs.43" ><th scope="row" class="right " data-stat="ranker" csk="10" >10</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-07&is_playoff_game=Y" data-stat="game_season" ><strong>10</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305070PHI.html">2023-05-07</a></td><td class="right " data-stat="age" >28-309</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center " data-stat="game_location" >@</td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="-1" >L (-1)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="1843" >30:43</td><td class="right " data-stat="fg" >5</td><td class="right " data-stat="fga" >11</td><td class="right " data-stat="fg_pct" >.455</td><td class="right " data-stat="fg3" >2</td><td class="right " data-stat="fg3a" >4</td><td class="right " data-stat="fg3_pct" >.500</td><td class="right iz" data-stat="ft" >0</td><td class="right iz" data-stat="fta" >0</td><td class="right iz" data-stat="ft_pct" ></td><td class="right " data-stat="orb" >1</td><td class="right " data-stat="drb" >1</td><td class="right " data-stat="trb" >2</td><td class="right iz" data-stat="ast" >0</td><td class="right iz" data-stat="stl" >0</td><td class="right " data-stat="blk" >1</td><td class="right iz" data-stat="tov" >0</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >12</td><td class="right " data-stat="game_score" >7.6</td><td class="right " data-stat="plus_minus" >-8</td></tr>
<tr id="pgl_basic_playoffs.44" ><th scope="row" class="right " data-stat="ranker" csk="11" >11</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-09&is_playoff_game=Y" data-stat="game_season" ><strong>11</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305090BOS.html">2023-05-09</a></td><td class="right " data-stat="age" >28-311</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="-12" >L (-12)</td><td class="right " data-stat="gs" >1</td><td class="right " data-stat="mp" csk="2018" >33:38</td><td class="right " data-stat="fg" >2</td><td class="right " data-stat="fga" >6</td><td class="right " data-stat="fg_pct" >.333</td><td class="right iz" data-stat="fg3" >0</td><td class="right " data-stat="fg3a" >2</td><td class="right iz" data-stat="fg3_pct" >.000</td><td class="right " data-stat="ft" >3</td><td class="right " data-stat="fta" >3</td><td class="right " data-stat="ft_pct" >1.000</td><td class="right " data-stat="orb" >2</td><td class="right " data-stat="drb" >3</td><td class="right " data-stat="trb" >5</td><td class="right iz" data-stat="ast" >0</td><td class="right iz" data-stat="stl" >0</td><td class="right iz" data-stat="blk" >0</td><td class="right iz" data-stat="tov" >0</td><td class="right " data-stat="pf" >1</td><td class="right " data-stat="pts" >7</td><td class="right " data-stat="game_score" >5.5</td><td class="right " data-stat="plus_minus" >-15</td></tr>
<tr id="pgl_basic_playoffs.45" ><th scope="row" class="right " data-stat="ranker" csk="12" >12</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-11&is_playoff_game=Y" data-stat="game_season" ><strong>12</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305110PHI.html">2023-05-11</a></td><td class="right " data-stat="age" >28-313</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center " data-stat="game_location" >@</td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="9" >W (+9)</td><td class="right iz" data-stat="gs" >0</td><td class="right " data-stat="mp" csk="1152" >19:12</td><td class="right " data-stat="fg" >3</td><td class="right " data-stat="fga" >6</td><td class="right " data-stat="fg_pct" >.500</td><td class="right " data-stat="fg3" >3</td><td class="right " data-stat="fg3a" >5</td><td class="right " data-stat="fg3_pct" >.600</td><td class="right iz" data-stat="ft" >0</td><td class="right iz" data-stat="fta" >0</td><td class="right iz" data-stat="ft_pct" ></td><td class="right iz" data-stat="orb" >0</td><td class="right " data-stat="drb" >2</td><td class="right " data-stat="trb" >2</td><td class="right " data-stat="ast" >1</td><td class="right " data-stat="stl" >1</td><td class="right " data-stat="blk" >1</td><td class="right " data-stat="tov" >1</td><td class="right iz" data-stat="pf" >0</td><td class="right " data-stat="pts" >9</td><td class="right " data-stat="game_score" >8.0</td><td class="right " data-stat="plus_minus" >-7</td></tr>
<tr id="pgl_basic_playoffs.46" ><th scope="row" class="right " data-stat="ranker" csk="13" >13</th><td class="right endpoint tooltip" data-endpoint="/players/pgl_cum_stats.cgi?player=whitede01&year=2023&date_game=2023-05-14&is_playoff_game=Y" data-stat="game_season" ><strong>13</strong></td><td class="left " data-stat="date_game" ><a href="/boxscores/202305140BOS.html">2023-05-14</a></td><td class="right " data-stat="age" >28-316</td><td class="left " data-stat="team_id" ><a href="/teams/BOS/2023.html">BOS</a></td><td class="center iz" data-stat="game_location" ></td><td class="left " data-stat="opp_id" ><a href="/teams/PHI/2023.html">PHI</a></td><td class="center " data-stat="game_result" csk="24" >W (+24)</td><td class="right iz" data-stat="gs" >0</td><td class="right " data-stat="mp" csk="746" >12:26</td><td class="right " data-stat="fg" >1</td><td class="right " data-stat="fga" >4</td><td class="right " data-stat="fg_pct" >.250</td><td class="right " data-stat="fg3" >1</td><td class="right " data-stat="fg3a" >2</td><td class="right " data-stat="fg3_pct" >.500</td><td class="right iz" data-stat="ft" >0</td><td class="right iz" data-stat="fta" >0</td><td class="right iz" data-stat="ft_pct" ></td><td class="right iz" data-stat="orb" >0</td><td class="right iz" data-stat="drb" >0</td><td class="right iz" data-stat="trb" >0</td><td class="right iz" data-stat="ast" >0</td><td class="right iz" data-stat="stl" >0</td><td class="right " data-stat="blk" >1</td><td class="right iz" data-stat="tov" >0</td><td class="right " data-stat="pf" >3</td><td class="right " data-stat="pts" >3</td><td class="right " data-stat="game_score" >0.1</td><td class="right " data-stat="plus_minus" >-8</td></tr>

</table>


</div>
-->

所以你可以使用xml标签访问评论

comment()
.

然后您可以解析评论以将其视为现有的 html。这将使您可以访问 ` 节点。然后,您可以从那里提取该表节点的内容。

这是适合我的调整后的代码:

box::use(
    rvest[...],
    magrittr[`%>%`]
    readr[...]
)

"https://www.basketball-reference.com/players/w/whitede01/gamelog/2023" %>%
read_html() %>%
  html_nodes(xpath = '//body//div[@id="wrap"]//div[@id="content"]/div[6]/div[@id="all_pgl_basic_playoffs"]//comment()')  %>%
  html_text() %>%    # extract comment text
    paste(collapse = '') %>%    # collapse to single string
    read_html() %>%    # reread as HTML
    html_node('table#pgl_basic_playoffs') %>%    # select desired node
    html_table() %>%    # parse node to table
    { setNames(.[-1, ], paste0(names(.), .[1, ])) } %>%    # extract names from first row
    type_convert()

此外,可能更有帮助的是,在编写 xml 路径时,我倾向于尽可能具体一些,以确保我没有访问错误的表或错误的 div。有些人更不擅长给不同的东西赋予相同的 id 或类名。

© www.soinside.com 2019 - 2024. All rights reserved.