如何使用此代码中的RSelenium获取href元素?

问题描述 投票:0回答:1

我使用Rselenium获取URL,我不明白为什么不工作。

我的代码:

    links <- remDr$findElements(using = "xpath", value = "//*[@class 
    = 
    'item-info']")
     final_link <- data.frame(link = unlist(sapply(links, function(x) 
     {x$getElementAttribute('href')})))

页面代码:

<div class="item-info">
<p class="title product-field" translate="no" data-track-info=" 
{&quot;description&quot;:&quot;title&quot;}" data-force-track="">
       <a href="https://www.kobo.com/us/en/ebook/added-youth" 
 class="">Added Youth</a>
 </p>
 <p class="subtitle product-field" translate="no" data-track-info=" 
     {&quot;description&quot;:&quot;subtitle&quot;}" data-force-track="">
       <a href="https://www.kobo.com/us/en/ebook/added-youth" class="">All Natural Anti-Aging Program</a>
  </p>

预期的结果是链接,在这种情况下:https://www.kobo.com/us/en/ebook/added-youth

r rselenium
1个回答
0
投票

我的代码:

page <- read_html(paste0(urls1[[i]]))

links <- page %>% 
      html_nodes("a") %>%
      html_attr("href")
© www.soinside.com 2019 - 2024. All rights reserved.