How do I convert all the lists to data frames, append all the data frames, and export the result to CSV?
library(rvest)
webpage <- read_html("https://www.sec.gov/Archives/edgar/data/21665/000144530512000409/exhibit21.htm")
tbls <- html_nodes(webpage, "table")
tbls_ls <- html_table(tbls, fill = TRUE)
colnames(tbls_ls[[1]]) <- c("Name", "Country")
You can get all the tables from the link and bind them together with bind_rows:
library(rvest)
library(dplyr)
url <- "https://www.sec.gov/Archives/edgar/data/21665/000144530512000409/exhibit21.htm"
url %>%
  read_html() %>%
  html_table() %>%
  bind_rows()
However, the result needs a little cleanup and renaming, because the column names are not identified automatically.
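A minimal sketch of what that cleanup and the CSV export might look like. It does not hit the network: `tbls_ls` here is a small made-up stand-in for the list that `html_table()` returns, and it assumes the tables come back with the default names `X1` and `X2` and exactly two columns. The real page may differ, so adjust the `rename()` and `filter()` steps to whatever you actually see. The file name `subsidiaries.csv` is only illustrative.

```r
library(dplyr)

# Hypothetical stand-in for the list returned by html_table();
# header-less tables typically get default column names X1, X2.
tbls_ls <- list(
  data.frame(X1 = c("Name", "Company A"), X2 = c("Country", "Mexico"),
             stringsAsFactors = FALSE),
  data.frame(X1 = c("Company B", ""), X2 = c("Brazil", ""),
             stringsAsFactors = FALSE)
)

combined <- tbls_ls %>%
  bind_rows() %>%                      # append all tables row-wise
  rename(Name = X1, Country = X2) %>%  # assumes a two-column layout
  filter(Name != "", Name != "Name")   # drop blank rows and header rows

# Export to CSV; the file name is an assumption for illustration.
out_file <- file.path(tempdir(), "subsidiaries.csv")
write.csv(combined, out_file, row.names = FALSE)
```

With the real data, replace the hand-built `tbls_ls` with the list from `html_table()` and inspect `names(combined)` before renaming, since the number of columns can vary from table to table.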