我正在尝试从当前数据中解析/读取多个xml文件,并尝试将它们组合在一起。
我的示例xml文件是这样的:
<ApplicationResponse>
<Service Name="AlternativeCreditAttributes">
<Categories>
<Category Name="Default">
<Attributes>
<Attribute Name="ACA_ACH_NSF_12M" Value="0" />
<Attribute Name="ACA_ACH_NSF_18M" Value="0" />
<Attribute Name="ACA_ACH_NSF_24M" Value="0" />
<Attribute Name="ACA_ACH_NSF_3M" Value="0" />
<Attribute Name="ACA_ACH_NSF_6M" Value="0" />
<Attribute Name="ACA_ACH_NSF_9M" Value="0" />
<Attribute Name="ACA_ACH_NSF_AMT_12M" Value="" />
<Attribute Name="ACA_ACH_NSF_AMT_18M" Value="" />
<Attribute Name="ACA_ACH_NSF_AMT_24M" Value="" />
<Attribute Name="ACA_ACH_NSF_AMT_3M" Value="" />
<Attribute Name="ACA_ACH_NSF_AMT_6M" Value="" />
<Attribute Name="ACA_ACH_NSF_AMT_9M" Value="" />
<Attribute Name="ACA_ACH_NSF_AMT_EVER" Value="600" />
<Attribute Name="ACA_ACH_NSF_EVER" Value="2" />
<Attribute Name="ACA_ACH_NSF_MONTHS_SINCE_NEWEST" Value="41" />
<Attribute Name="ACA_ACH_NSF_MONTHS_SINCE_OLDEST" Value="41" />
</Attributes>
</Category>
</Categories>
</Service>
</ApplicationResponse>
我已根据以下代码成功提取了一个文件:
doc<-read_xml(Data$XMLResponse[1])
# setNames(data.frame(
cols<- xml_attr(xml_find_all(doc, "//Attribute"), "Name")
rows<- xml_attr(xml_find_all(doc, "//Attribute"), "Value")
# ),
out <- data.frame(rows, row.names = cols)
out
但是当我尝试使用lapply
来基于此answer提取多个文件时,我在工作目录上遇到了错误。
错误:“ NA”在当前工作目录中不存在
下面是我使用的代码。如果您知道此问题或需要有关此问题的任何详细信息,请告诉我。提前致谢。
df_list <- lapply(Data$XMLResponse, function(f) {
doc <- read_xml(f)
setNames(data.frame(
xml_attr(xml_find_all(doc, "//Attribute"), "Name"),
xml_attr(xml_find_all(doc, "//Attribute"), "Value")
),c("Name", f))
})
your_column_names<-xml_attr(xml_find_all(Data$XMLResponse[1], "//Attribute"), "Name")
doc<-setNames(data.frame(matrix(ncol = length(your_column_names), nrow = 0)), your_column_names)