Census batch geocoding API with source addresses in an R data frame (not a CSV)


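I'm trying to use the Census Bureau's batch geocoder (http://geocoding.geo.census.gov/geocoder/Geocoding_Services_API.pdf) with R. The input addresses are in a data frame, not a CSV file. Because the addresses are an intermediate step, I'd rather not write them to a CSV.

I've read several posts on Stack Overflow. Hadley's solution (posting to and receiving data from an API using httr in R) shows how to upload an existing CSV file. MrFlick's solution (uploading a csv to an api in R) seems close to what I want, but it uses a character string rather than a data frame.

Here is my code: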

#generate data frame of test addresses for this example
a = c(1, 2, 3) 
b = c("125 Worth Street", "258 Broadway", "8 Centre Street") 
c = rep("New York", 3) 
d = rep("NY", 3)
e = c("10013","10007","10007")
addresses = data.frame(a,b,c,d,e)

#names specified by API documentation
colnames(addresses) <- c("Unique ID","Street address","City","State","ZIP")

apiurl <- "http://geocoding.geo.census.gov/geocoder/geographies/addressbatch"

req <- POST(apiurl, body=list(
    addressFile = RCurl::fileUpload(
        filename = "test.csv", 
        contents = addresses
    ), 
    benchmark = "Public_AR_Census2010",
    vintage = "Census2010_Census2010"
    ), 
    encode="multipart"
)
stop_for_status(req)

Thanks in advance.

Tags: r geocoding httr

2 Answers

2 votes

If you're willing to write the data to a temporary file...

library("httr")

# Build a small data frame of test addresses
a = c(1, 2, 3)
b = c("125 Worth Street", "258 Broadway", "8 Centre Street")
c = rep("New York", 3)
d = rep("NY", 3)
e = c("10013","10007","10007")
addresses = data.frame(a,b,c,d,e)
colnames(addresses) <- c("Unique_ID","Street address","City","State","ZIP")
apiurl <- "http://geocoding.geo.census.gov/geocoder/geographies/addressbatch"

# Write the data frame to a temporary CSV file.
# Note: write.csv also writes the header row, which the geocoder treats as an address.
file <- tempfile(fileext = ".csv")
write.csv(addresses, file, row.names = FALSE)

# Upload the file as a multipart form and print the raw response text
req <- POST(apiurl, body=list(
    addressFile = upload_file(file),
    benchmark = "Public_AR_Census2010",
    vintage = "Census2010_Census2010"
  ),
  encode="multipart"
)
content(req, "text", encoding = "UTF-8")


#> [1] "\"3\",\"8 Centre Street, New York, NY, 10007\",\"Match\",\"Non_Exact\",\"8 Centre St, NEW YORK, NY, 10013\",\"-74.00442,40.712765\",\"59660429\",\"R\",\"36\",\"061\",\"002900\",\"4019\"\n\"2\",\"258 Broadway, New York, NY, 10007\",\"No_Match\"\n\"1\",\"125 Worth Street, New York, NY, 10013\",\"Match\",\"Exact\",\"125 Worth St, NEW YORK, NY, 10013\",\"-74.0027,40.715446\",\"59660405\",\"L\",\"36\",\"061\",\"003100\",\"1012\"\n\"Unique_ID\",\"Street address, City, State, ZIP\",\"No_Match\"\n"
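The response is a single string of CSV text, so it can be read back into a data frame without touching the disk. The following is a minimal sketch, assuming the response has the shape shown in the output above. The column names in result_cols and the helper parse_geocoder_response are illustrative labels chosen here, not names returned by the API. Rows with No_Match have fewer fields than matched rows, so the fill argument is used to pad them.

```r
# Sample response text: two rows copied from the output above.
resp_text <- paste0(
  "\"2\",\"258 Broadway, New York, NY, 10007\",\"No_Match\"\n",
  "\"1\",\"125 Worth Street, New York, NY, 10013\",\"Match\",\"Exact\",",
  "\"125 Worth St, NEW YORK, NY, 10013\",\"-74.0027,40.715446\",",
  "\"59660405\",\"L\",\"36\",\"061\",\"003100\",\"1012\"\n"
)

# Illustrative column labels (an assumption, not supplied by the API).
result_cols <- c("id", "input_address", "match", "match_type",
                 "matched_address", "lon_lat", "tiger_line_id", "side",
                 "state_fips", "county_fips", "tract", "block")

# Hypothetical helper: parse the geocoder's CSV text into a data frame.
parse_geocoder_response <- function(txt) {
  read.csv(
    text = txt,
    header = FALSE,              # the response has no header row
    col.names = result_cols,
    fill = TRUE,                 # pad short (No_Match) rows
    colClasses = "character",    # keep leading zeros such as "061"
    stringsAsFactors = FALSE
  )
}

results <- parse_geocoder_response(resp_text)

# Split the "longitude,latitude" field into numeric columns;
# rows without coordinates get NA.
parts <- strsplit(results$lon_lat, ",", fixed = TRUE)
results$lon <- vapply(parts, function(p) as.numeric(p[1]), numeric(1))
results$lat <- vapply(parts, function(p) as.numeric(p[2]), numeric(1))
```

With a live request, the same helper could be applied to content(req, "text", encoding = "UTF-8") in place of resp_text.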

0 votes

If anyone gets an error when trying any of this code, it is because the Census API switched from http to https. Change apiurl to use https and it should work.

library("httr")
a = c(1, 2, 3) 
b = c("125 Worth Street", "258 Broadway", "8 Centre Street") 
c = rep("New York", 3) 
d = rep("NY", 3)
e = c("10013","10007","10007")
addresses = data.frame(a,b,c,d,e)
colnames(addresses) <- c("Unique_ID","Street address","City","State","ZIP")
apiurl <- "https://geocoding.geo.census.gov/geocoder/geographies/addressbatch"
file <- tempfile(fileext = ".csv")
write.csv(addresses, file, row.names = FALSE)
req <- POST(apiurl, body=list(
    addressFile = upload_file(file), 
    benchmark = "Public_AR_Census2010",
    vintage = "Census2010_Census2010"
  ), 
  encode="multipart"
)
content(req, "text", encoding = "UTF-8")
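
The output in the first answer includes a No_Match row for "Unique_ID", which suggests the geocoder treats the CSV header line as an address. A possible refinement, sketched below, writes the temporary file without a header and wraps the request in a function that deletes the file afterwards. The function names write_address_file and geocode_batch are hypothetical, and the benchmark and vintage defaults are simply the values used above; the values the service accepts may change over time. The sketch assumes the httr package is installed.

```r
# Test addresses in the column order the batch geocoder expects.
addresses <- data.frame(
  c(1, 2, 3),
  c("125 Worth Street", "258 Broadway", "8 Centre Street"),
  rep("New York", 3),
  rep("NY", 3),
  c("10013", "10007", "10007"),
  stringsAsFactors = FALSE
)
colnames(addresses) <- c("Unique_ID", "Street address", "City", "State", "ZIP")

# Hypothetical helper: write the data frame to a temporary CSV file
# with no header row and return the file path.
write_address_file <- function(addresses) {
  path <- tempfile(fileext = ".csv")
  write.table(addresses, path, sep = ",", row.names = FALSE,
              col.names = FALSE, qmethod = "double")
  path
}

# Hypothetical wrapper (not part of httr or the Census API):
# upload the addresses and return the raw response text.
geocode_batch <- function(addresses,
                          benchmark = "Public_AR_Census2010",
                          vintage = "Census2010_Census2010",
                          apiurl = "https://geocoding.geo.census.gov/geocoder/geographies/addressbatch") {
  path <- write_address_file(addresses)
  on.exit(unlink(path), add = TRUE)  # remove the temporary file afterwards
  req <- httr::POST(
    apiurl,
    body = list(
      addressFile = httr::upload_file(path),
      benchmark = benchmark,
      vintage = vintage
    ),
    encode = "multipart"
  )
  httr::stop_for_status(req)         # raise an R error on HTTP failure
  httr::content(req, "text", encoding = "UTF-8")
}

# Requires network access, so the call is left commented out:
# response_text <- geocode_batch(addresses)
```

The temporary file still exists briefly, but the caller only ever handles the data frame and the returned text.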