Paginating results from a REST API?

Question · Votes: 1 · Answers: 2

I'm using the following code to pull a JSON file from a URL:

options NOQUOTELENMAX;

filename usage "/folders/myfolders/sasuser.v94/usage.json";

%let AccessKey = reallylongstring;

proc http
 url="https://a.url"
 method="GET" out=usage;
 headers 
   "Authorization"="Bearer &AccessKey.";
run;

libname usage json "/folders/myfolders/sasuser.v94/usage.json";

data usage;
 set usage.data;
run;

proc print data=usage noobs;
run;   

But the response now contains more than 1,000 results, so I need to check the nextLink property somehow?

In .NET (PowerShell) I could use something like this:

$usageRest = Invoke-RestMethod -Uri $usageUrl -Headers $authHeaders -Method Get

while ($null -ne $usageRest.nextLink) {
    $usageRest = Invoke-RestMethod -Uri $usageRest.nextLink -Headers $authHeaders -Method Get
}

Is there something like this for proc http in SAS?

If it helps, here is a tree view of the actual JSON:

[image: tree view of the JSON response]

So far I have tried a quick and dirty version:

libname usage1 JSON fileref=resp1;

data x;
set usage1.root;
call symputx('nextLink',nextLink);
run;

proc http
url="%superq(nextLink)"
method="GET" out=resp2;
headers 
   "Authorization"="Bearer &AccessKey.";
run;

libname usage2 JSON fileref=resp2;

data y;
set usage2.root;
call symputx('nextLink',nextLink);
run;

proc http
url="%superq(nextLink)"
method="GET" out=resp3;
headers 
   "Authorization"="Bearer &AccessKey.";
run;

libname usage3 JSON fileref=resp3;

data z;
    set usage3.root;
    call symputx('nextLink',nextLink);
run;

Overview of the libraries:

[image: overview of the libraries]

Sample of usage1.data:

[image: sample of usage1.data]

Sample of work.x:

[image: sample of work.x]

Thanks

sas
2 answers
1 vote

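The question seems clearer now. The process can be wrapped in a macro so that Proc HTTP is invoked repeatedly inside a %do loop. Each page of data can be appended to a single data set that grows with every page retrieved.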

Untested

%macro getThatPagedJsonData (url=, accesskey=, out=);

  %local page guard;

  %let page = 1;
  %* just in case - prevent infinite/excessive looping during development/testing;
  %let guard = 50;

  filename response temp;

  %do %until (%length(%superq(url)) eq 0 or &page > &guard);

    %* prior response will be overwritten;
    proc http url="%superq(url)" method="GET" out=response;
      headers "Authorization"="Bearer &AccessKey.";
    run;

    %* magic json engine;
    libname page JSON fileref=response;

    %if &page = 1 %then %do;
      %* first page starts the output data set;
      data &out;
        set page.data;
      run;
    %end;
    %else %do;
      %* append subsequent pages of data;
      proc append base=&out data=page.data;
      run;
    %end;

    %* track number of pages processed;
    %let page = %eval (&page + 1);

    %* reset url for the until test;
    %let url=;
    %* fetch the nextlink as the url for the next iteration of the loop;
    %* url stays empty, which ends the loop, when the last page has no nextlink;
    data _null_;
      set page.root;
      if not missing(nextlink) then call symputx('url', nextlink);
    run;

    %* clear libname, releasing any locks on the json response file;
    libname page clear;
  %end;

%mend;

%getThatPagedJsonData (url=...., accesskey=...., out=serviceAggreements);
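The macro above notes that error handling may be needed. One possible approach, sketched here and not tested against the real service, is to inspect the HTTP status after each request before parsing the response. It assumes a SAS release that sets the automatic macro variables SYS_PROCHTTP_STATUS_CODE and SYS_PROCHTTP_STATUS_PHRASE (SAS 9.4M5 or later); the macro name checkHttpStatus and the fileref resp are hypothetical, and the URL and access key are the placeholders from the question.

```sas
/* placeholder values taken from the question */
%let AccessKey = reallylongstring;
filename resp temp;

proc http url="https://a.url" method="GET" out=resp;
  headers "Authorization"="Bearer &AccessKey.";
run;

/* hypothetical helper: report the outcome of the most recent PROC HTTP call */
%macro checkHttpStatus;
  %if not %symexist(SYS_PROCHTTP_STATUS_CODE) %then
    %put NOTE: status macro variables are not available in this SAS release.;
  %else %if &SYS_PROCHTTP_STATUS_CODE ne 200 %then
    %put ERROR: request failed: &SYS_PROCHTTP_STATUS_CODE &SYS_PROCHTTP_STATUS_PHRASE;
  %else
    %put NOTE: request succeeded.;
%mend;

%checkHttpStatus
```

Inside the paging loop, a similar test could clear the url macro variable when the status is not 200, so that the loop stops instead of parsing an error response.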

2 votes

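For contexts where a resolved macro variable must appear inside double quotes, such as the url= option value, place %superq inside the double quotes: "%superq(<macro-var-name>)"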

Try

… 
url = "%superq(nextLink)"

If the link is still troublesome you can try

url = "%qsysfunc(urlencode(%superq(nextLink)))"

Invoking a macro once per row of a control data set

There are many ways, such as

  • Stack the 1,000 invocations via data _null_; call execute(…
  • Proc SQL select nextLink into :link1- to create 1,000 macro variables
  • Open the data set with %let ds=%sysfunc(open(… and loop over the 1,000 rows with %do %while (%sysfunc(fetch(&ds)) = 0)

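Sample code for the second bullet item:

This example passes a macro variable from the "looping" macro to the "worker" macro. When you pass the macro variable's name, you do not have to resolve the macro variable and quote it on its way into the worker. Instead, the worker receives the macro variable name (also known as the symbol name) and lets %superq resolve it in a quoted manner for use in source code generation. Essentially, passing a macro variable name is similar to the pass-by-reference concept of traditional languages.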

data have;
  do linknum = 1 to 25;
    link = cats("place=", byte(64+linknum),'&extra="zoom=',linknum,'"&key=MYAPIKEY');
    output;
  end;
run;


%macro processLink(link_mvar=);
  %put url="%sysfunc(urlencode(%superq(&link_mvar)))";
%mend;

%macro processLinks (data=);
  proc sql;
    reset noprint;
    select link into :link1- from &data;
  quit;

  %local i;
  %do i = 1 %to &sqlobs;

    %* pass name of macro variable to macro;
    %processLink (link_mvar=link&i);

  %end;
%mend;

%processLinks(data=have)
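
The first bullet item (stacking the invocations with call execute) might look like the following sketch. It is an illustration and has not been run against a real service: the macro name showLink is hypothetical, the control data set mirrors the have data set above with fewer rows, and the worker only prints the encoded link where a real one would call Proc HTTP.

```sas
/* small control data set, modeled on the example above */
data have;
  do linknum = 1 to 3;
    link = cats("place=", byte(64+linknum), '&key=MYAPIKEY');
    output;
  end;
run;

/* hypothetical worker: receives the NAME of a macro variable */
%macro showLink(link_mvar=);
  %put url="%qsysfunc(urlencode(%superq(&link_mvar)))";
%mend;

data _null_;
  set have;
  /* store each link in its own global macro variable: link1, link2, ... */
  call symputx(cats('link', _n_), link, 'G');
  /* %nrstr delays execution of the macro until after this data step ends */
  call execute(cats('%nrstr(%showLink)(link_mvar=link', _n_, ')'));
run;
```

Wrapping the macro name in %nrstr keeps the macro from executing while the data step is still running, so each stacked call runs afterwards, when all of the link macro variables already exist.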