Elasticsearch scrolling with NEST and C#

Question (votes: 0, answers: 1)

I am using the following code to loop/scroll through all documents in my Elasticsearch index:

const string indexName = "bla";
var client = GetClient(indexName);
const int scrollTimeout = 1000;

var initialResponse = client.Search<Document>
    (scr => scr.Index(indexName)
    .From(0)
    .Take(100)
    .MatchAll()
    .Scroll(scrollTimeout))
;

var results = new List<XYZ>();

if (!initialResponse.IsValid || string.IsNullOrEmpty(initialResponse.ScrollId))
    throw new Exception(initialResponse.ServerError.Error.Reason);

if (initialResponse.Documents.Any())
    results.AddRange(initialResponse.Documents);

var scrollid = initialResponse.ScrollId;
bool isScrollSetHasData = true;
while (isScrollSetHasData)
{
    var loopingResponse = client.Scroll<XYZ>(scrollTimeout, scrollid);

    if (loopingResponse.IsValid)
    {
        results.AddRange(loopingResponse.Documents);
        scrollid = loopingResponse.ScrollId;
    }
    isScrollSetHasData = loopingResponse.Documents.Any();

    // do some amazing stuff
}

client.ClearScroll(new ClearScrollRequest(scrollid));

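For some reason, loopingResponse comes back empty, i.e. the scroll finishes, much earlier than expected. Can anyone see anything fundamentally wrong with my code? Thanks!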

c# elasticsearch nest
1 answer
1 vote

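Looking at your code, I think the problem may be scrollTimeout. Scrolling is normally used to return large chunks of data, and 1000 ms is not enough to keep the search context alive between requests. Try increasing it to a few minutes and then find the value that works best for your case: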

var scrollTimeout = new Time(TimeSpan.FromMinutes(3));

Or, according to the source code, you can pass a string with a time unit (micros, nanos, ms, s, m, h and d):

var response = client.Search<Document>(scr => scr.Index(indexName)
    ...
    .Scroll("3m")
    );
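
For reference, here is a minimal sketch of the whole loop with the longer timeout applied. It assumes NEST 6.x or 7.x; the Document class and the ReadAll helper are placeholders and not part of the original code. The scroll timeout only has to cover the time spent on one batch, because every scroll request renews it. So if the per-batch work inside the loop is slow, that is the duration to size the timeout against.

```csharp
using System;
using System.Collections.Generic;
using Nest;

public static class ScrollExample
{
    // Placeholder POCO standing in for the real document type (assumption).
    public class Document { }

    // Hypothetical helper: reads every document of an index via the scroll API.
    public static List<Document> ReadAll(IElasticClient client, string indexName)
    {
        // Must be long enough to process ONE batch; each scroll call renews it.
        var scrollTimeout = new Time(TimeSpan.FromMinutes(3));
        var results = new List<Document>();

        var response = client.Search<Document>(s => s
            .Index(indexName)
            .Size(100)
            .MatchAll()
            .Scroll(scrollTimeout));

        string scrollId = null;
        try
        {
            while (true)
            {
                // Fail loudly instead of treating an expired context as "no more data".
                if (!response.IsValid)
                    throw new InvalidOperationException(
                        response.DebugInformation, response.OriginalException);

                scrollId = response.ScrollId;

                // An empty page means the scroll is exhausted.
                if (response.Documents.Count == 0)
                    break;

                results.AddRange(response.Documents);

                // Per-batch processing would go here.

                response = client.Scroll<Document>(scrollTimeout, scrollId);
            }
        }
        finally
        {
            // Release the search context on the server even if an error occurred.
            if (!string.IsNullOrEmpty(scrollId))
                client.ClearScroll(c => c.ScrollId(scrollId));
        }

        return results;
    }
}
```

Checking IsValid on every response before reading Documents matters here: an expired search context produces an invalid response with no documents, and a loop that only tests Documents.Any() will read that as a normal end of the scroll.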