查询 .msg 电子邮件文件时,如何从 Azure AI 搜索中获取提取的文本?

问题描述 投票:0回答:1

我正在尝试访问在搜索 Azure 存储 blob 容器中的 .msg 电子邮件文件时匹配的电子邮件正文/文本。我可以使用以下方式获取发件人、收件人、主题和发送日期:

元数据内容类型 元数据消息来源 来自电子邮件的元数据消息 元数据消息发送至 元数据消息到电子邮件 元数据消息抄送 元数据_消息_抄送_电子邮件 元数据消息_密件抄送 元数据_消息_密件抄送_电子邮件 元数据创建日期 元数据最后修改时间 元数据_主题

记录于此:https://learn.microsoft.com/en-us/azure/search/search-blob-metadata-properties

如何检索匹配的正文和附件文本?

我可以添加其他字段到我的索引和/或索引器中吗?

我尝试过以下领域:

{
    "name": "email-msg-index",  
    "fields": [
        {"name": "ID", "type": "Edm.String", "key": true, "searchable": false},
        {"name": "metadata_subject", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_content_type", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_from", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_from_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_to", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_to_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_cc", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_cc_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_bcc", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_message_bcc_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_creation_date", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false},
        {"name": "metadata_last_modified", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}
    ]
}

indexing indexer azure-ai-search
1个回答
0
投票

我能够使用以下索引获取内容:

{ "name": "email-msg-index",   "fields": [ {"name": "ID", "type": "Edm.String", "key": true, "searchable": false}, {"name": "metadata_subject", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_content_type", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_from", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_from_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_to", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_to_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_cc", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_cc_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_bcc", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_message_bcc_email", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_creation_date", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "metadata_last_modified", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": true, "facetable": false}, {"name": "content", "type": "Edm.String", "searchable": true, "filterable": false, "sortable": false, "facetable": false} ] } 

使用的索引器是:

{ "@odata.context": "https://<servicename>.search.windows.net/$metadata#indexers/$entity", "@odata.etag": "\"0x000000000000000\"", "name": "emailindexer", "dataSourceName": "email-blob-datasource", "targetIndexName": "email-msg-index", "parameters": { "configuration": { "indexedFileNameExtensions": ".msg", "dataToExtract": "contentAndMetadata", "parsingMode": "default" } } }

查询是: { “搜索”:“{{搜索}}”, “选择”:“metadata_subject、metadata_creation_date、metadata_message_from、metadata_message_to、内容”, “searchFields”:“元数据主题”, “计数”:真实 }

© www.soinside.com 2019 - 2024. All rights reserved.