(转)es进行聚合操作时提示Fielddata is disabled on text fields by default
根据es官网的文档执行
1 2 3 4 5 6 7 8 | GET /megacorp/employee/_search { "aggs" : { "all_interests" : { "terms" : { "field" : "interests" } } } } |
这个例子时,报错
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 | { "error" : { "root_cause" : [ { "type" : "illegal_argument_exception" , "reason" : "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory." } ], "type" : "search_phase_execution_exception" , "reason" : "all shards failed" , "phase" : "query" , "grouped" : true , "failed_shards" : [ { "shard" : 0 , "index" : "megacorp" , "node" : "-Md3f007Q3G6HtdnkXoRiA" , "reason" : { "type" : "illegal_argument_exception" , "reason" : "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory." } } ], "caused_by" : { "type" : "illegal_argument_exception" , "reason" : "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory." } }, "status" : 400 } |
搜了一下应该是5.x后对排序,聚合这些操作用单独的数据结构(fielddata)缓存到内存里了,需要单独开启,官方解释在此fielddata
简单来说就是在聚合前执行如下操作
1 2 3 4 5 6 7 8 9 | PUT megacorp/_mapping/employee/ { "properties" : { "interests" : { "type" : "text" , "fielddata" : true } } } |
PS:执行上面操作前,先GET megacorp/_mapping/employee/查看mapping结构,然后执行上述命令,贴一下我聚合logstash读取tomcat.log到es里cilentip字段的步骤:
1.首先先GET logstash-apacheaccesslog*/_mapping/logs/查看mapping结构
PUT logstash-apacheaccesslog*/_mapping/logs/
{
"properties": {
"verb": {
"type": "text",
"norms": false,
"fielddata": true
}
}
}
2、对clientip字段进行聚合
【推荐】国内首个AI IDE,深度理解中文开发场景,立即下载体验Trae
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步
· .NET Core 中如何实现缓存的预热?
· 从 HTTP 原因短语缺失研究 HTTP/2 和 HTTP/3 的设计差异
· AI与.NET技术实操系列:向量存储与相似性搜索在 .NET 中的实现
· 基于Microsoft.Extensions.AI核心库实现RAG应用
· Linux系列:如何用heaptrack跟踪.NET程序的非托管内存泄露
· TypeScript + Deepseek 打造卜卦网站:技术与玄学的结合
· 阿里巴巴 QwQ-32B真的超越了 DeepSeek R-1吗?
· 【译】Visual Studio 中新的强大生产力特性
· 张高兴的大模型开发实战:(一)使用 Selenium 进行网页爬虫
· 【设计模式】告别冗长if-else语句:使用策略模式优化代码结构
2014-09-25 HDU3549:Flow Problem(最大流入门EK)