Elasticsearch权威指南：聚合基础入门与实践-CSDN博客

本文链接：https://blog.csdn.net/gitblog_00805/article/details/148524878

Elasticsearch权威指南：聚合基础入门与实践

什么是聚合？

聚合（Aggregation）是Elasticsearch中强大的数据分析功能，它允许我们对数据进行分组、统计和计算各种指标。与SQL中的GROUP BY类似，但功能更加强大和灵活。

准备示例数据

为了更好地理解聚合，我们先创建一个汽车交易数据的示例。这些数据包含以下字段：

price：汽车售价
color：汽车颜色
make：汽车制造商
sold：销售日期

POST /cars/transactions/_bulk
{ "index": {}}
{ "price" : 10000, "color" : "red", "make" : "honda", "sold" : "2014-10-28" }
{ "index": {}}
{ "price" : 20000, "color" : "red", "make" : "honda", "sold" : "2014-11-05" }
{ "index": {}}
{ "price" : 30000, "color" : "green", "make" : "ford", "sold" : "2014-05-18" }
{ "index": {}}
{ "price" : 15000, "color" : "blue", "make" : "toyota", "sold" : "2014-07-02" }
{ "index": {}}
{ "price" : 12000, "color" : "green", "make" : "toyota", "sold" : "2014-08-19" }
{ "index": {}}
{ "price" : 20000, "color" : "red", "make" : "honda", "sold" : "2014-11-05" }
{ "index": {}}
{ "price" : 80000, "color" : "red", "make" : "bmw", "sold" : "2014-01-01" }
{ "index": {}}
{ "price" : 25000, "color" : "blue", "make" : "ford", "sold" : "2014-02-12" }

第一个聚合查询：统计汽车颜色分布

假设我们是汽车经销商，想知道哪种颜色的汽车最受欢迎。我们可以使用terms聚合来实现：

GET /cars/transactions/_search
{
    "size" : 0,
    "aggs" : {
        "popular_colors" : {
            "terms" : {
              "field" : "color"
            }
        }
    }
}

关键参数解析：

size: 0：表示我们不关心具体的搜索结果，只想要聚合结果，这样可以提高查询效率
aggs（或aggregations）：聚合操作的顶层参数
popular_colors：我们为这个聚合指定的自定义名称
terms：表示我们要对字段值进行分组统计
field: "color"：指定我们要聚合的字段

查询结果分析

返回结果会类似这样：

{
   "hits": {
      "hits": []
   },
   "aggregations": {
      "popular_colors": {
         "buckets": [
            {
               "key": "red",
               "doc_count": 4
            },
            {
               "key": "blue",
               "doc_count": 2
            },
            {
               "key": "green",
               "doc_count": 2
            }
         ]
      }
   }
}

结果解读：