唯一条件计数

jobtbby3 于 2022-10-06 发布在 ElasticSearch

关注(0)|答案(1)|浏览(198)

我正在尝试获取包含和排除条件的唯一计数。假设我希望在place等于london而不是paris时获得每组ID的计数。下面是同一索引中不同文档的示例。

[
  {
    "groupId": 123,
    "place": "london"
  },
  {
    "groupId": 123,
    "place": "berlin"
  },
  {
    "groupId": 456,
    "place": "london"
  },
  {
    "groupId": 789,
    "place": "london"
  },
  {
    "groupId": 789,
    "place": "paris"
  },
  {
    "groupId": 789,
    "place": "berlin"
  },
  {
    "groupId": ABC,
    "place": "tokyo"
  }
]

输出应类似于：

[
  {
    "groupId": 123,
    "count": "1"
  },
  {
    "groupId": 456,
    "count": "1"
  }
]

不包括"groupId": 789，因为有一个place是paris，不包括"groupId": "ABC"，因为它没有任何london

elasticsearch

来源：https://stackoverflow.com/questions/73880965/unique-conditional-count

1条答案

按热度按时间

0vvn1miw1#

我使用了以下聚合

1. Terms aggregation

2. filter aggregation

3. Bucket selector

在查询部分，我已经过滤了位置为伦敦或巴黎的文档。这是为了通过删除不具有这两个属性的文档来提高性能。例如，“groupID”：“abc”

在Aggregation部分，我对groupId执行了GROUP BY，然后使用过滤器Aggregation计算了它下面的伦敦和巴黎的计数。使用存储桶选择器，我只保存了伦敦计数至少为1、巴黎计数为零的那些组ID。

查询

{
    "query": {
    "bool": {
      "filter": [
        {
          "terms": {
            "place": [
              "london",
              "paris"
            ]
          }
        }
      ]
    }
  },
  "size": 0,
  "aggs": {
    "groups": {
      "terms": {
        "field": "groupId",
        "size": 10
      },
      "aggs": {
        "london": {
          "filter": {
            "term": {
              "place": "london"
            }
          }
        },
        "paris": {
          "filter": {
            "term": {
              "place": "paris"
            }
          }
        },
        "buvket": {
          "bucket_selector": {
            "buckets_path": {
              "paris_count": "paris>_count",
              "london_count": "london>_count"
            },
            "script": "params.paris_count==0 && params.london_count>=1"
          }
        }
      }
    }
  }
}

结果

"aggregations" : {
    "groups" : {
      "doc_count_error_upper_bound" : 0,
      "sum_other_doc_count" : 0,
      "buckets" : [
        {
          "key" : 123,
          "doc_count" : 1,
          "paris" : {
            "doc_count" : 0
          },
          "london" : {
            "doc_count" : 1
          }
        },
        {
          "key" : 456,
          "doc_count" : 1,
          "paris" : {
            "doc_count" : 0
          },
          "london" : {
            "doc_count" : 1
          }
        }
      ]
    }
  }

赞(0）回复(0）举报 2022-10-06

我来回答

唯一条件计数

1条答案

相关问题

热门标签

最新问答