Hi All,
We are working on finding duplicate accounts and automating the process with workflows.
Using the request body below, we are able to extract the duplicate accounts in the tenant.
{
  "query": {
    "query": "*"
  },
  "indices": [
    "identities"
  ],
  "aggregationsDsl": {
    "accounts": {
      "nested": {
        "path": "accounts"
      },
      "aggs": {
        "source_name": {
          "terms": {
            "field": "accounts.source.name.exact",
            "min_doc_count": 2,
            "size": 1000
          },
          "aggs": {
            "identities": {
              "terms": {
                "field": "_id",
                "min_doc_count": 2,
                "size": 1000
              },
              "aggs": {
                "accounts": {
                  "top_hits": {}
                }
              }
            }
          }
        }
      }
    }
  }
}
This returns too much data, and the workflow fails with a timeout error. Is there a way to exclude the hits data from the output?
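
One idea we have not tested yet is to drop the innermost top_hits sub-aggregation, so that each bucket would carry only its key and document count. The Python sketch below only builds that slimmer body from the one above. The helper name drop_top_hits and the variable slim_body are hypothetical, and it is an assumption on our side that the extra data comes from top_hits and that the search API accepts the body without it.

```python
import json


def drop_top_hits(aggs):
    """Return a copy of an aggregation mapping without top_hits sub-aggregations."""
    cleaned = {}
    for name, definition in aggs.items():
        if "top_hits" in definition:
            continue  # skip aggregations that return full hit documents
        definition = dict(definition)  # shallow copy so the input is not modified
        if "aggs" in definition:
            nested = drop_top_hits(definition["aggs"])
            if nested:
                definition["aggs"] = nested
            else:
                del definition["aggs"]  # no sub-aggregations left at this level
        cleaned[name] = definition
    return cleaned


# Same body as in the post above.
body = {
    "query": {"query": "*"},
    "indices": ["identities"],
    "aggregationsDsl": {
        "accounts": {
            "nested": {"path": "accounts"},
            "aggs": {
                "source_name": {
                    "terms": {
                        "field": "accounts.source.name.exact",
                        "min_doc_count": 2,
                        "size": 1000,
                    },
                    "aggs": {
                        "identities": {
                            "terms": {
                                "field": "_id",
                                "min_doc_count": 2,
                                "size": 1000,
                            },
                            "aggs": {"accounts": {"top_hits": {}}},
                        }
                    },
                }
            },
        }
    },
}

# Hypothetical slimmer body: identical except that top_hits is removed.
slim_body = dict(body, aggregationsDsl=drop_top_hits(body["aggregationsDsl"]))
print(json.dumps(slim_body, indent=2))
```

If this assumption holds, the response should contain only the source name buckets and, under each, the identity buckets with their counts. We would still like to know whether there is a supported way to suppress the hits.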
Thanks,
Abhinov