I have a ArgoCD Cluster, which is deploying lots of apps to AWS EKS, One of the Metric which I was asked to derive was the Avg. time in seconds it takes for a Job to reconcile in ArgoCD
We have native integration enabled with Datadog, I have these 3 metrics avalaible
argocd.app_controller.app.reconcile.bucket
argocd.app_controller.app.reconcile.sum
argocd.app_controller.app.reconcile.count
I created this Dashboard in Datadog, not sure if I did it correctly (or) not, please provide your insights on the same
Raw JSON of the Queries used
{
"title": "Reconciliation Count Aggregate",
"type": "timeseries",
"requests": [
{
"formulas": [
{
"alias": "Reconciliation Time Avg Rolling",
"number_format": {
"unit": {
"type": "canonical_unit",
"unit_name": "minute"
}
},
"formula": "query2 / query1"
}
],
"queries": [
{
"data_source": "metrics",
"name": "query2",
"query": "sum:argocd.app_controller.app.reconcile.sum{$region,$cluster_name}.as_rate().rollup(sum, 14400)"
},
{
"data_source": "metrics",
"name": "query1",
"query": "sum:argocd.app_controller.app.reconcile.count{$region,$cluster_name}.as_rate().rollup(sum, 14400)"
}
],
"response_format": "timeseries",
"style": {
"palette": "green",
"order_reverse": false,
"line_type": "solid",
"line_width": "normal"
},
"display_type": "bars"
}
],
"markers": []
}