I run Kube under EKS on AWS. I wish to alert when enough pod under from a deployement are in a failed phase. I do have prometheus, but I need the alerte to be in CloudWatch, therefore I am exporting the metric to CloudWatch thanks to the CW Agent.
I was planing into using the metric kube_pod_status_phase, and group it by Phase and a another label identifying my deployement.
But I realize that the kube_pod_status_phase metric only has Namespace/Pod/Phase and couple of other useless label in my case, so not enough for me to achieve what I need.
I see that with Prometheus PromQL I could make Join Query, which seem to solve my issue. But since I am using CloudWatch Metric, I can not use PromQL like this (or at least I do not know how).
Does anyone has a suggestion on how to solve this issue? How, with AWS CloudWatch, can I list for one specific deployement the list of pod in a failed state?