Questions tagged [autopilot]

45 questions
5
votes
0 answers

Does GCP Kubernetes Autopilot support zero downtime update?

First of all, I use a GCP k8s standard cluster, which works fine. I am a bit interested in Autopilot, so I tried to deploy the same pods on a k8s Autopilot cluster. The thing is, I got many 502 errors during rolling updates of pods implemented in Go Gin and…
Takash Futada
4
votes
1 answer

K8s version 1.21 GKE autopilot - no kind "CronJob" is registered for version "batch/v1"

We are having a problem installing a CronJob using a Helm chart on a GKE Autopilot cluster. (When we install the same Helm chart on a Standard GKE cluster with the same GKE version, the installation works perfectly.) GKE version: 1.21.5-gke.1302 My…
4
votes
0 answers

How to set custom DNS in GKE Autopilot

I run Composer 2 in Autopilot mode and I need to set up a custom DNS server for some domains in GKE Autopilot. In GKE Standard (non-Autopilot) I can simply change the kube-dns ConfigMap to add stubDomains: { "domain.com": ["9.9.9.9"] } but…
4
votes
2 answers

How to configure GKE Autopilot w/Envoy & gRPC-Web

I have an application running on my local machine that uses React -> gRPC-Web -> Envoy -> Go app and everything runs with no problems. I'm trying to deploy this using GKE Autopilot and I just haven't been able to get the configuration right. I'm new…
Renee Revis
3
votes
1 answer

Google Cloud Quota Miscalculation Preventing Kubernetes Pods from Scaling

I am currently facing an issue with a Kubernetes configuration on my cluster running in Google Kubernetes Engine in Autopilot mode in the us-west1 region. The configuration requires 40 replicas, each with a CPU limit of 1000m. I have an Nginx load…
3
votes
2 answers

With GKE Autopilot banning the cluster-autoscaler.kubernetes.io/safe-to-evict=false annotation, is there a way to ensure job pods do not get evicted?

Our GKE Autopilot cluster was recently upgraded to version 1.21.6-gke.1503, which apparently causes the cluster-autoscaler.kubernetes.io/safe-to-evict=false annotation to be banned. I totally get this for deployments, as Google doesn't want a…
PLPeeters
3
votes
2 answers

Getting error while configuring an IP masquerade agent for the Autopilot cluster

We are getting the error below while configuring an IP masquerade agent for the Autopilot cluster: error: failed to create configmap: configmaps is forbidden: User "abc@gmail.com" cannot create resource "configmaps" in API group "" in the namespace…
Aqap Test
2
votes
1 answer

GKE Autopilot with HPA

The steps I use to set up a custom metrics HPA work on a standard GKE cluster but do not work on an Autopilot one. I use a custom-metrics-stackdriver-adapter to implement an HPA based on the number of unacknowledged Pub/Sub messages. In both cases…
Mauro Gentile
1
vote
1 answer

Facing scaling issues in autopilot cluster

I'm facing scaling issues in a GKE Autopilot cluster. I get the error "node scale up failed:pod is at risk of not being scheduled". I see this issue only on the Autopilot cluster. I tried basic troubleshooting steps, but the issue is still not resolved.
1
vote
1 answer

gke autopilot not collecting metrics for datadog

I'm using the Datadog Helm chart. I've got the exact same installation in two different clusters, and both clusters have the exact same configuration. In one of them I'm collecting custom metrics, plus cluster metrics. In the other one I'm attempting…
caeus
1
vote
1 answer

What is the Nvidia driver version of GKE Autopilot nodes?

How do I find the driver version of the node in Autopilot? I need the 525 driver version on the node - but I suspect it's 470. Is there a way to specify a nodeSelector to provision nodes with 525 version of the driver?
GRS
1
vote
2 answers

Prevent pod preemption

We have an autopilot cluster in GKE. Sometimes, our pods simply get terminated, with no explanation. We suspect that k8s is preempting our pods - we only have one DAG running on a daily schedule in this cluster, but it tries to run a number of tasks…
FrustratedWithFormsDesigner
1
vote
1 answer

Cannot pull container image to GKE Autopilot from private Artifact Registry even though they are in the same project

According to the articles below, it seems we can pull a container image to GKE from Artifact Registry without any additional authentication when these are in the same…
1
vote
1 answer

GKE Autopilot Pod permission error when trying to publish message on GCP Pub/Sub

I have a GKE Autopilot cluster with one service that publishes a message to a GCP Pub/Sub topic for testing. I have created a KSA for the deployment and have used GCP Workload Identity Management to authorize the KSA to act as a GCP Service Account (GSA).…
1
vote
0 answers

metrics-server on gke autopilot

I am using a GKE Autopilot cluster, and I am trying to get Prometheus metrics for the CPU and memory usage of my pods/containers. As far as I know, metrics-server is installed by default on any GKE cluster. I can also see it in the pods list in the…
alonana