Questions tagged [google-cloud-ai]

49 questions
6
votes
1 answer

Why does this fail with "'async for' requires an object with __aiter__ method, got coroutine"

I am attempting to call an external API (one that is provided by Google) using an async client library (the library is also provided by Google). The async method that I am attempting to call is async list_featurestores() (documentation). It provides…
jamiet
  • 10,501
  • 14
  • 80
  • 159
6
votes
1 answer

Authenticating standalone gsutil in containers in Cloud ML Engine on Kubernetes with Workload Identity

I'm launching container images on Google Cloud AI Training (Cloud ML Engine) Inside those containers I need to use gsutil. Some containers have gsutil. In that case I can use it right away without any authentication steps. Some containers do not…
Ark-kun
  • 6,358
  • 2
  • 34
  • 70
4
votes
1 answer

How to schedule repeated runs of a custom training job in Vertex AI

I have packaged my training code as a python package and then am able to run it as a custom training job on Vertex AI. Now, I wanted to be able to schedule this job to run, say every 2 weeks, and re-train the model. The Scheduling settings in the…
4
votes
3 answers

google ai platform vs ml engine

I did lots of search, but I cannot understand what the difference between google ai platform and ml engine. It seems that both of them can be used for training and deploying models. Other words like google-cloud-automl, google ai hub are also very…
3
votes
1 answer

google-cloud/aiplatform vertex AI predictionserviceclient truncated response NodeJS

I am trying to get the aiplatform client working on a NodeJS project, It seems to work, I mean the credentials are fine, and I get a "valid" response back. But the predictions' content comes truncated (using curl I get the complete list) just the…
3
votes
0 answers

Custom Model for Batch Prediction on Vertex.ai

I want to run batch predictions inside Google Cloud's vertex.ai using a custom trained model. I was able to find documentation to get online prediction working with a custom built docker image by setting up an endpoint, but I can't seem to find any…
3
votes
0 answers

Google Cloud AI Platform training job: --stream-logs stalls indefinitely with no output

I am submitting a training job with gcloud ai-platform jobs submit training [...] --stream-logs The job is submitted successfully, but no logs appear on the terminal. No output after "Job [...] submitted successfully." appears, but the command…
Anakhand
  • 2,838
  • 1
  • 22
  • 50
3
votes
0 answers

Google Cloud GPU all region quota increase request getting rejected

I have created my Google cloud account and by default the GPU quota for all region is 0. So I have submitted multiple requests for quota increase but they haven’t approved even one. I tried contacting the sales stuff but even they rejected the…
2
votes
0 answers

`steps_per_epoch` in google ai platform multi-worker distributed training

I'm training model with tensorflow==2.7.0 distributively on gcloud ai platform. I'm using ParameterServerStrategy strategy, with multiple workers. One thing I'm confused, and couldn't find answer, is how to properly set number of steps each worker…
2
votes
1 answer

Is there a way to have users access my Google Cloud AI Platform Notebook Instance easily?

I am new to Google Cloud AI Platform. I am running a AI Platform Notebook Instance with a service account. I have successfully added another user to my project, but it seems too involved for new users just to launch our shared notebook. I have…
2
votes
2 answers

Op type not registered \'IO>BigQueryClient\' with BigQuery connector on AI platform

I'm trying to parallelize the training step of my model with tensorflow ParameterServerStrategy. I work with GCP AI Platform to create the cluster and launch the task. As my dataset is huge, I use the bigquery tensorflow connector included in…
Harold G
  • 183
  • 8
2
votes
1 answer

how to set signature name to serving_default to avoid errors after deployment to GCP AI Platform

I trained a DNN model with Tensorflow on AI Platform. Then I copied the model locally to double check if preditions can be obtained from the samed model. gcloud ai-platform local predict…
Amine Jallouli
  • 3,919
  • 8
  • 36
  • 73
2
votes
1 answer

Getting 'Create Version failed. Bad model detected with error' on AI platform when trying to create a custom model on Google Cloud AI platform

I am trying to deploy custom model on AI platform. I have followed the steps as mentioned in the Google Document: https://cloud.google.com/ai-platform/prediction/docs/deploying-models#global-endpoint. The saved model is stored in Google Cloud…
2
votes
0 answers

How to solve 5xx errors in GCP AI Prediction Platform?

We have been able to deploy models (both custom prediction and Tensorflow SavedModel formats) to AI Prediction Platform, and basic testing shows things are at least functional for online predictions. We are now trying to load test a bit before…
2
votes
0 answers

How to fix: "error": "Prediction failed: unknown error." in custom prediction routine with scikit-learn?

I am trying to write a custom prediction routine on Google's AI Platform using scikit-learn's MLPClassifier. I have packaged and deployed the model successfully, but when I request online predictions via gcloud ai-platform predict, the error I get…
1
2 3 4