Questions tagged [vertex-ai-pipeline]

31 questions
23
votes
9 answers

Sudden ImportError: cannot import name 'appengine' from 'requests.packages.urllib3.contrib' error on pipeline

My pipelines and schedulers were running smoothly without any problems. After I went out to lunch, I changed the number of epochs a neural network would run, saved the .yaml file again and left it in the bucket named "budgetff". Afterwards,…
2
votes
1 answer

Vertex AI Pipeline quota aiplatform.googleapis.com/restricted_image_training_tpu_v3_pod

I'm getting started with creating a tuned model. I've got my training data in a .jsonl file, uploaded to a bucket, everything checks out. I've run the tuning 3 times and every time it fails on step…
2
votes
1 answer

Call Vertex AI Endpoint with Node.js - Error: 3 INVALID_ARGUMENT: Failed to parse input instances

I am calling a customized model (Two Towers for retrieval) in Vertex AI endpoint with Node.js with the following code: const axios = require('axios'); const { GoogleAuth } = require('google-auth-library'); const project = '1234567'; const…
2
votes
1 answer

Why is Kubeflow on Vertex AI Pipelines not storing metadata for a dataset artifact?

I am trying to pass metadata between Python function components by attaching it to an output artifact in a Vertex AI Kubeflow pipeline. From the documentation this seems straightforward, but try as I might I can't get it to work. I am trying to attach…
1
vote
0 answers

How to run multiple custom jobs at the same time in Vertex AI?

We run custom training jobs in Vertex AI. They are scheduled to run once a week using Airflow. These jobs are provisioned at the same time to Vertex AI but are running sequentially (one at a time). Each job takes around 10 minutes to run while the…
1
vote
1 answer

Google Cloud Vertex AI API not recognising PaLM model

I am currently trying to use Google's PaLM API from Python using the vertexai module. However, I run into an error where my code does not recognise the model name (the model is called 'chat-bison@001'). The code snippet of what I have tried doing…
1
vote
0 answers

Unable to open interactive shell for Vertex AI custom training job

It happens on a custom training job with tpu_v2 in us-central1. I followed the "launch web terminal" link under training debugging in the custom training job UI, but got the following message. I should have the necessary permissions as I started the custom…
1
vote
0 answers

How to implement custom evaluation jobs (with custom metrics for custom learning tasks) in Vertex AI

From what I can see in GCP's official docs (https://cloud.google.com/vertex-ai/docs/evaluation/introduction), an evaluation job comes with predefined metrics for predefined tasks. If I have a training task that's slightly more bespoke with metrics/…
1
vote
1 answer

Scheduling Vertex AI Pipeline - Error 503

I successfully trained and deployed a pipeline in Vertex AI using Kubeflow for a retrieval model. Now I want to schedule this pipeline to run every 8 minutes. Here's my code: from kfp.v2.google.client import AIPlatformClient api_client =…
1
vote
1 answer

Vertex AI pipeline run fails after passing the dsl.Condition==True

I have been working with Vertex AI pipelines for 3 months. I started with migrating projects from an old dedicated Kubernetes infrastructure to Vertex AI Python component-based pipelines. I created template pipelines, which were running fine until…
1
vote
1 answer

How to fix TypeError: to_dict() missing 1 required positional argument: 'self' on a dsl.component

I am creating a pipeline on Vertex AI Workbench using Kubeflow lightweight components. I start with a component to extract data from BigQuery and process it (and return a dataframe). This dataframe will then be fed to another…
1
vote
0 answers

How can I get ImageDatasetImportDataOp to update labels?

In a Vertex AI pipeline I am updating an image dataset, thus: ds_op = gcc_aip.ImageDatasetImportDataOp( project=project, dataset=get_dataset_id_op.outputs['dataset'], gcs_source=DATASET_PATH, …
0
votes
0 answers

Vertex AI Vision App Undeployment Fails because of state error

I built a Vertex AI Vision App via the interface and deployed it. As it was just for testing, I want to undeploy and delete it now, but I always get the following error when I try to undeploy it: error message. I tried to undeploy it via the interface…
0
votes
1 answer

Using GCSFuse vs NFS share for custom training on Vertex AI

We are currently using GCS Fuse with Google Cloud Storage during our training and are seeing very slow performance. The bug seems to be on Google's side, and they are actively working on the Fuse bug. I was wondering if someone has tried setting up an NFS…
0
votes
0 answers

VertexAI online prediction with timeseries data sliding windows

I already have a trained model (that does classification) that takes as input timeseries of shape [2500, 3]. I exported it to Vertex AI as a custom model and created a public endpoint from it. I can request it with other timeseries and get them…