Questions tagged [google-cloud-ml-engine]

Google Cloud ML Engine is a managed, scalable service for building and training machine learning models in the cloud. It is part of the Google Cloud suite of products, which includes TensorFlow, storage, BigQuery, Dataflow and others.

185 questions
15
votes
2 answers

Training and Predicting with instance keys

I am able to train my model and use ML Engine for prediction but my results don't include any identifying information. This works fine when submitting one row at a time for prediction but when submitting multiple rows I have no way of connecting the…
dobbysock1002
  • 907
  • 10
  • 15
15
votes
1 answer

Google Cloud - Compute Engine VS Machine Learning

Does anyone know the difference between using Google Cloud Machine Learning and a Virtual Machine instance in Google Compute Engine? I am using Keras with Python 3 and feel like GML is more restrictive (using Python 2.7, older…
13
votes
1 answer

Using Training TFRecords that are stored on Google Cloud

My goal is to use training data (format: TFRecords) stored on Google Cloud Storage when I run my TensorFlow training app locally. (Why locally? I am testing before I turn it into a training package for Cloud ML.) Based on this thread I shouldn't…
7hacker
  • 1,928
  • 3
  • 19
  • 32
8
votes
1 answer

tensorflow Found more than one graph event per run

I am loading TensorBoard for my ML Engine experiment that is running in local mode and got the following warning: "Found more than one graph event per run, or there was a metagraph containing a graph_def, as well as one or more graph events. …
8
votes
2 answers

Cannot resubmit job to ml-engine because "A job with this id already exists"

I am trying to submit a job to gcloud ml-engine. For reference, the job is using this sample provided by Google. It went through the first time, but with errors unrelated to this question, and now I am trying to reissue the command after having corrected…
slcott
  • 1,194
  • 3
  • 14
  • 26
7
votes
1 answer

reading files in google cloud machine learning

I tried to run tensorflow-wavenet on Google Cloud ML Engine with gcloud ml-engine jobs submit training, but the cloud job crashed when it was trying to read the JSON configuration file: with open(args.wavenet_params, 'r') as f: wavenet_params…
joaeba
  • 73
  • 1
  • 4
6
votes
1 answer

Deploy retrained inception SavedModel to google cloud ml engine

I am trying to deploy a retrained version of the Inception model on Google Cloud ML Engine. Gathering information from the SavedModel documentation, this reference, and this post of rhaertel80, I successfully exported my retrained model to a…
EffePi
  • 356
  • 2
  • 13
5
votes
2 answers

Google Cloud ML Engine GPU Utilization

If I am using --scale-tier BASIC_GPU within a Google Cloud ML Engine job, how can I view the GPU utilization? I am able to view CPU utilization and memory utilization on the "job details" tab, but I'm wondering how much the GPU is being utilized. Is…
reese0106
  • 2,011
  • 2
  • 16
  • 46
5
votes
1 answer

Loading pre-trained word2vec to initialise embedding_lookup in the Estimator model_fn

I am solving a text classification problem. I defined my classifier using the Estimator class with my own model_fn. I would like to use Google's pre-trained word2vec embedding as initial values and then further optimise it for the task at hand. I…
Robert Lacok
  • 4,176
  • 2
  • 26
  • 38
5
votes
2 answers

Convert graph (pb) to SavedModel for gcloud ml-engine predict

I trained an object detector using Cloud Machine Learning Engine according to the recent post by Google’s Derek Chow on the Google Cloud Big Data And Machine Learning Blog and now want to predict using Cloud Machine Learning Engine. The instructions…
4
votes
1 answer

"Attempted relative import in non-package" on Google Cloud ML

Since yesterday or the day before, I get a ValueError: Attempted relative import in non-package for an import in my main trainer file like from . import mobilenet_v1 as mobilenet when running the exact same trainer code with the exact same…
4
votes
2 answers

Reading data from bucket in Google ml-engine (tensorflow)

I am having issues reading data from a bucket hosted by Google. I have a bucket containing ~1000 files I need to access, held at (for example) gs://my-bucket/data. Using gsutil from the command line or one of Google's Python API clients I can…
Moss
  • 357
  • 1
  • 6
4
votes
1 answer

gcloud jobs submit prediction 'can't decode json' with --data-format=TF_RECORD

I pushed up some test data to gcloud for prediction as a binary TFRecord file. Running my script I got the error ('No JSON object could be decoded', 162). What do you think I am doing wrong? To push a prediction job to gcloud, I use this…
Jus
  • 504
  • 3
  • 11
4
votes
1 answer

How should Tensorboard's fraction of zero values be interpreted?

I am running a cloud ML engine job and my tensorboard plots are showing the fraction of zero values for my hidden layers steadily increasing towards 1 as the number of steps increases. How should this plot be interpreted? I believe it is a good…
reese0106
  • 2,011
  • 2
  • 16
  • 46
4
votes
1 answer

Why are there insufficient accelerators when I execute gcloud ml-engine jobs?

I'm trying to run a machine learning job in Google Cloud, but it always tells me that there are insufficient accelerators available. I've tried with the parameter --scale-tier=BASIC | BASIC_GPU | STANDARD_1 | PREMIUM_1 and the result is the same. Here…