I have a Mobilenet build on Keras. Running it locally takes around 290 seconds every step, but when I run on the GCLoud ML Engine it takes over 400 seconds.I put the following line on my code:
K.tensorflow_backend._get_available_gpus()
And the log is the following:
['/job:localhost/replica:0/task:0/device:GPU:0']
I have also tried to change from 1 GPU to 4 GPU but the result is the same. Do I have to change any code to optimize Keras on GPU?