197

I have installed the GPU version of tensorflow on Ubuntu 14.04.

I am on a GPU server where tensorflow can access the available GPUs.

I want to run tensorflow on the CPUs.

Normally I can use env CUDA_VISIBLE_DEVICES=0 to run on GPU no. 0.

How can I select the CPUs instead?

I am not interested in rewriting my code with `with tf.device("/cpu:0"):`

dsalaj
Alexander R Johansen

13 Answers

211

You can also set the environment variable to

CUDA_VISIBLE_DEVICES=""

without having to modify the source code.
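
For example, a minimal sketch of how you might verify this (the script name below is just a placeholder): launch the program as CUDA_VISIBLE_DEVICES="" python your_script.py, where your_script.py contains something like:

import os
import tensorflow as tf

# With CUDA_VISIBLE_DEVICES set to an empty string, CUDA exposes no devices,
# so TensorFlow should fall back to the CPU.
print(os.environ.get("CUDA_VISIBLE_DEVICES"))  # expected: an empty string
print(tf.test.gpu_device_name())               # expected: an empty string (no GPU)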

fabrizioM
  • 3
    Someone said running neural nets on CPUs after the training phase is as performant as running them on GPUs -- i.e., only the training phase really needs the GPU. Do you know if this is true? Thanks! – Crashalot Nov 07 '16 at 19:03
  • 13
    @Crashalot: This is not true. Look at various benchmarks for inference; CPUs are an order of magnitude slower there too. – Thomas Nov 17 '16 at 13:58
  • 1
    @Thomas thanks. Suggestions on which benchmarks to consider? It probably also varies with the workload and the nature of the neural nets, right? Apparently the Google Translate app runs some neural nets directly on smartphones, presumably on the CPU and not the GPU? – Crashalot Nov 17 '16 at 19:44
  • @fabrizioM, a toy example will be more useful. – Girishkumar Jun 08 '17 at 10:54
  • @Thomas, what I have observed is after the training is done and the model is frozen, doing predictions using the frozen model on CPU (around 3 ms) is slightly faster than on GPU (around 7 ms). – Nandeesh Jun 29 '17 at 11:56
  • How do you hide it from tensorflow without hiding it from cuda? ie, I want something besides tensorflow in the same script to have access to the gpu – lahwran Aug 01 '17 at 00:10
  • 8
    This did not work for me. :/ I set the environment variable but tensorflow still uses the GPU. I'm using a conda virtual env, does this make a difference? – Guilherme de Lazari Aug 06 '17 at 15:56
  • However, if, say, we have no GPU devices, does this solution still work? Or do we have to use `{device_count = {'GPU':0}}`? – Bs He Oct 22 '18 at 20:03
  • It keeps finding the GPU. This may have worked on earlier versions of TF, but not anymore. –  Apr 05 '19 at 09:03
  • `CUDA_VISIBLE_DEVICES="" ./learn_tf.py` works well on a Fedora 31, TensorFlow 2.1.0, and CUDA 10.2 system. CUDA_VISIBLE_DEVICES can also be a comma-separated list of device ids. It affects the entire program and so does not help if you want to have CUDA code and CPU-only TensorFlow in one program. – kentavv Mar 15 '20 at 15:07
  • @Crashalot: for some datasets run with shallow models, the CPU is much faster even in the training phase because we can avoid the transfer time from memory to GPU on each batch. Some data feeding/preloading techniques can reduce this gap, but it sometimes still persists. 95% of the time the GPU is faster, but sometimes it's worth checking performance on the CPU. – mboss Jul 01 '20 at 04:25
  • @mboss thanks for the answer! is it possible to describe the cases where the CPU is faster or comparable in speed? – Crashalot Jul 01 '20 at 07:44
164

If the above answers don't work, try:

import os
os.environ['CUDA_VISIBLE_DEVICES'] = '-1'  # set this before importing tensorflow
Aravindh Kuppusamy
  • 3
    Just for the record, the first option doesn't seem to work anymore. –  Apr 05 '19 at 09:08
  • 1
    Works also for tf 2.X when using `tf.keras.Sequential` models. – Nicolas M. Jun 29 '20 at 16:26
  • 1
    Is there a way to do this without tensorflow invoking the error message "CUDA_ERROR_NO_DEVICE: no CUDA-capable device is detected" ? – Nermin Mar 22 '21 at 14:58
139

You can apply the device_count parameter per tf.Session:

config = tf.ConfigProto(
    device_count={'GPU': 0}
)
sess = tf.Session(config=config)

See also protobuf config file:

tensorflow/core/framework/config.proto
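
A minimal, hedged sketch of the same idea as a complete TF1-style program (this assumes TensorFlow 1.x, where tf.ConfigProto and tf.Session are available):

import tensorflow as tf  # TensorFlow 1.x assumed

# Report zero GPUs to this session so that every op is placed on the CPU
config = tf.ConfigProto(device_count={'GPU': 0})

with tf.Session(config=config) as sess:
    a = tf.constant([[1.0, 2.0], [3.0, 4.0]])
    b = tf.constant([[1.0], [2.0]])
    print(sess.run(tf.matmul(a, b)))  # computed on the CPU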

Ivan Aksamentov - Drop
  • 3
    Someone said running neural nets on CPUs after the training phase is as efficient as running them on GPUs -- i.e., only the training phase really needs the GPU. Do you know if this is true? Thanks! – Crashalot Nov 07 '16 at 19:03
  • 4
    That doesn't work for me (tf1.1). The solution of fabrizioM does. – P-Gn May 29 '17 at 14:42
  • 3
    Isn't it better to use `CUDA_VISIBLE_DEVICES` environment variable instead of changing the config in the code? – Nandeesh Jun 30 '17 at 15:22
  • 3
    @Nandeesh I guess it depends on your needs. So far there are at least 53 people who lean towards environment variables and 35 who prefer to set the number of devices in code. The advantage of the first is simplicity, and of the second is more explicit control over (multiple) sessions from within the Python program itself (the zero does not have to be hardcoded; it can be a variable). – Ivan Aksamentov - Drop Jun 30 '17 at 16:58
  • 1
    @Crashalot it depends on the nature of the network. For example, RNNs can be faster on CPUs for small batch sizes due to their sequential nature. CNNs will still benefit from a GPU in inference mode, but since you only need to run them once per example, a CPU _may_ be fast enough for many practical purposes. – Davidmh Sep 05 '17 at 07:29
  • 1
    I prefer this one, as sometimes you may want to run two networks simultaneously, a small network on the CPU while the bigger one is on the GPU. – Dammi Aug 29 '18 at 08:42
  • How does this configuration interact with `$CUDA_VISIBLE_DEVICES`? – CMCDragonkai Aug 30 '18 at 07:33
  • 4
    Do you know how to adapt this to tensorflow 2.0, as there is no more session or configproto? – Zaccharie Ramzi Jul 30 '19 at 13:25
  • This indeed doesn't work anymore: check out my lower answer for something that works from 2.3.1 up. – Michał Gacka Dec 28 '20 at 14:31
59

The environment variable solution doesn't work for me running tensorflow 2.3.1. Judging by the comments in the GitHub thread, the solution below works for versions >= 2.1.0.

From tensorflow github:

import tensorflow as tf

# Hide GPU from visible devices
tf.config.set_visible_devices([], 'GPU')

Make sure to do this right after the import, with a fresh TensorFlow instance (if you're running a Jupyter notebook, restart the kernel first).

And to check that you're indeed running on the CPU:

print(tf.__version__)

# To find out which devices your operations and tensors are assigned to
tf.debugging.set_log_device_placement(True)

# Create some tensors and perform an operation
a = tf.constant([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
b = tf.constant([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
c = tf.matmul(a, b)

print(c)

Expected output:

2.3.1
Executing op MatMul in device /job:localhost/replica:0/task:0/device:CPU:0
tf.Tensor(
[[22. 28.]
 [49. 64.]], shape=(2, 2), dtype=float32)
Michał Gacka
  • 3
    Working with tensorflow and pytorch in one script, this approach helped me disable CUDA for tensorflow while still letting pytorch use CUDA. I believe this answer deserves more votes. – Vinson Ciawandy Apr 26 '21 at 08:56
  • 2
    A potential advantage of this solution is that it doesn't rely on a variable that explicitly mentions CUDA and which might as such be reserved to specific devices. For example, it works on my Apple Silicon Mac. – Eric O. Lebigot Jul 01 '21 at 11:02
  • Best solution, thx (because I'm on a Silicon too :D) – ker2x Dec 08 '21 at 03:36
  • Works for me like a charm. In a Jupyter notebook, just follow these steps (based on the above comment): restart the kernel --> put this line just beneath the tensorflow import: tf.config.set_visible_devices([], 'GPU') --> run your script. – madhur Feb 19 '22 at 00:56
  • this finally worked for me on tensorflow 2.7.0, thanks! – Olympia May 06 '22 at 15:31
  • Did not work for me on Windows with TF 2.7. After running, I did `tf.config.list_physical_devices("GPU")` and got the same list of GPUs as before. – brethvoice Jun 28 '22 at 15:33
  • I can confirm this has worked for me on Apple Silicon M2. – talha06 Feb 18 '23 at 18:24
39

For me, only setting CUDA_VISIBLE_DEVICES to precisely -1 works:

Works:

import os
import tensorflow as tf

os.environ['CUDA_VISIBLE_DEVICES'] = '-1'

if tf.test.gpu_device_name():
    print('GPU found')
else:
    print("No GPU found")

# No GPU found

Does not work:

import os
import tensorflow as tf

os.environ['CUDA_VISIBLE_DEVICES'] = ''    

if tf.test.gpu_device_name():
    print('GPU found')
else:
    print("No GPU found")

# GPU found
Andrzej Gis
13

As recommended by the Tensorflow GPU guide.

# Place tensors on the CPU
with tf.device('/CPU:0'):
  a = tf.constant([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
  b = tf.constant([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
  # Any additional tf code placed in this block will be executed on the CPU
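
A hedged sketch of how this can be checked (the matmul is just an example op; tf.debugging.set_log_device_placement logs the device each op runs on):

import tensorflow as tf

tf.debugging.set_log_device_placement(True)  # log the device of each op

with tf.device('/CPU:0'):
  a = tf.constant([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
  b = tf.constant([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
  c = tf.matmul(a, b)  # the log should show this op placed on .../device:CPU:0

print(c)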

David
8

In my case, for tensorflow 2.4.0, none of the previous answers works unless you install tensorflow-cpu:

pip install tensorflow-cpu
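
A quick, hedged way to confirm that the CPU-only package is the one being imported (the exact version string and device list depend on your environment):

import tensorflow as tf

print(tf.__version__)                          # e.g. 2.4.0
print(tf.config.list_physical_devices('GPU'))  # expected: an empty list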
Riccardo Casatta
  • This works for tensorflow 2.5. But I think my GPU for tensorflow 2.5 is no longer usable in the current environment after running the command. (I tried the recommended way above and it doesn't work.) – jona May 10 '21 at 17:14
  • How to use this tensorflow-cpu with tf.io.decode_image? – agata Jan 20 '22 at 09:04
  • Thank you, this also worked for TF 2.7 on Windows (Python 3.8) – brethvoice Jun 28 '22 at 15:46
  • Still works and seems up-to-date to this day, thanks. – Andrew Jul 14 '23 at 20:09
7

Just use the code below.

import os
os.environ["CUDA_DEVICE_ORDER"] = "PCI_BUS_ID"
os.environ['CUDA_VISIBLE_DEVICES'] = '-1'
fmnijk
4

Another possible solution at the installation level is to use the CPU-only variant.

In my case, this currently gives:

pip3 install https://storage.googleapis.com/tensorflow/windows/cpu/tensorflow_cpu-2.2.0-cp38-cp38-win_amd64.whl

Just select the correct version (in this case, cp38 indicates Python 3.8; moreover, TensorFlow 2.2.0 is used, the current version as of Jul 12 '20).


Bonus points for using a venv, as explained e.g. in this answer.

Cadoiz
3

1. Fabrizio's answer worked for me:

import os
os.environ["CUDA_DEVICE_ORDER"]="PCI_BUS_ID"
os.environ["CUDA_VISIBLE_DEVICES"]="-1"

I had to make the change before importing tensorflow. I am using tensorflow 2.4.0.

2. Another (subpar) solution could be to rename the cusolver64_10.dll file that is required for GPU computing. Since tensorflow can't find the DLL, it will automatically use the CPU.

It should be in a place like: C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.2\bin

Bob_D_Builder
2

You could use tf.config.set_visible_devices. One possible function that allows you to set whether and which GPUs to use is:

import tensorflow as tf

def set_gpu(gpu_ids_list):
    gpus = tf.config.list_physical_devices('GPU')
    if gpus:
        try:
            gpus_used = [gpus[i] for i in gpu_ids_list]
            tf.config.set_visible_devices(gpus_used, 'GPU')
            logical_gpus = tf.config.experimental.list_logical_devices('GPU')
            print(len(gpus), "Physical GPUs,", len(logical_gpus), "Logical GPU")
        except RuntimeError as e:
            # Visible devices must be set before GPUs have been initialized
            print(e)

Suppose you are on a system with 4 GPUs and you want to use only two of them, the one with id = 0 and the one with id = 2. Then the first command of your code, immediately after importing the libraries, would be:

set_gpu([0, 2])

In your case, to use only the CPU, you can invoke the function with an empty list:

set_gpu([])

For completeness, if you want to prevent the runtime initialization from allocating all of the memory on the device, you can use tf.config.experimental.set_memory_growth. Finally, the function to manage which devices to use, occupying GPU memory dynamically, becomes:

import tensorflow as tf

def set_gpu(gpu_ids_list):
    gpus = tf.config.list_physical_devices('GPU')
    if gpus:
        try:
            gpus_used = [gpus[i] for i in gpu_ids_list]
            tf.config.set_visible_devices(gpus_used, 'GPU')
            for gpu in gpus_used:
                tf.config.experimental.set_memory_growth(gpu, True)
            logical_gpus = tf.config.experimental.list_logical_devices('GPU')
            print(len(gpus), "Physical GPUs,", len(logical_gpus), "Logical GPU")
        except RuntimeError as e:
            # Visible devices must be set before GPUs have been initialized
            print(e)
Roberto Amoroso
1
# this works on tensorflow 2.8, windows 10, jupyterlab Version 3.3.2
# these must be the very FIRST lines of code

import tensorflow as tf

tf.config.set_visible_devices([], 'GPU')

# if tf.test.gpu_device_name(): # this lies and tells you about all devices
if tf.config.experimental.list_logical_devices('GPU'):
    print('GPU found')
else:
    print("No GPU found")

I spent way too many hours trying to figure this out. Most attempts left the process running partially on the CPU while still loading into GPU memory. Strange...

Running the above code FIRST, before anything else, worked.

I was able to increase my hidden layer from 6k to 12k. It is running now, using only the CPU. Each epoch takes about 10x as long as on the GPU: from about 3 minutes per epoch to a bit over 35 minutes per epoch. This is an acceptable trade-off between training time and model size.

-1

On some systems, one has to specify:

import os
os.environ["CUDA_DEVICE_ORDER"]="PCI_BUS_ID"
os.environ["CUDA_VISIBLE_DEVICES"]=""  # or even "-1"

BEFORE importing tensorflow.

Fabrizio