As I understand it, TF runs multiple operators in parallel as long as they are independent (link). On CPU, this parallelism can be controlled with inter_op_parallelism_threads and intra_op_parallelism_threads (link). However, these parameters do not affect GPU operators at all. How can I control parallelism on the GPU? For example, how can I run operators serially even when some of them are independent?
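For reference, this is roughly how I pass the two CPU settings mentioned above. It is only a minimal sketch: it assumes the TF 1.x session API (reached through tf.compat.v1 so it also runs on TF 2.x installs), and the 4x4 matrices and the value 1 for both thread counts are arbitrary choices for illustration.

```python
import tensorflow as tf

# Assumption: TF 1.x-style graph/session API, accessed via tf.compat.v1.
tf1 = tf.compat.v1

config = tf1.ConfigProto(
    inter_op_parallelism_threads=1,  # threads used to run independent ops concurrently (CPU)
    intra_op_parallelism_threads=1,  # threads used inside a single op (CPU)
)

# Build a tiny graph explicitly so the snippet does not depend on eager mode.
graph = tf.Graph()
with graph.as_default():
    a = tf.random.normal([4, 4])
    b = tf.random.normal([4, 4])
    x = tf.matmul(a, b)

# The config is applied when the session is created.
with tf1.Session(graph=graph, config=config) as sess:
    result = sess.run(x)
```

With this config the CPU thread pools are limited, but as far as I can tell GPU kernels are scheduled the same way as before.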
EDIT:
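import tensorflow as tf  # TF 1.x API
N = 1000  # matrix size; example value, not given in the original post

# Four independent random N x N inputs
a = tf.random_normal([N, N])
b = tf.random_normal([N, N])
c = tf.random_normal([N, N])
d = tf.random_normal([N, N])
# x and y do not depend on each other, so TF may run them in parallel
x = tf.matmul(a, b)
y = tf.matmul(c, d)
# z depends on both x and y
z = tf.matmul(x, y)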