Here offered the way how to profile tensorflow code. In my case I launch tf.run in several threads in parallel. How I can use this technique to profile multithreading architecture? When I use global metadata and options they log only single thread.
Thanks!