21

How does Intel TBB choose the number of threads to used for a parallel section?

Is there some kind of specification available?

chris
  • 257
  • 1
  • 2
  • 7

3 Answers3

22

As of TBB Version 2.2 the task scheduler will be automatically initialized and on runtime take care of the numbers of threads to use, if you manually want to change that number, you can use one of the following methods:

When you create the scheduler, you can specify the number of threads as

tbb::task_scheduler_init init(nthread);

else you can use

tbb::task_scheduler_init init(tbb::task_scheduler_init::automatic);

In this case, tbb scheduler creates as many threads as your CPU cores

Stephan Dollberg
  • 32,985
  • 16
  • 81
  • 107
Manny
  • 699
  • 6
  • 19
  • 3
    You must keep the `tbb::task_scheduler_init` object alive after this, or it will revert back to automatic. – Petr Hudeček Apr 04 '15 at 13:09
  • 5
    Also, the task scheduler object just affects TBB jobs launched from the current thread. When creating further jobs from another thread, you need another `tbb::task_scheduler_init`. – Wenzel Jakob May 24 '15 at 18:29
  • This is actually not a solution, because given any `nthread` value, TBB still creates # of threads equal to hardware CPU threads. And, even worse, if you specify `nthreads=2`, TBB creates a second thread and runs TBB work on it, and not on the master thread. Please see this question for details: https://stackoverflow.com/questions/59736661/how-to-completely-switch-off-threading-in-tbb-code – Dmitry Mikushin Jan 14 '20 at 15:18
12

Letting TBB decide the number of threads in the pool is the recommended option - it will usually create as many worker threads as there are logical CPUs on the machine - see Class reference for tbb::task_scheduler_init.

It's not easy to find out how many worker threads exist or are executing tasks at any given time - this is a deliberate design choice. From Intel's TBB Parallel Programming Course:

How do I know how many threads are available?

Do not ask!

  • Not even the scheduler knows how many threads really are available
  • There may be other processes running on the machine
  • Routine may be nested inside other parallel routines
Community
  • 1
  • 1
Josh Milthorpe
  • 956
  • 1
  • 14
  • 27
  • Is the task scheduler able to detect if it running in a cgroups limited environment (e.g. a docker container?). For instance openmp runtimes are typically not and will tend to cause oversubscription when running in docker containers (e.g. limited to a 2000 mCPU quota on a machine with 32 cores). – ogrisel Oct 23 '19 at 12:34
  • If several processes are running on the same host (for instance when using multiprocessing in Python to spawn workers), can the TBB scheduler detect the concurrent version of the code running in other processes and automatically tune the number of active threads to avoid oversubscription problems? What if the processes are not started by the same unix user? – ogrisel Oct 23 '19 at 12:41
  • Replying to myself: TBB can use interprocess communication to coordinate the schedulers and avoid oversubscription. This can be done by setting the ENABLE_IPC environment variables. – ogrisel Oct 24 '19 at 14:05
  • 2
    I have tried to run some numpy / MKL / TBB benchmarks in a script launched with `docker run --cpus 2` on a host with 28 cores and I observed quite sever oversubscription related performance degradation. – ogrisel Oct 24 '19 at 15:30
  • For the later issue with docker, I have filed a report at: https://github.com/intel/tbb/issues/190 – ogrisel Oct 25 '19 at 12:08
0

Documetation says just "dependent on hardware configuration". Possibly it just number of CPU cores available.

blaze
  • 4,326
  • 18
  • 23