I am using RandomForestRegressor and I want to use the largest number of trees without adding to the total runtime. My dubious assumption is that if my computer has 100 cores, by specifying a number of trees that is a multiple of 100, I am getting the most bang for my buck. Is this necessarily true?
The regression task is being performed within a hyperparameter optimization procedure, and since I specified n_jobs=1 for this tuning procedure, I am not sure if the number of trees should actually be a multiple of 99 since at least one core may be occupied.
The post How does scikit-learn handle multiple n_jobs arguments? is similar but the top answer quotes the documentation which states that n_jobs=-1 corresponds to "all processors". Again, I'm not sure if this literally means all processors, or all available processors (or if these would be the same in my situation).