I am starting to use a MultiOutputRegressor in sci-kit learn for a multi-variable target I am trying to estimate with Random Forests.
I did start implementing manually before I came across this MultiOutputRegressor, and was trying to rotate the output for single output regressors so that a single target was used at any given time - and the other target variables used as inputs - but it was becoming computationally expensive.
I have searched and reviewed some code, but am struggling to determine if the target output (y) is used as an input feature (X). Specifically:
- when y_1 is being predicted, are y_2 ... y_n used as input features?
- when y_x is being predicted, are y_1 ... y_n (excluding y_x) used as input features?
- when y_n is being predicted, are y_1 ... y_n-1 used as input features? (apologies if I'm being overly verbose)
The paper "Multi-target regression via input space expansion" explains what I am looking to achieve.
Some answers have alluded to the fact that the MultiOutputRegressor algorithm may look for correlations between the target values, but I'm hoping they're actually rotated to be inputs (or effective inputs) for the algorithm in my application.