Having walked through several tutorials, I have managed to make a script that successfully uses XGBoost to predict categorial prices on the Boston housing dataset.
However, I cannot successfully tune the parameters of the model using CV. Even after trying several solutions from tutorials and postings here on stackowerflow.
My best outcome so far is very 'hacky' and only tunes a single parameter:
steps <- seq(75,90,5)/100
for(i in steps){
.....
}
But I see all of these fancy setups, that run through several parameters automatically using MLR or Caret or NMOF. However, I haven't gotten close to getting anyone to work on these data. I suspect that it is because most are set up for binary classification but even when addressing this as best, I can I have no success. I could provide you with hundreds of line of code that does not work, but I think the easiest is to provide my code as far as it works here and hear you out how you would progress from here rather than getting swamped in my poor code.
Edit: As I have not had any success even running other peoples scripts. Here are some additional details:
> packageVersion("mlr")
‘2.11’
> packageVersion("xgboost")
‘0.6.4.1’