I have a file of biological pathways and their abundances across samples. I want to build a random forest classification model in R, but beyond removing rows with null counts, I am not sure what other preprocessing is necessary. I am new to this.
- Are normalization and scaling required?
- How can I handle confounding variables such as age, sex, and weight?