The question says:
Load the data and split it into 75% training and 25% validation data using set.seed(4650).
this is what I have:
setwd("C:/Users/Downloads")
cat = read.csv("cat.csv")
set.seed(4650)
train = sample(c(TRUE, TRUE, TRUE, FALSE), nrow(cat), rep = TRUE)
validation = (!train)
And I need to provide summary of the training data.
summary(train)
which gives me
Mode FALSE TRUE
logical 830 2463
Am I splitting the data in the right way?
Thank you very much.