I have a dataset, loaded like this:
mushrooms <- read.csv("mushrooms.csv")
I already have mushrooms.training_set, which is 1/3 of the whole dataset. For both variables, typeof() returns "list".
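To make this reproducible without the CSV, here is a minimal sketch of the setup. The toy data frame, its column names, and the index-sampling step are assumptions for illustration only; they are not my actual data or the way my split was necessarily made. It also shows that typeof() reports "list" for any data frame, which matches what I see.

```r
# Toy stand-in for mushrooms.csv (hypothetical columns, not the real data)
mushrooms <- data.frame(
  class = c("e", "p", "e", "p", "e", "p"),
  odor  = c("a", "f", "n", "f", "l", "n"),
  stringsAsFactors = FALSE
)

# Assumption: the training set was drawn by sampling 1/3 of the row indices
set.seed(1)
train_idx <- sample(nrow(mushrooms), size = nrow(mushrooms) / 3)
mushrooms.training_set <- mushrooms[train_idx, ]

typeof(mushrooms)               # "list": a data frame is a list of columns
typeof(mushrooms.training_set)  # "list"
class(mushrooms)                # "data.frame"
nrow(mushrooms.training_set)    # 2, i.e. 1/3 of the 6 toy rows
```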
Now I want to select the rows of the original dataset mushrooms that are not in mushrooms.training_set. How would I do this? I have tried the following:
mushrooms[c(!mushrooms.training_set),]
but this returns something on the order of 64K rows. I have also tried:
mushrooms[!mushrooms.training_set,]
mushrooms[!duplicated(mushrooms.training_set)]
Can anyone help me out?