I have a dataset with 10000 samples, and 4 classes (0, 1, 2, 3) label.
>>>data.shape
(10000, 250)
>>>label.shape
(10000,)
and, I wonder are there any API that could split the data into training and test data and shuffle?
for example:
(training_data, training_label, test_data, test_label) = split_shuffle(data, label, 80) # 80 means 80% training, 20% test
What is the most efficient way to achieve such functions?
Further, what if we want 5-fold (straight) cross validation data?