I am new to machine learning. This is a binary classification problem. I want to figure out how to deal with testing data that doesn't include target (output).
Normally, I will use sklearn
:
from sklearn.model_selection import train_test_split
if the data (training + test) includes target (output) value being part of the all the data. But in my case two separate files are given. The training data file includes the target value as part of the data, however, the testing data doesn't have target value. I was wondering how I can use an sklearn classification technique to deal with this situation. I have to validate the data to check the accuracy of the classification. You can use any toy example for explanation.