type(train_x)
numpy.ndarray
train_samples = train_x.tolist()
When I print the index of each of my samples, some indices show up more than once and out of order (see the lines marked "# here" below). Why might this be happening?
It is breaking my pipeline downstream, although sometimes it runs fine, when the indices happen to come out in order.
for tr in train_samples:
    print(train_samples.index(tr))
...
11
12
13
14 # here
15
...
39
40
41
42
14 # here
...
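A likely cause, assuming train_x contains repeated rows: list.index() returns the position of the first element equal to its argument, not the position of the element currently being visited. The sketch below uses a small hypothetical list in place of train_x.tolist() (the values are made up for illustration) and compares list.index() with enumerate().

```python
# Hypothetical stand-in for train_x.tolist(); rows 1 and 3 are identical.
train_samples = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.3, 0.4]]

# list.index() returns the position of the FIRST element equal to its
# argument, so the duplicate row at position 3 is reported as index 1.
via_index = [train_samples.index(tr) for tr in train_samples]

# enumerate() yields each element's actual position, duplicates included.
via_enumerate = [i for i, _ in enumerate(train_samples)]

print(via_index)      # [0, 1, 2, 1]
print(via_enumerate)  # [0, 1, 2, 3]
```

If that is what is happening here, iterating with enumerate() would give the true positions. It would also explain why the loop sometimes "runs fine": whenever the data happens to contain no repeated rows, the first match is always the element itself.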
Proving answer about duplicate entries: