Normally you would have three sets:
- train - the one used to do actual training, optimisation over
- validation - the one used to evaluate/verify training, make decisions about hyperparameters, early stopping and so on
- test - the one used as final benchmarking
For various reasons some of the above might be missing in the setup, but this is the standard approach, and every modification requires good reasons to do that.
Often datasets do not specify "validation", as fitting of hyperprameters etc. is considered part of the training, thus every data point used for that, is de facto used to train your model (thus a part of "train" dataset). In practise, this means that you have to split train set on your own, into "proper train" and "validation" (if method being used requires fitting some additional hyperparameters).