I still don't after having read documentation about tf.keras.Model.fit
and tf.data.Dataset
, when passing tf.data.Dataset
to fit function, should I call repeat
and batch
on the dataset object or should I provide the batch_size
and epochs
arguments to fit instead? or both? Should I apply the same treatment to the validation set?
And while I'm here, can I shuffle
the dataset before the fit
? (seems like it's an obvious yes)
If so, before, after calling Dataset.batch
and Dataset.repeat
(if calling them)?
Edit: When using batch_size
argument, and without having called Dataset.batch(batch_size)
previously, I am getting the following error:
ValueError: The `batch_size` argument must not be specified for the given input type.
Received input: <MapDataset shapes: ((<unknown>, <unknown>, <unknown>, <unknown>), (<unknown>, <unknown>, <unknown>)),
types: ((tf.float32, tf.float32, tf.float32, tf.float32), (tf.float32, tf.float32, tf.float32))>,
batch_size: 1
Thanks