When setting up DVC, can I simply add my entire directory, i.e. dvc add dataset?
My workflow would be to update the whole dataset folder for each iteration, with the contents of the folder cached by DVC. If I ever want to go back to a previous version of the data, I should be able to do a dvc checkout.
Or is it better to add each file to DVC individually? My project currently looks like this:
— .dvc
- config
— dataset
- fileone.csv
- train.py
- requirements.txt
I have tracked individual files so far, but would it be easier to track the entire folder if I end up with hundreds of files?
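
To make the question concrete, here is a rough sketch of the folder-level workflow I have in mind. It assumes the project is already a Git repository with dvc init done, and that the folder is named dataset as above. The function names snapshot_dataset and restore_dataset are placeholders of my own, not DVC commands.

```shell
# Hypothetical helper: record the current state of the whole folder.
# dvc add writes dataset.dvc and caches the folder's contents;
# Git only versions the small dataset.dvc pointer file.
snapshot_dataset() {
  dvc add dataset &&
  git add dataset.dvc .gitignore &&
  git commit -m "$1"
}

# Hypothetical helper: bring back the data as of an earlier Git revision.
# First restore the old pointer file, then let DVC sync the folder from its cache.
restore_dataset() {
  git checkout "$1" -- dataset.dvc &&
  dvc checkout dataset.dvc
}
```

For example, I would run snapshot_dataset "update dataset" after each change to the folder, and restore_dataset HEAD~1 to go back one version.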