I'm tying out DVC (https://dvc.org/) , based on the docs provided , i followed the sample, with following commands (see below) . I created a folder called storage and ran => dvc add storage.
now if i started adding data or csv , for example somefile.csv to this folder, do i need to run dvc add storage/somefile.csv ?
and i eventually want to run this on aws , so once is setup a s3 bucket , push my data to the bucket and run my training job on aws. I'm looking at CML (cml.dev) as well, which looks like , let me do that . will CML configuration know to pull from my remote storage?
also , i'm not too familiar with CML yet , is this for just running jobs?
i've tried set up with following commands
- git init
- dvc init
- mkdir storage
#added a storage folder for data then
- dvc add storage
- git commit
#set up the remote storage
- dvc remote add remote_storage s3://somebucket
- dvc push