CI/CD for Machine Learning
Ravi Bhadauria
Machine Learning Engineer
dvc remote add
commanddvc remote add myAWSremote s3://mybucket
dvc remote modify
dvc remote modify myAWSremote connect_timeout 300
.dvc/config
change ['remote "myAWSremote"']
url = s3://mybucket
connect_timeout = 300
dvc remote add mylocalremote /tmp/dvc
-d
flagdvc remote add -d mylocalremote /tmp/dvc
core
section of .dvc/config
[core]
remote = mylocalremote
Commands to transfer data
dvc push <target>
dvc pull <target>
Similar to git push
and git pull
.dvc
is tracked by Git, not DVCdvc push data.csv
dvc push
-r
flagdvc push -r myAWSremote data.csv
dvc add /path/to/data/datafile
.dvc
file to Gitgit add /path/to/datafile.dvc
git commit /path/to/datafile.dvc -m "Dataset updates"
git push origin main
dvc push
CI/CD for Machine Learning