Skip to content

lemay-ai/week11-dvc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

python3 -m venv myenv

source myenv/bin/activate

pip install -r requirements.txt

dvc init

git add .dvc .gitignore

git commit -m "Initialize DVC"

python3 getDataset.py

aws configure

aws s3 mb s3://uottawa-dvc-bucket

dvc remote add s3_remote s3://uottawa-dvc-bucket

dvc remote modify s3_remote region us-west-2 # Change region if needed

dvc remote modify s3_remote access_key_id YOUR_AWS_ACCESS_KEY_ID

dvc remote modify s3_remote secret_access_key YOUR_AWS_SECRET_ACCESS_KEY

dvc remote default s3_remote

dvc add data/imdb

git add data/imdb.dvc .gitignore

git commit -m "Track IMDb dataset with DVC on S3"

dvc push

NOW CLONE THE REPO AGAIN SOMEWHERE ELSE. IT'S SMALL BECAUSE THE DATASET IS IN S3, NOT GIT

git clone the repo

./addRemote.sh

dvc pull

THE DATASET WILL DOWNLOAD FROM S3

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors