Install Anaconda Python 3:
- https://www.anaconda.com/download
- https://www.digitalocean.com/community/tutorials/how-to-install-the-anaconda-python-distribution-on-ubuntu-16-04
Install Needed Extra Libraries:
pip install --upgrade google-api-python-client
pip install boto3
pip install awscli --upgrade --user
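To confirm that the libraries above landed in the Python environment you intend to use, a small check like the following may help. This is only a sketch: the helper name `missing_packages` and the `REQUIRED_MODULES` mapping are illustrative and not part of this project.

```python
import importlib.util

# pip package name -> importable module name for the libraries installed above.
# This mapping is an illustrative assumption, not something shipped with the project.
REQUIRED_MODULES = {
    "google-api-python-client": "googleapiclient",
    "boto3": "boto3",
    "awscli": "awscli",
}


def missing_packages(required=REQUIRED_MODULES):
    """Return the pip names whose modules the current interpreter cannot find."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    print("All packages found" if not missing else "Missing: " + ", ".join(missing))
```

Run it with the Anaconda interpreter; a package reported as missing was most likely installed into a different Python environment.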
Install Docker and the caffe docker image:
sudo apt-get install docker.io
sudo docker run -ti bvlc/caffe:cpu caffe --version
Make sure your AWS credentials are set up:
Set up ~/.aws/credentials with your key and secret:
[default]
aws_access_key_id = YOUR_KEY
aws_secret_access_key = YOUR_SECRET
Set the default region in ~/.aws/config:
[default]
region=ap-south-1
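Before running the scripts, it can be useful to check that both files contain the entries shown above. The following is a minimal sketch using only the standard library; the function name `missing_aws_settings` is hypothetical, and it only inspects the `[default]` section without contacting AWS or validating the keys.

```python
import configparser
from pathlib import Path


def missing_aws_settings(credentials_path, config_path):
    """Return a list of [default] settings that are absent from the two files."""
    missing = []

    creds = configparser.ConfigParser()
    creds.read(credentials_path)  # a file that does not exist is silently skipped
    for key in ("aws_access_key_id", "aws_secret_access_key"):
        if not creds.has_option("default", key):
            missing.append(f"{credentials_path}: {key}")

    config = configparser.ConfigParser()
    config.read(config_path)
    if not config.has_option("default", "region"):
        missing.append(f"{config_path}: region")

    return missing


if __name__ == "__main__":
    aws_dir = Path.home() / ".aws"
    problems = missing_aws_settings(aws_dir / "credentials", aws_dir / "config")
    print("OK" if not problems else "Missing: " + ", ".join(problems))
```

An empty result only means the entries are present; whether the key and secret are actually valid is not checked here.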
- Export the Google Drive spreadsheet as a CSV file
- Run
./process-csv csv-file-name
- Run
./package-dataset output-folder-path
Use the -f flag to force syncing with Google Drive; otherwise, cached local copies are used if available.
- Run
./package-dataset directory-to-store-tarball
This creates a folder called 'swm-ml-dataset' in your home directory and saves a tarball to the folder you specify.
- Start the Caffe container with the dataset images and the swm-ml directory mounted:
sudo docker run -ti -v ~/swm-ml-dataset/images:/opt/caffe/swm-ml-dataset -v /home/ubuntu/swm-ml:/opt/caffe/swm-ml bvlc/caffe:cpu /bin/bash
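To sanity-check the tarball written by the packaging step, you could list its first few entries. This is a sketch under assumptions: `list_tarball` is an illustrative helper, and the tarball's file name and compression format are not stated here, so the archive path is something you supply yourself.

```python
import itertools
import tarfile


def list_tarball(path, limit=10):
    """Return up to `limit` member names from the tar archive at `path`."""
    # "r:*" lets tarfile detect the compression (none, gzip, bz2, xz) on its own.
    with tarfile.open(path, "r:*") as archive:
        return [member.name for member in itertools.islice(archive, limit)]
```

Call it with the path of the tarball inside the directory you passed to ./package-dataset and print the result; an empty list or a read error suggests the packaging step did not complete.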
- Andrew Ziegler