Nothing to do. The TU datasets are automatically downloaded.
MNIST size is 1.39GB and CIFAR10 size is 2.51GB.
# At the root of the project
cd data/
bash script_download_superpixels.sh
Script script_download_superpixels.sh is located here. Codes to reproduce the datasets for MNIST and for CIFAR10.
ZINC size is 58.9MB.
ZINC-full size is 1.14GB.
# At the root of the project
cd data/
bash script_download_molecules.sh
Script script_download_molecules.sh is located here. Code to reproduce the ZINC dataset is here and the ZINC-full dataset is here.(../data/molecules/prepare_molecules.ipynb).
PATTERN size is 1.98GB and CLUSTER size is 1.26GB.
# At the root of the project
cd data/
bash script_download_SBMs.sh
Script script_download_SBMs.sh is located here. Codes to reproduce the datasets for PATTERN and for CLUSTER.
TSP size is 1.87GB.
# At the root of the project
cd data/
bash script_download_TSP.sh
Script script_download_TSP.sh is located here. Codes to reproduce the TSP dataset is here.
CSL size is 27KB.
# At the root of the project
cd data/
bash script_download_CSL.sh
Script script_download_CSL.sh is located here.
COLLAB size is 360MB.
No script to run. The COLLAB dataset files will be automatically downloaded from OGB when running the experiment files for COLLAB.
# At the root of the project
cd data/
bash script_download_all_datasets.sh
Script script_download_all_datasets.sh is located here.