On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization

This repository is the official implementation of "On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization".

View Demo

Requirements

To install requirements:

conda create -n rnd-init python==3.11
conda activate rnd-init
pip install -r requirements.txt

Note that installing TensorFlow correctly can be tricky; check the installation instructions for your OS.

Training & Evaluation

For details on how to choose a model, dataset, and other options, run this command:

python src/train_and_eval.py --help

Here is a sample command to train and evaluate a baseline CNN model on the CIFAR-10 dataset using a quasi-random Glorot Uniform initializer:

python src/train_and_eval.py \
    --experiment_id=1 \
    --sequence_scheme=auto \
    --dataset_id=cifar10 \
    --kernel_initializer=glorot-uniform \
    --initializer_type=quasi-random \
    --model_id=baseline-cnn \
    --units=64 \
    --batch_size=64 \
    --epochs=30 \
    --optimizer=adam \
    --learning_rate=0.0001 \
    --out_path=results.json

A sample execution harness is provided in test.sh.

To set a specific starting seed value, see test_specific_seeds.sh.

Structure

Distributions-related files

src/distributions_qr.py contains implementations of random_normal, random_uniform, and truncated_normal distributions mimicking the API of the corresponding keras.backend functions. Our implementations are driven by quasirandom Sobol sequences rather than a pseudorandom number generator.
tests/test_distributions_qr.py contains additional examples of usage of these functions.
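
As a rough illustration of the idea behind these functions, the sketch below draws uniform and normal samples from a low-discrepancy sequence instead of a pseudorandom generator. It is not the code in src/distributions_qr.py: it assumes a one-dimensional base-2 van der Corput sequence as a simple stand-in for Sobol sequences, uses only the Python standard library, and the names van_der_corput, qr_uniform, and qr_normal are hypothetical.

```python
from statistics import NormalDist


def van_der_corput(index: int, base: int = 2) -> float:
    """Return the index-th point of the van der Corput sequence in [0, 1)."""
    value, denom = 0.0, 1.0
    while index > 0:
        index, digit = divmod(index, base)
        denom *= base
        value += digit / denom
    return value


def qr_uniform(n, minval=0.0, maxval=1.0, start=1):
    """Hypothetical helper: n quasirandom points scaled to [minval, maxval)."""
    return [
        minval + (maxval - minval) * van_der_corput(i)
        for i in range(start, start + n)
    ]


def qr_normal(n, mean=0.0, stddev=1.0, start=1):
    """Hypothetical helper: map quasirandom uniforms through the inverse normal CDF."""
    dist = NormalDist(mean, stddev)
    return [dist.inv_cdf(van_der_corput(i)) for i in range(start, start + n)]


print(qr_uniform(4))  # [0.5, 0.25, 0.75, 0.125]
print(qr_normal(3))
```

The sequence starts at index 1 because the point at index 0 is exactly 0, where the inverse normal CDF is undefined.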

Initializers

src/custom_random_generator.py contains a custom random generator that invokes the quasirandom distributions from src/distributions_qr.py.
src/custom_initializer.py contains ten custom initialization schemes that use quasirandom distributions as the underlying source of randomness. These initialization schemes adhere to the API of the keras.initializers classes and can be readily used to initialize Keras layers. We add the suffix QR (quasi-random) to the name of each custom initialization scheme.
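
To make the idea of a quasirandom initialization scheme concrete, here is a minimal sketch of a Glorot Uniform weight matrix filled from a low-discrepancy sequence. It is an assumption-laden illustration, not the class in src/custom_initializer.py: it does not use Keras, it substitutes a base-2 radical-inverse (van der Corput) sequence for Sobol sequences, and the names radical_inverse_base2 and glorot_uniform_qr are hypothetical. The only part taken from the standard Glorot Uniform definition is the bound sqrt(6 / (fan_in + fan_out)).

```python
import math


def radical_inverse_base2(i: int) -> float:
    """Mirror the binary digits of i around the radix point (van der Corput, base 2)."""
    return int(format(i, "b")[::-1], 2) / (1 << i.bit_length())


def glorot_uniform_qr(fan_in: int, fan_out: int, start: int = 1):
    """Hypothetical sketch: a fan_in x fan_out matrix with entries in [-limit, limit)."""
    limit = math.sqrt(6.0 / (fan_in + fan_out))  # Glorot Uniform bound
    count = fan_in * fan_out
    # Scale consecutive quasirandom points from [0, 1) to [-limit, limit).
    flat = [
        -limit + 2.0 * limit * radical_inverse_base2(i)
        for i in range(start, start + count)
    ]
    # Reshape the flat list into rows of length fan_out.
    return [flat[r * fan_out:(r + 1) * fan_out] for r in range(fan_in)]


weights = glorot_uniform_qr(fan_in=2, fan_out=4)
for row in weights:
    print(row)
```

The start argument plays the role of a seed here: a different starting index selects a different stretch of the same deterministic sequence.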

Test

Automatic unit tests

Run test cases (from the root directory of the project) and produce code coverage data using

pytest

Pytest parameters are set in pyproject.toml.

Produce an HTML code coverage report using

coverage html

Note that the GitHub CI pipeline generates the latest code coverage report automatically and places it in the python-coverage-comment-action-data branch.

Citation

If you use the paper, algorithm, or code, please cite them as follows.

@misc{qrng_init_2024,
      title={On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization}, 
      author={Andriy Miranskyy and Adam Sorrenti and Viral Thakar},
      year={2024},
      eprint={2408.02654},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2408.02654},
}

Contributing

To contribute to the project, please feel free to submit a pull request. For major changes, please open an issue first to discuss what you would like to change.

License

This project is licensed under the Apache License 2.0 -- see the LICENSE file for details.
