simple_stories_train

Project for training small LMs. Designed for training on SimpleStories, an extension of TinyStories.

Installation

From the root of the repository, run one of

make install-dev  # To install the package, dev requirements and pre-commit hooks
make install  # To just install the package (runs `pip install -e .`)

Development

Suggested extensions and settings for VSCode are provided in .vscode/. To use the suggested settings, copy .vscode/settings-example.json to .vscode/settings.json.

There are various make commands that may be helpful

make check  # Run pre-commit on all files (i.e. pyright, ruff linter, and ruff formatter)
make type  # Run pyright on all files
make format  # Run ruff linter and formatter on all files
make test  # Run tests that aren't marked `slow`
make test-all  # Run all tests

Usage

First, download the TinyShakespeare dataset:

python simple_stories_train/tinyshakespeare.py

Then, train llama on it: python simple_stories_train/train_llama.py --model d12 --input_bin simple_stories_train/tinyshakespeare/tiny_shakespeare_val.bin

You may be asked to enter your wandb API key. You can find it in your wandb account settings. Alternatively, to avoid entering your API key on program execution, you can set the environment variable WANDB_API_KEY to your API key.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github		.github
.vscode		.vscode
simple_stories_train		simple_stories_train
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
ACCESS.md		ACCESS.md
Makefile		Makefile
README.md		README.md
conftest.py		conftest.py
pyproject.toml		pyproject.toml
setup_pkg.sh		setup_pkg.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

simple_stories_train

Installation

Development

Usage

About

Releases

Packages

Contributors 3

Languages

danbraunai/simple_stories_train

Folders and files

Latest commit

History

Repository files navigation

simple_stories_train

Installation

Development

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages