v1.1.0 - breaking change for v5 (treat it as beta3)
WARNING: this version breaks existing v5 models, use v5-beta2 for previous models. Until an official v5 1.5B or higher official model is released, we will not be treating v5 breaking changes as major version changes. The existing v5 code (r3) has not been throughly tested, and maybe subjected to future changes.
What's Changed
- Upgrading v5 to be in sync with v5r3 from blinks official repo
- Move existing v5 code to v5-beta2 folder (as i know some folks have already started experimenting with v5)
- various readme / documentation / example changes
- Added data offset and limit params (to document)
- WIP docker container
- Fix for older python/lightning version for multi-gpu sync
Additional changes that was merged in
- limited dataloader num_worker max to 8 by @diannaojiang in #17
- (optional) Added token 0 to the tokenizer. by @m8than in #18
- Dataset Sorting + multi column suffix features by @m8than in #23
New Contributors
- @diannaojiang made their first contribution in #17
- @m8than made their first contribution in #18
Full Changelog: v1.0.2...v1.1.0