Skip to content

v1.1.0 - breaking change for v5 (treat it as beta3)

Compare
Choose a tag to compare
@PicoCreator PicoCreator released this 06 Sep 20:56
· 345 commits to main since this release

WARNING: this version breaks existing v5 models, use v5-beta2 for previous models. Until an official v5 1.5B or higher official model is released, we will not be treating v5 breaking changes as major version changes. The existing v5 code (r3) has not been throughly tested, and maybe subjected to future changes.

What's Changed

  • Upgrading v5 to be in sync with v5r3 from blinks official repo
  • Move existing v5 code to v5-beta2 folder (as i know some folks have already started experimenting with v5)
  • various readme / documentation / example changes
  • Added data offset and limit params (to document)
  • WIP docker container
  • Fix for older python/lightning version for multi-gpu sync

Additional changes that was merged in

  • limited dataloader num_worker max to 8 by @diannaojiang in #17
  • (optional) Added token 0 to the tokenizer. by @m8than in #18
  • Dataset Sorting + multi column suffix features by @m8than in #23

New Contributors

Full Changelog: v1.0.2...v1.1.0