Skip to content

Release 0.2.0-rc.0

Pre-release
Pre-release
Compare
Choose a tag to compare
@jhen0409 jhen0409 released this 29 Aug 03:46
· 137 commits to main since this release

0.2.0-rc.0 (2023-08-29)

Bug Fixes

  • cpp: incorrect prob convert if the str is already a known token (d296d09)
  • ios: do manual release due to disabled ARC in pod (6623e96)
  • ios: incorrect sent_token_probs_index (ac7c554)
  • send rest of content on stop (8a4b863)

Features

Breaking changes

  • sync llama.cpp
    • A new file format has been introduced: GGUF
    • Tokenizer fixes and API change
    • GGUFv2 adds support for 64-bit sizes + backwards compatible
    • Added support for Falcon models