v0.5.0 #72
benbrandt
announced in
Announcements
v0.5.0
#72
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
What's New
tokenizers
ortiktoken-rs
crates by applying binary search when attempting to find the next matching chunk size. @benbrandt and @bradfier in Use binary search in TextChunks #71Breaking Changes
tokenizers
is now0.15.0
tiktoken-rs
is now0.5.6
with_trim_chunks(true)
. It is a tradeoff, but would have made the binary search code much more complicated to keep the exact same behavior.chunk_size
method onChunkSizer
now needs to accept aChunkCapacity
argument, and return aChunkSize
struct instead of ausize
. This was to help support the new binary search method in chunking, and should only affect users who implemented customChunkSizer
s and weren't using one of the provided ones.fn chunk_size(&self, chunk: &str, capacity: &impl ChunkCapacity) -> ChunkSize;
Full Changelog: v0.4.5...v0.5.0
This discussion was created from the release v0.5.0.
Beta Was this translation helpful? Give feedback.
All reactions