1.2.0: Bert Pipe
- Add building blocks for BERT tokenizer construction, including
AddWhitespaceAroundCJK
,AddWhitespaceAroundPunctuation
,MergeWhiteSpaceCharacters
,StripWhiteSpaceCharacters
,
StripAccentToken
,WhiteSpaceTokenizer
andSpanSubwords
. - Bert pipes are created in
uttut/pipeline/bert/
.