Added Lingo::Attendee::LsiFilter to correlate semantically related terms
(LSI) over the "corpus" of all files processed during a single program
invocation; requires lsi4r, which in turn requires rb-gsl. [EXPERIMENTAL:
The interface may be changed or removed in the next release.]
Added Lingo::Attendee::HalFilter to correlate semantically related terms
(HAL) over individual documents; requires hal4r, which in turn requires
rb-gsl. [EXPERIMENTAL: The interface may be changed or removed in the next
release.]
Added Lingo::Attendee::AnalysisFilter and associated lingoctl tooling.
Multiword dictionaries can now identify hyphenated variants (e.g. automatic
data-processing); set hyphenate: true in the dictionary config.
Lingo::Attendee::Tokenizer no longer considers hyphens at word edges as part
of the word. As a consequence, Lingo::Attendee::Dehyphenizer has been
dropped.
Dropped Lingo::Attendee::NonewordFilter; use Lingo::Attendee::VectorFilter
with option lexicals: '\?' instead.
Lingo::Attendee::TextReader and Lingo::Attendee::TextWriter learned an
encoding option to read/write text that is not UTF-8 encoded; configuration
files and dictionaries still need to be UTF-8, though.
Lingo::Attendee::TextReader and Lingo::Attendee::TextWriter learned to
read/write Gzip-compressed files (file extension .gz or .gzip).
Lingo::Attendee::Sequencer learned to recognize 0 in the pattern to match
number tokens.
Fixed Lingo::Attendee::TextReader to recognize a byte order mark (BOM) in
input files; this does not apply to input read from STDIN.
Fixed regression introduced in 1.8.6 where Lingo::Attendee::Debugger would
no longer work immediately behind Lingo::Attendee::TextReader.
Fixed lingoctl copy commands when overwriting existing files.
Refactored Lingo::Database::Crypter into a module.
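
For readers unfamiliar with what the encoding, Gzip and BOM entries above amount to, here is a minimal sketch using only Ruby's standard library. It is not Lingo's implementation: the helper read_text and its name: and encoding: parameters are hypothetical and exist only for illustration. The only details taken from the entries above are the .gz/.gzip file extensions, the idea of an encoding option, and BOM recognition.

```ruby
require 'zlib'
require 'stringio'

# Hypothetical helper (not part of Lingo): read text from an IO object,
# decompressing when the name has a Gzip extension, transcoding from the
# given source encoding to UTF-8, and dropping a leading BOM if present.
def read_text(io, name:, encoding: 'UTF-8')
  # Decide on decompression by file extension only.
  gzipped = name.end_with?('.gz', '.gzip')
  raw = gzipped ? Zlib::GzipReader.new(io).read : io.read

  # Interpret the raw bytes in the source encoding, then convert to UTF-8.
  text = raw.force_encoding(encoding).encode('UTF-8')

  # After transcoding, a BOM shows up as a leading U+FEFF character.
  text.delete_prefix("\uFEFF")
end

# Usage: a Gzip-compressed, ISO-8859-1 encoded "file" held in memory.
data = Zlib.gzip("h\u00E9llo".encode('ISO-8859-1'))
puts read_text(StringIO.new(data), name: 'sample.txt.gz', encoding: 'ISO-8859-1')
```

Stripping the BOM after transcoding keeps the check independent of the source encoding, since the mark always becomes U+FEFF once the text is UTF-8.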