-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[pt] Enable multi-token spell-checking #10052
Commits on Jan 5, 2024
-
[pt] Adapt resources to multiwords, dictionary fixes
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for 6df212c - Browse repository at this point
Copy the full SHA 6df212cView commit details -
Add smart titlecase method to StringTools
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for ba0e768 - Browse repository at this point
Copy the full SHA ba0e768View commit details -
[pt] Fix multiword prepositions
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for 916e7a6 - Browse repository at this point
Copy the full SHA 916e7a6View commit details -
- improve handling of percent signs (was: [50%OFF], will be: [50%, OFF]); - add some tests due to the latest dictionary version.
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for a8c4f4f - Browse repository at this point
Copy the full SHA a8c4f4fView commit details -
[pt] Add speller tests due to latest dictionary
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for a77bd8d - Browse repository at this point
Copy the full SHA a77bd8dView commit details -
Add titlecasing step to MultiWordChunker class
- multitoken suggestions were failing because we were only checking if they were present in the dictionary by upcasing their first letter; - this failed to account for titlecasing (either naively or a little more smartly), which is relatively frequent; - cf. stuff like "The Lord of the Rings".
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for 34d261b - Browse repository at this point
Copy the full SHA 34d261bView commit details -
p-goulart committed
Jan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for d14d5b1 - Browse repository at this point
Copy the full SHA d14d5b1View commit details -
Improve titlecase logic in MultiWordChunker
p-goulart committedJan 5, 2024 Configuration menu - View commit details
-
Copy full SHA for 57a9ef0 - Browse repository at this point
Copy the full SHA 57a9ef0View commit details
Commits on Jan 8, 2024
-
Add titlecasing option to multi-word chunker
- only Portuguese has it *on*, all other locales have it set to false; - add a simple StringTools method to check if all words in a multi-token string are lowercase (and tests).
p-goulart committedJan 8, 2024 Configuration menu - View commit details
-
Copy full SHA for 1a0dbab - Browse repository at this point
Copy the full SHA 1a0dbabView commit details