Persian language support #773
Replies: 1 comment 1 reply
-
Hello @Ja7ad, First, we need to define the Language specificities we want to support; I don't know anything about Persian, so we'll have to help each other. There are 2 main steps during the tokenizing process that we could specialize for Persian:
Then, when we define the changes to make, you'll be able to create a PR on Charabia; the repository is made easy to contribute to, even for people unfamiliar with Rust. The CONTRIBUTING.md file is an excellent starting point for implementing a normalizer or a segmenter. Thank you for creating this discussion. 🪴 See you! |
Beta Was this translation helpful? Give feedback.
-
Hello,
In the new version (1.10), index settings support the
localizedAttributes
feature based on this issue.I checked Notion, and I couldn't find support for the Persian language, only Arabic is supported.
Language settings in v1.10
How can I add the Persian language?
Beta Was this translation helpful? Give feedback.
All reactions