Our vision is to bring Natural Language Processing capabilities in Hebrew to a level on par with international industry standards, keeping up with state-of-the-art techniques by providing open source implementations to new algorithms and tools, and making these capabilities publicly available for both public and commercial use.
- Creating, maintaining, adapting and spreading resources that enable high-quality, production-ready, open-licensed Natural Language Processing in Hebrew.
- Enable, foster and catalyze cooperation between stakeholders in academia, private and the public sectors, in order to promote better Open Source Hebrew NLP solutions, and share existing knowledge and tools.
- Public Knowledge Workshop
- DataHack
- Your company/organization/faculty, we hope!
-
Forming a group of volunteers to start work on the core components, during developer meetings of the Public Knowledge Workshop and in other frameworks - including events like hackathons and as part of educational and research projects.
-
Adapting and integrating existing Hebrew NLP Python tools with existing popular frameworks - an NLTK plugin for Hebrew is one option - and creating those tools when they are missing, focusing on:
- Tokenization. Specifically stemming and lemmatization.
- A word embeddings model for Hebrew
- Part-of-speech tagger
-
Encouraging the open-licensing of high quality, open-licensed, tagged and labelled datasets from various domains (social media, articles, research papers, etc.) and for various tasks (part-of-speech tagging, text classification, sentiment analysis, named entity recognition, etc.), and helping in authoring these datasets where they are missing.
- Email us at [email protected]
- Join the discussion in our Facebook group.
- Come to our meetings. Email us for details.
- If you are associated with an organization that already has good, working solutions for some of the problems we are interested in, and you'd like to consider sharing those solutions (or a subset thereof) in a suitable open license, we'd love to hear from you!
- A great list of NLP tools for Hebrew can be found here: https://github.com/iddoberger/awesome-hebrew-nlp
- We are trying to create a more general list of resources for NLP in Hebrew here: https://github.com/NLPH/NLPH/blob/master/Hebrew_NLP_Resources.rst