Filter profanity from public user input in backend too #665

Open
4 tasks
devmount opened this issue Sep 16, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@devmount
Collaborator

Is your feature request related to a problem? Please describe.
With #574 we have a working profanity filter on the frontend. To prevent people from bypassing client-side validation by sending their own HTTP requests to our API (for example via the Firefox dev console or network tools), we need to filter profanity on the backend as well.

Describe the solution you'd like
No concrete solution yet, just some questions to discuss:

  • 1. Do multilingual profanity lists exist anywhere in the Mozilla universe?
  • 2. Do we want to curate those lists ourselves or use a third party package?
  • 3. Is word matching enough? Or do we need/want some kind of sentiment analysis?
  • 4. Automatic profanity filtering is never perfectly accurate. How do we deal with false positives and false negatives? Do we need some kind of "Report" button so users can tell us about inappropriate content?
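
To make questions 1 to 3 a bit more concrete, here is a minimal sketch of what plain word matching on the backend could look like. It only uses the Python standard library. The names `BLOCKLISTS`, `normalize` and `find_profanity` are hypothetical, and the word lists are harmless placeholders, not a proposal for real content.

```python
import re
import unicodedata

# Hypothetical placeholder lists keyed by locale. Real lists would be
# curated by us or loaded from a third-party source.
BLOCKLISTS = {
    "en": {"darn", "heck"},
    "de": {"mist"},
}

_WORD_RE = re.compile(r"\w+")


def normalize(text: str) -> str:
    """Casefold and strip diacritics so 'Dárn' and 'darn' compare equal."""
    decomposed = unicodedata.normalize("NFKD", text.casefold())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def find_profanity(text: str, locales=("en", "de")) -> set:
    """Return the blocklisted words that occur as whole words in text."""
    words = set(_WORD_RE.findall(normalize(text)))
    blocked = set().union(*(BLOCKLISTS.get(loc, set()) for loc in locales))
    return words & blocked


if __name__ == "__main__":
    print(find_profanity("Well, DARN it"))   # flagged
    print(find_profanity("Darning socks"))   # not flagged: whole words only
    print(find_profanity("morning mist"))    # flagged via the "de" list
```

Matching whole words avoids flagging harmless words that merely contain a blocked substring, but the last call shows why merging multilingual lists can still produce false positives: a word that is fine in one language may be on another language's list.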

Additional context
Maybe we could use something like this:
https://pypi.org/project/profanity-check/
It uses a bag-of-words model to vectorize input strings before feeding them to a linear classifier.
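
For anyone unfamiliar with the approach, here is a rough standard-library sketch of the idea: count words regardless of order, then compute a weighted sum plus a bias. This is not the package's actual code. `WEIGHTS`, `BIAS` and the function names are made up for illustration, and in a real model the weights would be learned from labeled data rather than set by hand.

```python
import re
from collections import Counter

# Hypothetical hand-set weights. A trained classifier would learn these.
WEIGHTS = {"darn": 2.0, "heck": 1.5, "you": 0.25, "thanks": -1.0, "please": -0.5}
BIAS = -1.0


def bag_of_words(text: str) -> Counter:
    """Count word occurrences, ignoring word order and case."""
    return Counter(re.findall(r"\w+", text.casefold()))


def score(text: str) -> float:
    """Linear decision function: sum of weight * count, plus bias."""
    counts = bag_of_words(text)
    return sum(WEIGHTS.get(word, 0.0) * n for word, n in counts.items()) + BIAS


def is_offensive(text: str, threshold: float = 0.0) -> bool:
    """Classify text as offensive when its score exceeds the threshold."""
    return score(text) > threshold


if __name__ == "__main__":
    for sample in ("darn you", "thanks, see you", "oh heck, thanks"):
        print(sample, "->", is_offensive(sample))
```

Unlike a plain blocklist, the surrounding words shift the score, so the same term can end up on either side of the threshold depending on context. As far as I can tell from its README, the linked package exposes this kind of decision through `predict` and `predict_prob`.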

@devmount added the enhancement label on Sep 16, 2024