Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add 'source_code_default' tokenizer with preset filters. #3655

Merged
merged 4 commits into from
Jul 18, 2023

Conversation

fmassot
Copy link
Contributor

@fmassot fmassot commented Jul 17, 2023

The source_code tokenizer can be used to define a customized tokenizer with filters.

By default, we offset a source_code_default tokenizer shipped with filters lowercase, remove_long, ascii_folding. We target to have an ascii_folding with preserve_original.

@fmassot fmassot requested a review from fulmicoton July 17, 2023 14:33
@fmassot fmassot changed the title Add 'source_code_default' tokenizer with a preset filters. Add 'source_code_default' tokenizer with preset filters. Jul 17, 2023
@fmassot fmassot force-pushed the fmassot/rename-source-code-tokenizer branch from 0278dbd to c904007 Compare July 17, 2023 16:58
@fmassot fmassot force-pushed the fmassot/rename-source-code-tokenizer branch from a1b1bdf to f0c1664 Compare July 17, 2023 17:13
// For commercial licensing, contact us at [email protected].
//
// AGPL:
// This program is free software: you can redistribute it and/or modify
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this file is here by mistake

Copy link
Contributor

@fulmicoton fulmicoton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove the http_source maybe?

@fmassot fmassot requested a review from fulmicoton July 18, 2023 02:15
@fmassot fmassot merged commit 24010ff into main Jul 18, 2023
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants