Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Encoding issue #17

Open
mquillot opened this issue Mar 24, 2017 · 1 comment
Open

Encoding issue #17

mquillot opened this issue Mar 24, 2017 · 1 comment

Comments

@mquillot
Copy link

Hi !

Maybe you can add an option when lauching the script.
unicode_error= '....'

You can define this variable when loading the model thanks gensim. To avoid editing the script, maybe you can offer the possibility to the user to add this option.

Line of code :

model = models.Word2Vec.load_word2vec_format(inputfile, binary=$binary, unicode_errors=$error) 
(with true variables) 

without it, i've some errors with my model.

Thanks a lot for you work.
Bye ;)

@lechatpito
Copy link
Contributor

Good idea. Feel free to submit a PR if I take too long to do it 😉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants