Fix broken unicode #15

Open
rosnfeld opened this issue Jan 22, 2014 · 3 comments
@rosnfeld
Owner

Some of the Spanish descriptions have odd characters in them - is this a problem in the source data, the "downcasting" to utf-8 from utf-16, or something in the display?

Repro case: look at project number "SCR.550651.1.B2007".

@ghost assigned rosnfeld Jan 22, 2014
@rosnfeld
Owner Author

Hmm, looking at the raw unicode for the repro case, I suspect it is the source data that is wrong. Emacs displays the same characters as Chrome does when reading a freshly-unzipped CSV, so Python doesn't even enter into it.

Also: I can see from the HTTP headers that Django is sending UTF-8, and various "unicode tests" pasted into my HTML render properly.
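
For anyone wanting to check the raw code points themselves, here is a minimal Python sketch. The sample string is a hypothetical illustration (it is not taken from the CRS data), and `describe_chars` is a made-up helper name. It lists each non-ASCII character with its code point and Unicode name, which makes it easy to see whether the text really contains, say, "Ã" followed by "³" rather than a single "ó".

```python
import unicodedata


def describe_chars(text):
    """Return (char, code point, Unicode name) for each non-ASCII character."""
    return [
        (ch, "U+%04X" % ord(ch), unicodedata.name(ch, "<unnamed>"))
        for ch in text
        if ord(ch) > 127
    ]


# Hypothetical sample: "ó" stored as the two characters "Ã" and "³".
sample = "Educaci\u00c3\u00b3n"
for row in describe_chars(sample):
    print(row)
```

If the characters show up like this in the freshly-unzipped file, before any conversion, that points at the source data rather than the display.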

@rosnfeld
Owner Author

Still, perhaps there is a way to "fix" the source data in these cases.
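
One possible approach, as a sketch only: if the odd characters turn out to be the common kind of mojibake where UTF-8 bytes were decoded as Latin-1 somewhere upstream (an assumption; nothing in this thread confirms what the corruption actually is), the text can sometimes be recovered by reversing that step. `repair_mojibake` is a hypothetical helper name, not something in this project.

```python
def repair_mojibake(text):
    """Try to undo UTF-8 text that was mistakenly decoded as Latin-1.

    Assumption: the damage is a single wrong Latin-1 decode. If the
    round trip fails, the input is returned unchanged.
    """
    try:
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Either the text has characters outside Latin-1, or the bytes
        # are not valid UTF-8; in both cases leave it alone.
        return text


# Hypothetical damaged value and a value that is already correct.
print(repair_mojibake("Educaci\u00c3\u00b3n"))
print(repair_mojibake("Educaci\u00f3n"))
```

Falling back to the original string on any error keeps correctly encoded descriptions untouched, so the function can be run over a whole column. It would not help if the characters were lost or replaced before the data was exported.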

@rosnfeld
Owner Author

I am changing the milestone on it and removing the "bug" label as I think it's now (yet another) project to fix CRS data.
