⭐ Star it ⤴️ if you love it!
Explains Unicode and character encoding to software engineers, and the pitfalls of working with international characters in Java, JavaScript and Python.
- Overview
Goes over concepts of international character identification and Unicode.
- Unicode Support in Programming Languages
Goes over how to handle international text in many programming languages: matching with powerful regular expressions, converting case, and extracting numeric data from text.
- Java
- Code Examples
- Encoding
Goes over how to encode international text when reading and writing files, and what programmers need to watch for to avoid garbled data.
- Encoding Concepts
- Encoding Details
- Java
- JavaScript
- Python
- Code Examples
- Regular Expressions
Goes over how to craft regular expressions to match Unicode characters in different languages.
- Regular Expression Concepts
- Code Examples
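
As a quick taste of the encoding pitfall described above, here is a minimal Python sketch. It is not taken from this repository's code examples; the sample string, the temporary file name, and the choice of `latin-1` as the "wrong" encoding are all assumptions made for illustration.

```python
import os
import tempfile

# Hypothetical sample text containing non-ASCII characters.
text = "Zoë naïve café"

# Write with an explicit encoding instead of relying on the platform default.
path = os.path.join(tempfile.mkdtemp(), "sample.txt")
with open(path, "w", encoding="utf-8") as f:
    f.write(text)

# Reading with the same encoding round-trips the text unchanged.
with open(path, encoding="utf-8") as f:
    round_tripped = f.read()

# Reading with a different encoding raises no error here,
# but each multi-byte character is decoded as two wrong characters.
with open(path, encoding="latin-1") as f:
    garbled = f.read()

print(round_tripped == text)  # True
print(garbled == text)        # False
print(len(text), len(garbled))  # 14 17
```

The point of the sketch is that the mismatched read fails silently: no exception is raised, so the garbled text can travel a long way before anyone notices. Naming the encoding explicitly on both the write and the read is the simplest guard.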
For additional code, please see utf8db2.
Video presentation of the What a Character content.
"So I have an announcement to make: if you are a programmer working in 2003 and you don't know the basics of characters, character sets, encodings, and Unicode, and I catch you, I'm going to punish you by making you peel onions for 6 months in a submarine. I swear I will."