Unicode for Programmers
What is Unicode
- Unicode: Character Set, Encoding, UTF-8, Codepoint
- Unicode: Codepoint
- Unicode: Character Name
- ASCII Characters
- Unicode: UTF-8 Encoding
- Unicode: UTF-16 Encoding
- Unicode: Surrogate Pair
- Unicode: BOM, Byte Order Mark
- What Encoding Do Chinese Websites Use?
- Best Unicode Fonts for Programmers
- Unicode UTF-8 History, by Rob Pike
Syntax, Semantics, Design
- Problems of Symbol Congestion in Computer Languages; ASCII Jam vs Unicode
- Unicode BOM Hack
- HTML Entities, Ampersand, Unicode, Semantics
- URL Percent Encoding and Unicode
- Syntax Design: Use of Unicode Matching Brackets as Specialized Delimiters
- Syntax Semantics Design: Use of Unicode Ellipsis Symbol vs Dot Dot Dot
- Unicode Semantics: The ∀ in Turn A Gundam
- Semantics of Symbols: Examples Unicode Symbols Usage
- Unicode Symbol for “e.g.” (exempli gratia)
- Unicode RIGHTWARDS BLACK ARROW and BLACK RIGHTWARDS ARROW Problem ⬅ ➡ ⮕
Unicode Support in Computer Languages
- Unicode Support in Programming Language Function Names and Operators
- Character Sets and Encoding in HTML
- HTML XML Entities
- Using Unicode in HTML Attributes
- Python: Unicode 🐍
- Ruby: Unicode Tutorial 💎
- Perl: Unicode Tutorial 🐪
- WolframLang: Source Code Encoding and Unicode
- Unicode in Java
- Linden Scripting Language (LSL) Unicode Support
- JS: Default Charset/Encoding
- JS: String is 16-Bit Unit Sequence
- JS: String.fromCodePoint
- JS: Unicode Character Escape Sequence
- JS: Allowed Characters in Identifier
Unicode Support in Tools
- Unicode Support in File Names: Windows, Mac, Emacs, Unison, Rsync, USB, Zip
- Unicode Character Equivalence Support in Web Browsers
- Unicode Font Comparison: Arial Unicode MS vs DejaVu Sans
Processing Unicode
- Python: Get Unicode Name, Codepoint
- Python: Convert File Encoding
- Converting Charset Encoding with Java
Unicode and Emacs
- Emacs and Unicode Tips
- Emacs Unicode Browser (xub-mode)
- Emacs: xah-math-input.el
- Emacs Keyboard Macro Example: Insert All Unicode Bullets
- Emacs: Remapping Keys Using key-translation-map
Misc, Unsorted
- Invisible Character from Twitter
- The Journey of a Foreign Character Through the Internet
- Linux Hacker Propaganda on UTF-8 Encoding