Unicode: Code Point (Char ID)
What is Code Point
Code Point is a integer number that is a unique ID for a character in unicode.
Code point starts at 0.
Code Point is represented either in decimal or Hexadecimal.
| char | name | code point | code point in Hexadecimal |
|---|---|---|---|
| a | LATIN SMALL LETTER A | 97 | 61 |
| ฮฑ | GREEK SMALL LETTER ALPHA | 945 | 3B1 |
| ๐ฆ | BUTTERFLY | 129419 | 1F98B |
| LINE FEED (non-printable) | LINE FEED | 10 | A |
Code Point may be non-printable
A Code Point may be non-printable. For example, space, non-breaking space, word joiner, ASCII control character (e.g. Null, Form Feed, Horizontal Tab, Bell), right to left mark, surrogate pair, etc.
Standard Notation for Code Point
The standard notation for code point is โU+โ followed by its code point in Hexadecimal. e.g.
U+3B1
How to Find a Character's Code Point
How to Find a Character, Given Its Code Point
Unicode and Encoding Explained
- Unicode for Coders. Index
- Unicode: Character Set, Encoding, UTF-8, Code Point
- Unicode: Code Point (Char ID)
- Unicode: Character Name
- ASCII Characters
- Unicode: Basic Multilingual Plane
- Unicode: UTF-8 Encoding
- Unicode: UTF-16 Encoding
- Unicode: Surrogate Pair
- Unicode: Byte Order (Endianness)
- Unicode: BOM, Byte Order Mark
- Set Text Editor File Encoding
- Unicode: Letter Character
- Unicode: Variation Selector