Unicode: Basic Multilingual Plane
What is the Basic Multilingual Plane (BMP)?
The Basic Multilingual Plane (BMP) is the range of code points from U+0000 to U+FFFF (decimal 0 to 65535).
This range holds the characters of most writing systems in common modern use.
What characters are outside the BMP?
Most emoji (e.g. 🥲, U+1F972) are outside the BMP, because emoji were added roughly 20 years into Unicode's history.
Some rarely used Chinese characters are outside the BMP.
Scripts of dead languages are often outside the BMP, e.g. Egyptian Hieroglyphs (block starts at U+13000), Cuneiform (U+12000), Linear B (U+10000), Phoenician (U+10900).
Constructed alphabets are often outside the BMP, e.g. the Deseret Alphabet (block starts at U+10400) and the Shavian Alphabet (U+10450).
How to determine if a character is in the BMP?
If the character's code point can be written with 4 or fewer hexadecimal digits (U+0000 to U+FFFF), it is in the BMP. Otherwise, it is not.
Equivalently, if the character's code point in decimal is less than or equal to 65535, it is in the BMP. Otherwise, it is not.
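The rule above can be sketched in a few lines of Python. This is only an illustration: the names `BMP_MAX` and `is_in_bmp` are made up for this example, and it assumes Python 3, where a string is a sequence of code points and `ord` returns the code point of a single character.

```python
BMP_MAX = 0xFFFF  # last code point of the BMP (decimal 65535)


def is_in_bmp(char: str) -> bool:
    """Return True if the character's code point is in U+0000..U+FFFF.

    Hypothetical helper for illustration; expects exactly one code point.
    """
    if len(char) != 1:
        raise ValueError("expected exactly one character")
    return ord(char) <= BMP_MAX


if __name__ == "__main__":
    # A Latin letter, a common Chinese character, an emoji,
    # and the first Egyptian Hieroglyph code point.
    for ch in ["A", "\u4e2d", "\U0001F972", "\U00013000"]:
        print(f"U+{ord(ch):04X}", is_in_bmp(ch))
```

The first two samples (U+0041, U+4E2D) are in the BMP; the last two (U+1F972, U+13000) need 5 hexadecimal digits and are not.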
Unicode and Encoding Explained
- Unicode: Character Set, Encoding, UTF-8, Code Point
- Unicode: Code Point (Char ID)
- Unicode: Character Name
- ASCII Characters
- Unicode: Basic Multilingual Plane
- Unicode: UTF-8 Encoding
- Unicode: UTF-16 Encoding
- Unicode: Surrogate Pair
- Unicode: Byte Order (Endianness)
- Unicode: BOM, Byte Order Mark
- Set Text Editor File Encoding
- Unicode: Letter Character
- Unicode: Variation Selector