Chinese Encoding, Introduction
Here's a summary of major Chinese encoding.
- GB2312
- year . Simplified Characters only. Was used in China. ASCII compatible.
- GBK
- year . Extended GB2312. Includes traditional chars.
- GB18030
- year . Extended both GB2312 and GBK. Charset equivalent to Unicode. . Contains both simplified and traditional characters. ใsee ็ฎไฝ็น้ซๅญ่กจ List of Simplified/Traditional Chinese Charactersใ ASCII compatible. Used in China.
- BIG5
- year . Traditional chars only. Invented in Taiwan. Was used in Taiwan and Hong Kong, before Unicode became popular.
Note, as of 2022, vast majority of Chinese websites use UTF-8, including in China, Taiwan, Hong Kong. ใsee Chinese Websites Encoding survey, Year 2022ใ
Unicode and Encoding Explained
- Unicode: Character Set, Encoding, UTF-8, Codepoint
- Unicode: Codepoint
- Unicode: Character Name
- ASCII Characters
- Unicode: UTF-8 Encoding
- Unicode: UTF-16 Encoding
- Unicode: Surrogate Pair
- Unicode: Byte Order (Endianness)
- Unicode: BOM, Byte Order Mark
- Set Text Editor File Encoding
- Unicode Letter Character
- Unicode: Variation Selector