Chinese Encoding, Introduction

By Xah Lee. Date: .

Here's a summary of major Chinese encoding.

GB2312
year . Simplified Characters only. Was used in China. ASCII compatible.
GBK
year . Extended GB2312. Includes traditional chars.
GB18030
year . Extended both GB2312 and GBK. Charset equivalent to Unicode. . Contains both simplified and traditional characters. [see ็ฎ€ไฝ“็น้ซ”ๅญ—่กจ List of Simplified/Traditional Chinese Characters] ASCII compatible. Used in China.
BIG5
year . Traditional chars only. Invented in Taiwan. Was used in Taiwan and Hong Kong, before Unicode became popular.

Note, as of 2022, vast majority of Chinese websites use UTF-8, including in China, Taiwan, Hong Kong. [see Chinese Websites Encoding survey, Year 2022]

Unicode and Encoding Explained