Chinese Encoding, Introduction

By Xah Lee. Date: .

Here's a summary of the major Chinese encodings.

GB2312
year 1980. Simplified characters only. Was used in China. ASCII compatible.
GBK
year 1995. Extends GB2312. Includes traditional characters.
GB18030
year 2000. Extends both GB2312 and GBK. Charset equivalent to Unicode. Contains both simplified and traditional characters. 〔see 简体繁體字表 List of Simplified/Traditional Chinese Characters〕 ASCII compatible. Used in China.
BIG5
year 1984. Traditional characters only. Invented in Taiwan. Was used in Taiwan and Hong Kong before Unicode became popular.
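
The coverage differences in the list above can be checked with a small script. This is a minimal sketch, assuming Python's built-in codecs named "gb2312", "gbk", "gb18030", and "big5" are a fair stand-in for the standards themselves. The helper names can_encode and coverage, and the sample characters 语 (simplified) and 語 (traditional), are chosen here for illustration only.

```python
# Python codec names for the encodings summarized above, plus UTF-8.
CODECS = ["gb2312", "gbk", "gb18030", "big5", "utf-8"]


def can_encode(text, codec):
    """Return True if every character in text can be encoded with codec."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


def coverage(text):
    """Map each codec name to whether it can represent text."""
    return {codec: can_encode(text, codec) for codec in CODECS}


if __name__ == "__main__":
    # Hypothetical samples: one simplified char, one traditional char, one ASCII char.
    samples = {"simplified": "语", "traditional": "語", "ascii": "A"}
    for label, text in samples.items():
        print(label, text, coverage(text))
```

With these samples, the simplified character should fail only under big5, and the traditional character should fail only under gb2312. The ASCII letter should encode to the same single byte in all five, which is what "ASCII compatible" means in practice.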

Note: as of 2022, the vast majority of Chinese websites use UTF-8, including those in China, Taiwan, and Hong Kong. 〔see Chinese Websites Encoding survey, Year 2022〕

Unicode and Encoding Explained