Chinese Encoding, Introduction

By Xah Lee. Date: . Last updated: .

Here's a summary of major Chinese encoding.

GB2312

year . Simplified Characters only. Was used in China. ASCII compatible.

GBK

year . Extended GB2312. Includes traditional chars.

GB18030

year . Extended both GB2312 and GBK. Charset equivalent to Unicode.

Contains both simplified and traditional characters. [see ็ฎ€ไฝ“็น้ซ”ๅญ—่กจ List of Simplified/Traditional Chinese Characters] ASCII compatible. Used in China.

BIG5

year . Traditional chars only. Invented in Taiwan. Was used in Taiwan and Hong Kong, before Unicode became popular.

Note, as of 2022, vast majority of Chinese websites use UTF-8, including in China, Taiwan, Hong Kong. [see Chinese Websites Encoding survey, Year 2022]

Chinese Characters in Computer Languages