GB18030-2000 is a new character set standard issued by China (PRC) on March 17, 2000, and it is a mandatory standard to require all software sold in China (PRC) to comply after January 1, 2001.
GB18030-2000 is a superset of the existing character set GB2312 and GBK and is backward-compatible with them. The current requirement is to support the additional 6582 Unicode Extension-A characters and 2501 minority characters.
For web applications to support GB18030 mainly means two aspects:
-
Insert GB18030 data into data source via browsers
-
Retrieve and display GB18030 data without loss
This article will focus on the support of GB18030 and it will cover the two aspects below above and browser's behavior as well.
Note: Without explicit explanation, “GB18030 data” in this article means the additional 6582 Unicode Extension-A characters and 2501 minority characters.
Leon Zhu
Globalization Certification Lab |