Wiley --> wiley.com

The Web Testing Companion: The Insider's Guide to Efficient and Effective Tests

Lydia Ash

Language Guides: Chinese

IME Chinese (PRC)
Keyboard layout Chinese (Simplified) - MS PinYin98
Keystrokes ni [space] [enter] hao [space] [enter]
Input characters
Unicode positions [U+4F60] [U+597D]
Code page points - 936 [C4/E3] [BA/C3]
Display 你好
Pronunciation Ni hao - meaning Hello in Mandarin
Unicode ranges U+2E80 - U+2EFF CJK Radicals Supplement
U+2F00 - U+2FDF KangXi Radicals
U+2FF0 - U+2FFF Ideographic Description Characters
U+3000 - U+303F CJK Symbols and Punctuation
U+3100 - U+312F Bopomofo
U+3190 - U+319F Kanbun
U+31A0 - U+31BF Bopomofo Extended
U+3200 - U+32FF Enclosed CJK Letters and Months
U+3300 - U+33FF CJK Compatibility
U+3400 - U+4DB5 CJK Unified Ideographs Extension A
U+A000 - U+A48F Yi Syllables
U+A490 - U+A4CF Yi Radicals
U+4E00 - U+9FFF CJK Unified Ideographs
U+F900 - U+FAFF CJK Compatibility Ideographs
U+FE30 - U+FE4F CJK Compatibility Forms
Fonts Windows: Simplified - GB2312:  MS Hei, MS Song, NSimSun, SimHei, SimSun, SonTi Traditional - Big5: MingLiU, PMingLiU, MS MingliU
Macintosh: Simplified - Hei
Unix: Simplified - Arphic, Bitstream Cyberbit Traditional - Arphic


One of the things that makes Chinese difficult is that there are two separate alphabets, depending on if you are referring to Simplified Chinese (used in the People's Republic of China and Singapore) or Traditional Chinese (used in Taiwan and Hong Kong and sometimes just referred to as Chinese). Each of these has a different standard that applied to it prior to the development of Unicode.

Simplified Chinese (CHS) was standardized by the GB 2312-80, which was developed from the ISO 646 standard, and contains some Hanzi corrections. One of the problems that existed with the GB 2312 standard was that all copies of this code page were done by hand, so there was the possibility of slight alterations from one copy to the next, or of slight changes. Microsoft's Windows 936 code page is the more typical standard for this language. The government of China recently reworked the Chinese code pages into a more comprehensive one called GB 18030 (or more accurately GB 18030-2000). This code page is much larger than the typical double-byte character set since it goes up to 4 bytes, making for many great test cases if your application will be supporting this code page.

Chinese Traditional (CHT) was standardized by the code page known as Big Five, which is also the name of the encoding applied to that particular code page. The Windows 950 code page takes Big Five and adds in row 89 of the ETen extensions to support Taiwanese as well.

Chinese is the most widely spoken language in the world; however, because it is divided into dialects, it can appear as though it were smaller. Close to 1.5 billion people speak Chinese throughout the People's Republic of China, Taiwan, Hong Kong, Malaysia, Singapore, Vietnam, Brunei, Cambodia, Thailand, and the United States. It numbers almost twice as many speakers as English.

When testing Chinese, you have several accept languages to check on MS Internet Explorer. For Chinese Traditional, you can select Chinese (Taiwan) [zh-tw], Chinese (Hong Kong) [zh-hk], Chinese (Macau) [zh], or just Chinese [zh]. For Chinese Simplified, you will want to select Chinese (PRC) [zh-cn], or Singapore [zh-sg].
When testing with Netscape Navigator, you will need to use Chinese [zh] or Chinese/Taiwan [zh-tw] for Chinese Traditional, and Chinese/China [zh-cn] for Chinese Simplified.

Fonts and encoding will need to be set appropriately for what you are testing.

 



Cover

ISBN 0-4714-30218
578 Pages
May, 2003

Wiley Technology Publishing
Timely. Practical. Reliable.

 
[Book Home] [Links] [App. B] [App. G] [App. L] [Lang Guides] [Code Pgs] [Samples] [HTTP Responses] [Questions] [Templates] [System Guides] [Readings]