The Web Testing Companion: The Insider's Guide to Efficient and Effective Tests
Lydia Ash
Language Guides: Chinese
| IME |
Chinese (PRC) |
| Keyboard layout |
Chinese (Simplified) - MS PinYin98
|
| Keystrokes |
ni [space] [enter] |
hao [space] [enter] |
| Input characters |
你 |
好 |
| Unicode positions |
[U+4F60] |
[U+597D] |
| Code page points - 936 |
[C4/E3] |
[BA/C3] |
| Display |
你好 |
| Pronunciation |
Ni hao - meaning Hello in Mandarin
|
| Unicode ranges |
U+2E80 - U+2EFF CJK Radicals Supplement
U+2F00 - U+2FDF KangXi Radicals
U+2FF0 - U+2FFF Ideographic Description Characters
U+3000 - U+303F CJK Symbols and Punctuation
U+3100 - U+312F Bopomofo
U+3190 - U+319F Kanbun
U+31A0 - U+31BF Bopomofo Extended
U+3200 - U+32FF Enclosed CJK Letters and Months
U+3300 - U+33FF CJK Compatibility
U+3400 - U+4DB5 CJK Unified Ideographs Extension A
U+A000 - U+A48F Yi Syllables
U+A490 - U+A4CF Yi Radicals
U+4E00 - U+9FFF CJK Unified Ideographs
U+F900 - U+FAFF CJK Compatibility Ideographs
U+FE30 - U+FE4F CJK Compatibility Forms |
| Fonts |
Windows: Simplified - GB2312:
MS Hei, MS Song, NSimSun, SimHei, SimSun, SonTi Traditional - Big5:
MingLiU, PMingLiU, MS MingliU
Macintosh: Simplified - Hei
Unix: Simplified - Arphic, Bitstream Cyberbit Traditional
- Arphic |
One of the things that makes Chinese difficult is that there are two separate
alphabets, depending on if you are referring to Simplified Chinese (used
in the People's Republic of China and Singapore) or Traditional Chinese
(used in Taiwan and Hong Kong and sometimes just referred to as Chinese).
Each of these has a different standard that applied to it prior to the
development of Unicode.
Simplified Chinese (CHS) was standardized by the GB 2312-80, which was
developed from the ISO 646 standard, and contains some Hanzi corrections.
One of the problems that existed with the GB 2312 standard was that all
copies of this code page were done by hand, so there was the possibility
of slight alterations from one copy to the next, or of slight changes.
Microsoft's Windows 936 code page is the more typical standard for this
language. The government of China recently reworked the Chinese code pages
into a more comprehensive one called GB 18030 (or more accurately GB 18030-2000).
This code page is much larger than the typical double-byte character set
since it goes up to 4 bytes, making for many great test cases if your
application will be supporting this code page.
Chinese Traditional (CHT) was standardized by the code page known as
Big Five, which is also the name of the encoding applied to that particular
code page. The Windows 950 code page takes Big Five and adds in row 89
of the ETen extensions to support Taiwanese as well.
Chinese is the most widely spoken language in the world; however, because
it is divided into dialects, it can appear as though it were smaller.
Close to 1.5 billion people speak Chinese throughout the People's Republic
of China, Taiwan, Hong Kong, Malaysia, Singapore, Vietnam, Brunei, Cambodia,
Thailand, and the United States. It numbers almost twice as many speakers
as English.
When testing Chinese, you have several accept languages to check on MS
Internet Explorer. For Chinese Traditional, you can select Chinese (Taiwan)
[zh-tw], Chinese (Hong Kong) [zh-hk], Chinese (Macau) [zh], or just Chinese
[zh]. For Chinese Simplified, you will want to select Chinese (PRC) [zh-cn],
or Singapore [zh-sg].
When testing with Netscape Navigator, you will need to use Chinese [zh]
or Chinese/Taiwan [zh-tw] for Chinese Traditional, and Chinese/China [zh-cn]
for Chinese Simplified.
Fonts and encoding will need to be set appropriately for what you are
testing.
|
|
|