The XML Logo (from the 
      XML FAQ)

Chinese XML Now! ´ú¸Õ¥ÎÀÉ®×

1999-01-24

³o¨Ç´ú¸Õ¥ÎÀɮ׬O¨Ñ´ú¸Õ XML ³nÅé¤u¨ãªº¤@¨Ç¤pªº XML ÀÉ¡C¨C­ÓÀÉ®× ¥u°w¹ï³nÅ骺¬Y¶µ¥\¯à¶i¦æ´ú¸Õ¡C³o¨ÇÀɮצP®É¥H .xml ©Î .txt ÀɦW©µ¦ù (extension) ´£¨Ñ¡C [»¡©ú]

§Ú­Ì¤]´£¨Ñ¤@­Ó±N©Ò¦³ªº´ú¸ÕÀɦX¨Ö¦b¤@°_ªº ¦X¨Ö´ú¸ÕÀÉ¡C³o¼Ë¤è«K±z¥i¥H«Ü§ÖªºÀ˵ø³o¨ÇÀɮסC §Ú­Ì¤]´£¨Ñtar ÀɤÀ°e

°²¦p±z¥Î XML Web ÂsÄý¾¹Åª¨ú³o¨ÇÀɮסA¦ý¨S¦³¥¿½TªºÅã¥Ü¥XÀɮפº®e¡A ½Ð¤Å¾á¼~¡C¦]¬°¦³¨ÇÂsÄý¾¹¨S¦³©w¸q "¤º©wªº¼Ë¥Ü" (default style)¡A ¦]¦Ó©ÒÅã¥Üªºµ²ªG¥i¯à¤£¥¿½T©Î¤@¹Î¼Ñ¡C¦Ó¦³¨ÇÂsÄý¾¹¥i¯à¥u±µ¨ü¦³ ¼Ë¥Üªí (stylesheet) ªº XML ÀÉ¡C¥t¤@ºØ±¡ªp¬OÂsÄý¾¹¨S¦³¥]§t¤¤¤å¦r§Î (Han Ideogram)¡A³o¨Ç¦r´N·|¥H¤è®æ©ÎªÅ¥Õ¨ÓÅã¥Ü¡C³o¨Ç³£¬O¦X©ó³W©w ªºª¬ªp¡C

¦ý¬O°²¦pÂsÄý¾¹±N¼Ð¥Ü (markup) ¸ÑĶ¿ù»~©ÎµLªk±µ¨ü¤¤¤åªº¤¸¯À¦WºÙ (element name)¡A©ÎµLªk±N¼Æ¦r¦r¤¸°Ñ¤Þ (numeric character references) µø¬° ISO 10646 ªº¦r¶°³B²zµ¥°ÝÃD³£¬O³nÅé³]­p¤Wªº¿ù»~ (bug)¡C°²¦p±z ¨Ï¥Îªº¬O beta ª©ªº³nÅé¡A¤£­n·Pı®ð¾k¡A¼t°Ó¥i¯à¥¿¦b§ä¥X³o¨Ç¿ù»~¡C

³o¨ÇÀɮץѳo­Ó Web ¦øªA¾¹°e¥X®É¡A¨ä MIME Ãþ§O¬O³]©w¦p¤U¡G

[³o­Ó Web ¦øªA¾¹¨Ã¨S¦³°e¥X MIME ¦r¶°¸ê°T (charset information)¡C]

®æ¦¡¨}¦n (Well-formed) ªº XML ÀÉ¡A¤£¥]§t¼Ë¦¡ªí (Stylesheet)¡A ¨S¦³ DOCTYPE «Å§i¡A¨S¦³¦WºÙ»â°ì (Namespace)

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[µù¡G¨S¦³´£¨Ñ°w¹ï XML ªº PI ¤Îµù¸Ñ (Comment) ªº´ú¸Õ]

®æ¦¡¨}¦n (Well-formed) XML, ¦³¼Ë¦¡ªí (Stylesheet), ¨S¦³ DOCTYPE «Å§i, ¨S¦³ ¦WºÙ»â°ì (Namespace)

¤U¦C´ú¸Õ³£·|±Ò°ÊCSS Style Sheet¡C ³o­Ó¼Ë¦¡ªí¤¤¨Ï¥Î¤F°w¹ï¤¤¤åªº¤¸¯À¦WºÙ (Element Type Name, GI) ¶i¦æ¼Ë¦¡³]©w¡C³o¬O´ú¸Õ³nÅé¬O§_¤ä´©¥»¦a»y¨¥¼Ð¥Ü (Native Language Markup) ªº¥\¯à¡Cµù¡G³o­Ó CSS ¼Ë¦¡ªí¬O¤@­Ó¤@¯ë¤å¦rÀÉ¡FBig5¡AGB2312¡AUTF-8 µ¥ ©Ò¦³ªº´ú¸Õ¥ÎÀɳ£¨Ï¥Î¥¦¡C

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[µù¡G¨S¦³´£¨Ñ°w¹ï XML ªº PI ¤Îµù¸Ñ (Comment) ªº´ú¸Õ]

®æ¦¡¨}¦n (Well-formed) XML, ¨S¦³¼Ë¦¡ªí (Stylesheet), ¦³ DOCTYPE «Å§i, ¨S¦³¦WºÙ»â°ì (Namespace)

¤U¦C´ú¸Õ¥ÎÀɳ£¥]§t¤@­Ó DOCTYPE «Å§i¡C¦b¦¹«Å§i¤¤¡A SYSTEM ÃѧO²Å (identifier) ¤¤ªº¼Ð¥Ü«Å§i ¨Ï¥Î¤F¤¤¤å (hanzi, kanji) °µ¬°¤¸¯ÀÃþ§O¦WºÙ (Element Type Name, GI)¡C µù¡G³o­Ó«Å§i¤¤ªº¹êÅé (entity) ¥u´£¨Ñ UTF-8 ½s½X¡FBig5, GB2312 ¤Î UTF-8 ´ú¸Õ¥ÎÀɳ£¨Ï¥Î³o­Ó¹êÅé¡C

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[µù¡G¨S¦³´£¨Ñ°w¹ï XML ªº PI ¤Îµù¸Ñ (Comment) ªº´ú¸Õ]

®æ¦¡¨}¦n (Well-formed) XML, ¨S¦³¼Ë¦¡ªí (Stylesheet), ¨S¦³ DOCTYPE «Å§i, ¦³¦WºÙ»â°ì (Namespace)

¦b³o¨Ç´ú¸Õ¥ÎÀÉ xmlns Äݩʪº¦WºÙ»â°ì (Namespace) ¥u¬O­Ó¦WºÙ¦Ó¤w¡C ¦Ó¨S¦³°w¹ï¬Y­Ó schema ©w¸qÀÉ¥i³Q¸ÑĶ (resolve)¡C

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[µù¡G¨S¦³´£¨Ñ°w¹ï XML ªº PI ¤Îµù¸Ñ (Comment) ªº´ú¸Õ]

¦X¨Ö´ú¸ÕÀÉ

®æ¦¡¨}¦n (Well-formed) XML, ¦³¼Ë¦¡ªí (Stylesheet), ¦³ DOCTYPE «Å§i, ¨S¦³¦WºÙ»â°ì (Namespace), xml:lang attributes, standalone.

³o­Ó´ú¸Õ¥]§t«e­±ªº¦UºØª¬ªp¡D³o­Ó´ú¸Õ°w¹ï¤¤¤å¤å¥ó¤¤ªº¦h¶µ­«­nªº °ò¥»»Ý¨D¥[¥H´ú¸Õ.

Test
UTF-8
Big5
GB2312
13) This file has all the test cases of test files 1 to 12.
UTF-8
Big5
GB2312

¬ÛÃö´ú¸Õ¥ÎÀÉ

Test UTF-8 Big5 GB2312
Charles Muller's Dictionary of East Asian Buddhist Terms is a multilingual resource. Excellent for demonstrating. XML instances, DTD, XSL stylesheet. . .
FujiXerox Japanese document xml . .
  A test of the html:lang and xml:lang attributes. Chinese and English.   HTML-in-XML . .
  A test of the xml:lang attributes. Chinese and English.   WF XML . .
  Chinese and English, with xml:lang attribute.   English - Chinese XML Glossary (en, zh)) English - Chinese XML Glossary (zh, en ) English - Chinese XML Glossary (zh, en)

¥t¥~ÁÙ¦³¤@¨Ç°w¹ï¤£¦P»y¨¥¤Î½s½Xªº HTML ´ú¸ÕÀÉ, ¨ä¤¤¤]¥]§t¦b MIME ¼ÐÀY ¦³ "charset" ªº¼Ð¥Ü, ±z¥i¥H¦b Vancouver ºô¯¸ªº Using Multiple Languages in HTML ºô­¶¤¤¨ú±o.



TAR ÀɤÀ°e (Distribution)

¥i¥Ñ¤U¦CÀɮרú±o©Ò¦³ªº´ú¸Õ¥ÎÀɤΨä¦U­Ó¥Ø¿ý¡G http://www.ascc.net/xml/zh-xml-test.tar.gz. (Àɮפj¤p¬ù 70 K)¡C Àɮ׬O¨Ï¥Î UNIX ªº tar ¤Î GNU ªº gzip ®æ¦¡¦s©ñ¡A¦ý¬O±z¤]¥i¥H¥Î PC Àô¹Òªº¸ÑÀ£ÁYµ{¦¡ (¨Ò¦p¡GWinZIP) ¨Ó¸Ñ¶}¡Cµù¡G¦³¨Ç¸ÑÀ£ÁYµ{¦¡ ·|±NÀɦW§ó§ï¬° zh-xml-test_tar.gz¡F±z¥i¥H¦A¥H¤â°Ê ¤è¦¡§ï¦^¬°.tar¡C

°²¦p±z¥Î¨ì³o­ÓÀɮסA½Ðª`·N¨ä¤¤ªº¤º®e¥i¯à·|¸g±`ªº§ó§ï¡A©Ò¥H½Ð ³s±µ¨ì¥»ºô¯¸¡A¦ÓÁ×§K¦b±zªººô­¶¤¤¤À°e³o­ÓÀɮסC



³Ì«á...³o¨à¬O­Ó HTTP ¼ÐÀY¦^À³Åã¥ÜªA°È (header echo service) ±z¥i¥H¬Ý¨ì±zªºÂsÄý¾¹°e¥X¤°»ò¸ê°T¡C³o­ÓªA°È¬O Pascal's Header Echo¡C

[­^¤å­¶]
­^¤å­¶
(UTF-8)
[¤¤¤å­¶]
¤¤¤å­¶
(UTF-8)
[¤¤¤å­¶]
ÁcÅé
¤¤¤å­¶
(Big5)
[¤¤¤å­¶]
²Åé
¤¤¤å­¶
(GB 2312)

¤¤¥¡¬ã¨s°|

[­^¤å­¶][¤¤¤å­¶]

[ª©ÅvÁn©ú]