
Chinese XML Now! Test Files

1998-12-28

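These test files are small XML files for testing XML software tools. Each file tests only one particular feature of the software. The files are also provided with the .xml.txt file name extension. [Explanation]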

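If you read these files with an XML Web browser and the contents are not displayed correctly, do not worry. Some browsers define no "default style", so what they display may be incorrect or a mess. Some browsers may accept only XML files that have a stylesheet. Another possibility is that the browser has no Chinese glyphs (Han ideographs), in which case those characters are shown as boxes or blanks. All of these behaviors are conforming.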

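However, if the browser misinterprets markup, cannot accept Chinese element names, or fails to treat numeric character references as referring to the ISO 10646 character set, these are bugs in the software. If you are using beta software, do not be discouraged; the vendor may already be tracking these bugs down.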
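As a rough illustration of the behavior described above, the following minimal sketch feeds a parser a document with a Chinese element type name, a Chinese attribute name, and numeric character references. The sample document and the function name `parse_sample` are assumptions made for this example and are not taken from the test files; Python's standard `xml.etree.ElementTree` is used only as one convenient non-validating parser.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample, not one of the actual test files: a Chinese element
# type name and attribute name, with U+4E2D given once as a decimal NCR
# (&#20013;) and once as a hexadecimal NCR (&#x4E2D;).
SAMPLE = '<文件 標題="&#20013;">&#x4E2D;文</文件>'


def parse_sample(text=SAMPLE):
    """Parse the sample and return (element name, attribute value, text)."""
    root = ET.fromstring(text)
    return root.tag, root.get("標題"), root.text
```

A parser that handles these cases correctly should report the element name unchanged and resolve both references to the same ISO 10646 character.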

When this Web server sends these files, their MIME types are set as follows:

[This Web server does not send MIME charset information.]
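To make the point about charset information concrete, here is a small sketch of how a client might check whether a Content-Type value carries a charset parameter. The helper name `charset_of` and the sample header values are assumptions for illustration; the parsing relies on the standard library's `email.message.Message`.

```python
from email.message import Message


def charset_of(content_type):
    """Return the charset parameter of a Content-Type value, or None if absent."""
    msg = Message()
    msg["Content-Type"] = content_type
    # get_content_charset() lowercases the value and returns None
    # when the header has no charset parameter.
    return msg.get_content_charset()
```

For a header such as `text/xml` with no parameter the function returns `None`, which corresponds to the situation described above.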

Well-formed XML files, no stylesheet, no DOCTYPE declaration, no namespace

Test | UTF-8 | Big5 | GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
1) ASCII-codes-only WF file with encoding name using recommended form. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
2) ASCII-codes-only WF file with decimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
8) The file includes one ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
9) The file includes a more troublesome ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
10) The file includes one ideographic character encoded directly in an attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
11) The file includes one ideographic character encoded directly in an element type name. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain

[Note: no tests are provided for XML PIs and comments.]
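Several of the tests listed above turn on where a numeric character reference appears and on how the encoding name is spelled. The sketch below contrasts those cases using hypothetical one-element documents (they are not the actual test files) and Python's `xml.etree.ElementTree`. It assumes the usual XML 1.0 rules: a reference in character data is expanded, while the same characters inside a CDATA section are literal text.

```python
import xml.etree.ElementTree as ET

# Hypothetical documents for illustration only.
IN_DATA = "<doc>&#20013;</doc>"                 # decimal NCR in data
IN_CDATA = "<doc><![CDATA[&#20013;]]></doc>"    # same characters inside CDATA

# The same UTF-8 document with the encoding name in lowercase and uppercase.
LOWER = '<?xml version="1.0" encoding="utf-8"?><doc>中</doc>'.encode("utf-8")
UPPER = '<?xml version="1.0" encoding="UTF-8"?><doc>中</doc>'.encode("utf-8")


def text_of(document):
    """Return the text content of the root element of a str or bytes document."""
    return ET.fromstring(document).text
```

With a conforming parser, `text_of(IN_DATA)` yields the single character U+4E2D, `text_of(IN_CDATA)` yields the eight literal characters `&#20013;`, and the two encoding spellings give the same result.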

Well-formed XML, with stylesheet, no DOCTYPE declaration, no namespace

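The following tests all invoke a CSS style sheet. The style sheet sets styles on Chinese element names (Element Type Name, GI). This tests whether the software supports native language markup. Note: the CSS style sheet is a plain text file; all of the test files (Big5, GB2312, and UTF-8) use it.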

Test | UTF-8 | Big5 | GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
1) ASCII-codes-only WF file with encoding name using recommended form. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
2) ASCII-codes-only WF file with decimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
8) The file includes one ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
9) The file includes a more troublesome ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
10) The file includes one ideographic character encoded directly in an attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
11) The file includes one ideographic character encoded directly in an element type name. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain

[Note: no tests are provided for XML PIs and comments.]

Well-formed XML, no stylesheet, with DOCTYPE declaration, no namespace

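The following test files all contain a DOCTYPE declaration. The markup declarations referenced by the SYSTEM identifier in that declaration use Chinese (hanzi, kanji) for element type names (Element Type Name, GI). Note: the entity referenced by the declaration is provided only in the UTF-8 encoding; the Big5, GB2312, and UTF-8 test files all use this entity.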

Test | UTF-8 | Big5 | GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
1) ASCII-codes-only WF file with encoding name using recommended form. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
2) ASCII-codes-only WF file with decimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
8) The file includes one ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
9) The file includes a more troublesome ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
10) The file includes one ideographic character encoded directly in an attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
11) The file includes one ideographic character encoded directly in an element type name. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain

[Note: no tests are provided for XML PIs and comments.]

Well-formed XML, no stylesheet, no DOCTYPE declaration, with namespace

In these test files, the namespace given in the xmlns attribute is only a name. It does not resolve to any schema definition file.
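The following sketch illustrates the idea that a namespace is just a name: the parser attaches the namespace string to each element name without trying to retrieve anything from it. The namespace name `http://example.org/ns/test`, the sample document, and the function name `qualified_tags` are all hypothetical and chosen only for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespace name; nothing needs to exist at this address.
NS = "http://example.org/ns/test"
SAMPLE = f'<文件 xmlns="{NS}"><段>中</段></文件>'


def qualified_tags(text=SAMPLE):
    """Return the namespace-qualified tag of every element, in document order."""
    root = ET.fromstring(text)
    return [element.tag for element in root.iter()]
```

ElementTree reports each tag in the form `{namespace}localname`, so the Chinese local names appear after the namespace name in braces.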

Test | UTF-8 | Big5 | GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
1) ASCII-codes-only WF file with encoding name using recommended form. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
2) ASCII-codes-only WF file with decimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
8) The file includes one ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
9) The file includes a more troublesome ideographic character encoded directly in data. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
10) The file includes one ideographic character encoded directly in an attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
11) The file includes one ideographic character encoded directly in an element type name. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute. | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain | application/xml, text/xml, text/plain

[Note: no tests are provided for XML PIs and comments.]

Related test files

Test | UTF-8 | Big5 | GB2312
FujiXerox Japanese document | xml | . | .
A test of the html:lang and xml:lang attributes. Chinese and English. | HTML-in-XML | . | .
A test of the xml:lang attributes. Chinese and English. | WF XML | . | .


TAR distribution

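All of the test files and their directories are available in the following file: http://www.ascc.net/xml/zh-xml-test.tar.gz (about 70 KB). The file is stored in UNIX tar and GNU gzip format, but you can also unpack it with a PC decompression program (for example, WinZIP). Note: some decompression programs rename the file to zh-xml-test_tar.gz; you can rename it back to .tar by hand.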

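If you use this file, note that its contents may change frequently, so please link to this site rather than redistributing the file from your own Web pages.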
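For readers who prefer to inspect the archive programmatically, the sketch below lists the XML files inside a gzip-compressed tar archive using Python's standard `tarfile` module. It assumes the archive has already been downloaded to a local path; the function name `list_xml_members` is hypothetical, and the layout of the real archive is not assumed.

```python
import tarfile


def list_xml_members(archive_path):
    """Return the sorted names of regular files ending in .xml in a .tar.gz archive."""
    with tarfile.open(archive_path, "r:gz") as archive:
        return sorted(
            member.name
            for member in archive.getmembers()
            if member.isfile() and member.name.endswith(".xml")
        )
```

Because `tarfile` reads the gzip layer itself, no separate decompression step is needed before listing or extracting the members.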



Finally, here is an HTTP header echo service where you can see what information your browser sends. The service is Pascal's Header Echo.


Academia Sinica


[Copyright notice]