The XML Logo (from the 
      XML FAQ)

Chinese XML Now! 測試用檔案

1999-01-24

這些測試用檔案是供測試 XML 軟體工具的一些小的 XML 檔。每個檔案 只針對軟體的某項功能進行測試。這些檔案同時以 .xml .txt 檔名延伸 (extension) 提供。 [說明]

我們也提供一個將所有的測試檔合併在一起的 合併測試檔。這樣方便您可以很快的檢視這些檔案。 我們也提供tar 檔分送

假如您用 XML Web 瀏覽器讀取這些檔案,但沒有正確的顯示出檔案內容, 請勿擔憂。因為有些瀏覽器沒有定義 "內定的樣示" (default style), 因而所顯示的結果可能不正確或一團槽。而有些瀏覽器可能只接受有 樣示表 (stylesheet) 的 XML 檔。另一種情況是瀏覽器沒有包含中文字形 (Han Ideogram),這些字就會以方格或空白來顯示。這些都是合於規定 的狀況。

但是假如瀏覽器將標示 (markup) 解譯錯誤或無法接受中文的元素名稱 (element name),或無法將數字字元參引 (numeric character references) 視為 ISO 10646 的字集處理等問題都是軟體設計上的錯誤 (bug)。假如您 使用的是 beta 版的軟體,不要感覺氣餒,廠商可能正在找出這些錯誤。

這些檔案由這個 Web 伺服器送出時,其 MIME 類別是設定如下:

[這個 Web 伺服器並沒有送出 MIME 字集資訊 (charset information)。]

格式良好 (Well-formed) 的 XML 檔,不包含樣式表 (Stylesheet), 沒有 DOCTYPE 宣告,沒有名稱領域 (Namespace)

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[註:沒有提供針對 XML 的 PI 及註解 (Comment) 的測試]

格式良好 (Well-formed) XML, 有樣式表 (Stylesheet), 沒有 DOCTYPE 宣告, 沒有 名稱領域 (Namespace)

下列測試都會啟動CSS Style Sheet。 這個樣式表中使用了針對中文的元素名稱 (Element Type Name, GI) 進行樣式設定。這是測試軟體是否支援本地語言標示 (Native Language Markup) 的功能。註:這個 CSS 樣式表是一個一般文字檔;Big5,GB2312,UTF-8 等 所有的測試用檔都使用它。

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[註:沒有提供針對 XML 的 PI 及註解 (Comment) 的測試]

格式良好 (Well-formed) XML, 沒有樣式表 (Stylesheet), 有 DOCTYPE 宣告, 沒有名稱領域 (Namespace)

下列測試用檔都包含一個 DOCTYPE 宣告。在此宣告中, SYSTEM 識別符 (identifier) 中的標示宣告 使用了中文 (hanzi, kanji) 做為元素類別名稱 (Element Type Name, GI)。 註:這個宣告中的實體 (entity) 只提供 UTF-8 編碼;Big5, GB2312 及 UTF-8 測試用檔都使用這個實體。

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[註:沒有提供針對 XML 的 PI 及註解 (Comment) 的測試]

格式良好 (Well-formed) XML, 沒有樣式表 (Stylesheet), 沒有 DOCTYPE 宣告, 有名稱領域 (Namespace)

在這些測試用檔 xmlns 屬性的名稱領域 (Namespace) 只是個名稱而已。 而沒有針對某個 schema 定義檔可被解譯 (resolve)。

Test
UTF-8
Big5
GB2312
0) ASCII-codes-only WF file with encoding name in all uppercase or lowercase.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
1) ASCII-codes-only WF file with encoding name using recommended form.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
2) ASCII-codes-only WF file with decimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
3) ASCII-codes-only WF file with hexadecimal NCR in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
4) ASCII-codes-only WF file with decimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
5) ASCII-codes-only WF file with hexadecimal NCR in attribute value.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
6) ASCII-codes-only WF file with decimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
7) ASCII-codes-only WF file with hexadecimal NCR in CDATA section.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
8) The file includes one ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
9) The file includes a more troublesome ideographic character encoded directly in data.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
10) The file includes one ideographic character encoded directly in an attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
11) The file includes one ideographic character encoded directly in an element type name.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
12) The file includes one ideographic character encoded directly in an ID attribute and later in an IDREF attribute.
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain
application/xml
text/xml
text/plain

[註:沒有提供針對 XML 的 PI 及註解 (Comment) 的測試]

合併測試檔

格式良好 (Well-formed) XML, 有樣式表 (Stylesheet), 有 DOCTYPE 宣告, 沒有名稱領域 (Namespace), xml:lang attributes, standalone.

這個測試包含前面的各種狀況.這個測試針對中文文件中的多項重要的 基本需求加以測試.

Test
UTF-8
Big5
GB2312
13) This file has all the test cases of test files 1 to 12.
UTF-8
Big5
GB2312

相關測試用檔

Test UTF-8 Big5 GB2312
Charles Muller's Dictionary of East Asian Buddhist Terms is a multilingual resource. Excellent for demonstrating. XML instances, DTD, XSL stylesheet. . .
FujiXerox Japanese document xml . .
A test of the html:lang and xml:lang attributes. Chinese and English. HTML-in-XML . .
A test of the xml:lang attributes. Chinese and English. WF XML . .
Chinese and English, with xml:lang attribute. English - Chinese XML Glossary (en, zh)) English - Chinese XML Glossary (zh, en ) English - Chinese XML Glossary (zh, en)

另外還有一些針對不同語言及編碼的 HTML 測試檔, 其中也包含在 MIME 標頭 有 "charset" 的標示, 您可以在 Vancouver 網站的 Using Multiple Languages in HTML 網頁中取得.



TAR 檔分送 (Distribution)

可由下列檔案取得所有的測試用檔及其各個目錄: http://www.ascc.net/xml/zh-xml-test.tar.gz. (檔案大小約 70 K)。 檔案是使用 UNIX 的 tar 及 GNU 的 gzip 格式存放,但是您也可以用 PC 環境的解壓縮程式 (例如:WinZIP) 來解開。註:有些解壓縮程式 會將檔名更改為 zh-xml-test_tar.gz;您可以再以手動 方式改回為.tar

假如您用到這個檔案,請注意其中的內容可能會經常的更改,所以請 連接到本網站,而避免在您的網頁中分送這個檔案。



最後...這兒是個 HTTP 標頭回應顯示服務 (header echo service) 您可以看到您的瀏覽器送出什麼資訊。這個服務是 Pascal's Header Echo

[英文頁]
英文頁
(UTF-8)
[中文頁]
中文頁
(UTF-8)
[中文頁]
繁體
中文頁
(Big5)
[中文頁]
簡體
中文頁
(GB 2312)

中央研究院

[英文頁][中文頁]

[版權聲明]