The newly published W3C
Resource Description Framework
is not specified in a convenient way.
Here are ISO standard XML markup declarations.
There is also a version in text/plain
(1999-02-25)
The new XML
version of the Question
and Answer Markup Language DTD for FAQs.
(QAML 2.1 has been written in conjunction with people from FAQ.org)
This version supports language, stylesheets, ratings and
linking better.
There is also a version in text/plain.
A QAML FAQ is now available, in XML.
If you want to use Big5 XML documents with SGML
systems,
you will need a special SGML declaration. SGML was not designed
to cope with multiple-byte character encodings, so the SGML
declaration
has to be hacked to make it work. (1999-02-05)
[XML] [SGML
loose] [SGML tight]
This note defines
elements you can use to incorporate line-delimited records
and
fixed-column data into XML documents. The
DTD is also available.
There is also a version in text/plain
This DTD can be used to distribute TEI Lite documents
over the World Wide Web: it is the TEI Lite DTD, loosened according to the
requirements of XML. TEI is the Text
Encoding Initiative; you can make TEI DTDs to mark up all sorts of literature
and reference material. Please note that this DTD may change over the next few months,
in slight ways. (1999-02-10)
(TEI) Lite and Loose is also available using DDML, see next
entry.
There is also a version in text/plain
The basic ISO
and SPREAD-2 public entity sets
for special characters.
Dave Ragget's Tidy (HTML->XHTML+CSS converter) with extensions Chinese Extensions for Big5 and UTF-8 (Also, ShiftJIS) (1999-05-17)
html2qaml.xom is
an OmniMark script
to help conversions from HTML to QAML. It does not do everything. (1999-03-24)
DDML (Document Description Markup Language)
is for DTDs declared using elements
rather than XML markup declarations.
There are OmniMark scripts to convert XML markup declarations into DDML:
there
are two scripts: xml2ddml.xom and
xml2ddml2.xom.
Also, there are examples of DDML:
the XBEL DTD,
our version of the DDML DTD,
our (TEI) Lite and Loose DTD,
and MathML, all in DDML.
The Exclusion Validator for XHTML
finds certain errors that an XML validatory cannot.
See note below for details.
Note: based on previous draft of XSL. (1999-02-05)
Example: Exclusion Validator for XHTML
(1999-02-26)
This is a
C header
for an interface class
for the W3C's Document Object Model API.
(1999-05-31)
(Note: This is not ASCC source code.)
This is a snapshot of an experimental general-purpose lexical analyser (scanner) which can be configured for XML and a wide range of different syntaxes. There is also an explanatory article about how XXX relates to validation. (1999-05-31)
html2qaml.xom is an OmniMark script
to help conversions from HTML to QAML. It does not do
everything. (1999-03-24)
These (UNIX shell and Omnimark) tools exract all
phrases from FAQs
and create a permuted, sorted index file in HTML:
makeindex.sh, getphrase.xom, and
mkhtml.xom
(1999-04-07)
This is temporarily available, in XML in UTF-8.
The Academia Sinica full text database
has several hundred meg more.
The document is some ancient Chinese poems
or lyrics (about 33K).
It uses our new (TEI) Lite and Loose DTD. (1999-02-10)
An alternate
version formatted using a CSS
stylesheet is also available. (1999-02-15)
Please note that this document is still being tested.
This is the overview of a seminar series at Academia Sinica.
You can use it to compare the XML and CSS implementations of various browsers.
You can see the same document marked up as
HTML,
HTML-in-XML with namespaces,
and
XML.
The HTML documents use one
Cascading Style Sheet, the XML document
uses
another.
As a further example, you can
see the same document with
VML graphic objects.
Even though the documents may look the same (if your browser accepts them),
if you view source, you can see they are all marked up very differently. (1999-02-11)
The QAML FAQ (en),
Chinese XML Now FAQ (en, zh), and
the Chinese Metadata FAQ (en)
with a simple Cascading Style Sheet. (1999-04-06)
An introduction to XML
| Seminar Notes (Note: This material is copyright. You have permission to download and read it for private, reference, archive and educational purposes. If you intend to use it in any commercial setting, please contact us.) |
|
|---|
This note shows how you can use XSL (eXensible
Stylesheet Language)
to validate document structures. (1999-02-05)
Example: Exclusion Validator for XHTML
(1999-02-26)
Francis Norton has published an excellent note
persuing ideas in this paper,
it shows how to generate a validator
from input in a schema language. (1999-05-20)
This note suggests that content models (regular expressions on name tokens) are a powerful technique which can be used to explain, constrain and validate many things other than element's contents. This paper is a continuation of the previous paper, and also may serve as an backgrounder to people looking at DOM, XML Schemas, the XML Information set, XSL and XLink. (1999-05-31)
This note continues from the previous note. If a generic regular expression processr can perform several checks usually associated with well-formedness checking, does this allow us to make a simpler front-end for reading XML? Can we make this tokenizer fit in with the ideas proposed in XML Notation Schemas. (1999-05-31)
This retrospective note looks at ERCS after 5 years, its influence on HTML, SGML and XML. What has succeeded, what has failed? This article is of historic and technical interest for the background to WWW internationalization.(1999-05-30)
Logos to show which Chinese character sets software accepts or generates.
Quick and unreliable guide to software supporting XML
English (UTF-8) |
Chinese (UTF-8) |
Traditional Chinese (Big5) |
Simplified Chinese (GB 2312) |