Re: character-sets for dummies?

From: stuart yeates <stuart.yeates_at_nyob>
Date: Thu, 17 Dec 2009 08:28:14 +1300
To: CODE4LIB_at_LISTSERV.ND.EDU
Ken Irwin wrote:
> Hi all,
> 
> I'm looking for a good source to help me understand character sets and how to use them. I pretty much know nothing about this - the whole world of Unicode, ASCII, octal, UTF-8, etc. is baffling to me.

Other people have recommended a whole lot of fabulous resources, so I 
won't cover ground they already have.

If, however, you need to deal with characters which don't qualify for 
inclusion in Unicode (or which do qualify but which haven't yet been 
assigned code points). I recommend tei:glyph:

http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-glyph.html

We use this to represent typographically interesting but short-lived 
approaches to the representation of Māori in printed works. See for 
example the 'wh' ligature (which looks like a 'vh' and is pronounced in 
modern usage like 'f') in the following text:

http://www.nzetc.org/tm/scholarly/tei-Auc1911NgaM-t1-body-d4.html

for the underlying TEI XML representation see:

http://www.nzetc.org/tei-source/Auc1911NgaM.xml

cheers
stuart
-- 
Stuart Yeates
http://www.nzetc.org/       New Zealand Electronic Text Centre
http://researcharchive.vuw.ac.nz/     Institutional Repository
Received on Wed Dec 16 2009 - 14:02:38 EST