Re: iso5426 (MAB2?) to UTF8 and back: finally!

From: Karen Coyle <lists_at_nyob>
Date: Thu, 27 Dec 2012 07:13:50 -0800
To: CODE4LIB_at_LISTSERV.ND.EDU

Marc,

If you are looking for MARC records, here is a very large file of 
Library of Congress MARC records on the Internet Archive:
    http://archive.org/details/marc_records_scriblio_net

It is broken into chunks. The early chunks will not have interesting 
characters, so you might start with a later file.
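
One rough way to check whether a given chunk is worth testing against is to
count the bytes outside plain ASCII in it. The sketch below is only an
illustration, not part of the archive or of Encode::ISO5426. It assumes the
chunk is a raw ISO 2709 (MARC) file, in which every record ends with the
record terminator byte 0x1D. The function names are made up for this example.
It counts only bytes at or above 0x80, so it will miss MARC-8 text that
switches character sets with escape sequences.

```python
from collections import Counter

RECORD_TERMINATOR = b"\x1d"  # ISO 2709 end-of-record byte


def split_records(data: bytes):
    """Yield each raw record found in an ISO 2709 byte string."""
    for chunk in data.split(RECORD_TERMINATOR):
        if chunk.strip():  # skip the empty tail after the last terminator
            yield chunk + RECORD_TERMINATOR


def high_byte_counts(data: bytes) -> Counter:
    """Count every byte value >= 0x80 (anything outside 7-bit ASCII)."""
    return Counter(b for b in data if b >= 0x80)


def summarize(data: bytes):
    """Return (record count, records containing high bytes, byte histogram)."""
    records = list(split_records(data))
    with_high = sum(1 for r in records if any(b >= 0x80 for b in r))
    return len(records), with_high, high_byte_counts(data)
```

To try it on a downloaded chunk, read the file in binary mode (for example
`summarize(open(path, "rb").read())`, where `path` is whichever chunk you
picked) and compare the numbers between an early and a late file.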

kc

On 12/26/12 4:57 PM, Marc Chantreux wrote:
> Hello Perl mongers and librarians,
>
> I just released an "almost working" iso5426 ucm and Encode::ISO5426 on
> GitHub. It is an XS module, much faster than Koha's C4::Charset, and
> the encode (to iso5426) feature works.
>
> https://github.com/eiro/p5-encode-iso5426
>
> I know there are missing characters in the table, so my goal is now to
> run MARC::MIR on the largest set of records I can, to find them. Also,
> if someone has a test suite or table (even an incomplete one), I would
> be really pleased to read it.
>
> Any other feedback is very welcome.
>
> regards,

-- 
Karen Coyle
kcoyle@kcoyle.net http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet
Received on Thu Dec 27 2012 - 10:15:10 EST