On Tue, Apr 20, 2010 at 22:26, Eric Lease Morgan <emorgan_at_nd.edu> wrote:
> IMHO, this is EXACTLY correct. Its is the ISBD/AACR2 sort of stuff that is so field specific and inconsistently applied that it is almost impossible to parse MARC, MARCXML, or MODS without going through a million conditional statements.
It does surprise me somewhat that all you smart folks don't get
together and create a framework for washing and cleaning up MARC
records (making it convertible to whatever else you want or need). I
know OCLC does it as part of their thing, but this thing should be
open and extendable. Let all geeks use it and fix it. There were some
Perl cleaners back in the days, but surely a better option can be
made. :) Heck, even a RegEx library might prove useful to clean it,
and then some lookups to wash it, and maybe a service or two to match
it (and hence tag it), and maybe someone nice could throw in a
modeller to make it all RDF / Topic Maps friendly as well. Whammo! And
we're back in the driving seat of our own destiny, right? Make it as a
pass-through filter, and people can wash their own dirty laundry to
avoid copyright issues (or ignore this problem, and become copyleft
heros!)
Surely this is worth spending a bit of time on? Surely this is in the
interest of all, including top-brass? Surely there is no better way?
Surely this has all the political clout and librarian chutzpah needed
to get going?
Regards,
Alex
--
Project Wrangler, SOA, Information Alchemist, UX, RESTafarian, Topic Maps
--- http://shelter.nu/blog/ ----------------------------------------------
------------------ http://www.google.com/profiles/alexander.johannesen ---
Received on Tue Apr 20 2010 - 08:40:12 EDT