Re: Indexing Rare Book collections

From: Owen Stephens <owen_at_nyob>
Date: Thu, 1 Mar 2012 15:18:21 +0000
To: NGC4LIB_at_LISTSERV.ND.EDU
Thanks Eric - would you be able to share any of the code you use to manipulate the data and the SolrMARC index config and associated scripts? I'm particularly interested in how you map the MARC data (which fields, how much manipulation is required) and extract facets etc from it.

Thanks,

Owen

Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: owen_at_ostephens.com
Telephone: 0121 288 6936

On 1 Mar 2012, at 13:55, Eric Lease Morgan wrote:

> The "Catholic Portal" is an implementation of a "next-generation library catalog" using MARC records from rare book rooms, etc. -- http://bit.ly/nIF8wl
> 
> The Portal is a part of the Catholic Research Resources Alliance. The goal of the Alliance is to bring together rare, uncommon, and infrequently held materials from libraries, archives, etc. in an effort to support scholarly research. It is a VuFind implementation whose metadata comes in a number of flavors: MARC, EAD, and incarnations of Dublin Core. This metadata is first made accessible on the Web by member institutions. I then harvest it, validate it, enhance it (slightly), and stuff the result into VuFind's underlying Solr index. The end result is not really a "catalog" but an interface to an index. 
> 
> We are slowly working on various enhancements including harvesting and indexing full text, as all as supporting text mining interfaces against full text so the content can be "read" distantly.
> 
> -- 
> Eric Lease Morgan
> University of Notre Dame
> 
> (574) 631-8604
Received on Thu Mar 01 2012 - 10:25:52 EST