On Dec 21, 2007, at 10:05 AM, Prestamo, Anne wrote:
> We implemented AquaBrowser last summer. We were astounded at the
> number of "variant spellings" that appeared the Word Cloud, and
> initially thought that our catalogers would be deluged with
> requests to make corrections in our records. As we investigated
> further we realized that a lot of the spelling variants are
> legitimate spellings that in our case come largely from two
> sources: 1) the ~90,000 records for Early English Books online;
> and 2) the SyndeticsICE searchable tables of contents.
I had the same experience with my Alex Catalogue where I indexed the
full text of documents. For example, the work "mississippi" was
legitimately spelled in a bunch of different ways in the texts. Using
something like Aspell do find similar spellings turns up very
interesting results. Try:
http://www.infomotions.com/alex/?cmd=search&query=mississipi
--
Eric Lease Morgan
University Libraries of Notre Dame
Received on Fri Dec 21 2007 - 10:39:12 EST