Re: Relevancy-ranking LCSH?

From: Walter Lewis <lewisw_at_nyob>
Date: Wed, 7 Feb 2007 14:18:19 -0500
To: NGC4LIB_at_listserv.nd.edu

Art Rhyno wrote:
> One idea that can be found in Query By Example (QBE) engines used for
> image retrieval is to create a composite representation of an image,
> and then use it as sort of a fingerprint for similar content. If you
> had the full text of the objects gathered under a particular LCSH, and
> used something like Latent Semantic Indexing (LSI) or other techniques
> that try to identify relationships between underlying terms and
> documents, it might be possible to use a QBE approach where the
> content that best matches the most common composite is a good
> indicator of the most representative sample in a collection. Of
> course, for fiction, the most representative sample might actually be
> the worst read of the lot since it would likely be the most formulaic.
> But maybe that would make it the most relevant?

This begins to feel like the engines purportedly used by various
anti-plagiarism "services".

Think of the wonderful collection of unintended consequences.  :)
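
For anyone who wants to poke at the composite idea quoted above, here is a
minimal sketch of it, not anything Art or any existing system actually does.
It swaps in plain TF-IDF vectors for LSI (LSI would add a truncated SVD step
before the comparison), builds a centroid "fingerprint" from the texts
gathered under one heading, and ranks each text by cosine similarity to that
fingerprint. The function names and the whitespace tokenizer are assumptions
made up for illustration.

```python
import math
from collections import Counter, defaultdict


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]  # naive tokenizer (assumption)
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each term appear?
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def norm(vec):
    """Euclidean length of a sparse vector."""
    return math.sqrt(sum(w * w for w in vec.values()))


def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either is empty."""
    na, nb = norm(a), norm(b)
    if not na or not nb:
        return 0.0
    return sum(w * b.get(term, 0.0) for term, w in a.items()) / (na * nb)


def composite(vectors):
    """Centroid of the length-normalized vectors: the 'fingerprint'."""
    total = defaultdict(float)
    for vec in vectors:
        length = norm(vec)
        if not length:
            continue  # skip empty documents
        for term, w in vec.items():
            total[term] += w / length
    return {term: w / len(vectors) for term, w in total.items()}


def rank_by_representativeness(docs):
    """Return (score, index) pairs, most representative document first."""
    vectors = tfidf_vectors(docs)
    fingerprint = composite(vectors)
    scores = [(cosine(vec, fingerprint), i) for i, vec in enumerate(vectors)]
    return sorted(scores, reverse=True)
```

Calling rank_by_representativeness() on the full texts under a single LCSH
would put the text closest to the composite first and the odd one out last,
which for fiction is exactly where the formulaic-equals-relevant question
starts to bite.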

Walter Lewis
Halton Hills
Received on Wed Feb 07 2007 - 13:15:10 EST