Art Rhyno wrote:
> One idea that can be found in Query By Example (QBE) engines used for
> image retrieval is to create a composite representation of an image,
> and then use it as sort of a fingerprint for similar content. If you
> had the full text of the objects gathered under a particular LCSH, and
> used something like Latent Semantic Indexing (LSI) or other techniques
> that try to identify relationships between underlying terms and
> documents, it might be possible to use a QBE approach where the
> content that best matches the most common composite is a good
> indicator of the most representative sample in a collection. Of
> course, for fiction, the most representative sample might actually be
> the worst read of the lot since it would likely be the most formulaic.
> But maybe that would make it the most relevant?
This begins to feel like the engines purportedly used by various
anti-plagiarism "services".
Think of the wonderful collection of unintended consequences. :)
Walter Lewis
Halton Hills
Received on Wed Feb 07 2007 - 13:15:10 EST