Re: Resignation

From: Julia Bauder <juliabauder_at_nyob>
Date: Thu, 30 Aug 2007 09:29:21 -0700
To: NGC4LIB_at_listserv.nd.edu
Sperr, Edwin wrote:

> I'm sorry, but I just don't believe you.
>
> Show us.  Point us to these applications that can
*currently* slurp in
> 250 pages of full text and return 5 to 7 reasonably
good, controlled
> vocabulary subject headings (or topics or topic maps
or well-formed RDF
> triples or what have you).  Point to one *real world
example* of this
> happening -- not a lab, with pre-selected documents
from a single topic
> domain, or test runs against 5 paragraphs.  This is
*not* a trivial
> task. To say that it is misapprehends the entire
scope of what we're
> talking about.
>
>
The World Bank catalogs its documents this way using a
program called Teragram. (And the World Bank generates
a lot of lengthy documents in a very wide variety of
topic domains.)

Teragram:
http://www.teragram.com/solutions/categorizer.htm

Slides from a presentation by Denise Bedford, the head
of metadata operations at the World Bank, about their
metadata/search system:
http://www.collectionscanada.ca/obj/014005/f2/014005-05209-c-e.pdf
(The slides that most directly address Teragram and
how they use it start with slide 44.)

It's not easy, and it's not cheap to set up, but it
can be done.

Julia Bauder
Wayne State University
Soon-to-Be MLIS (December 2007)
Received on Thu Aug 30 2007 - 12:29:21 EDT