Marja Haapalainen wrote:
> ...We are currently looking at different ways of aggregating
> metadata (mainly about e-articles, e-journals and e-books) from
> different sources/publishers...
Till Kinstler wrote:
> The whole data aggregation workflow really is a pain....
I think the idea of aggregating metadata is/was long time in coming. Kudos. And yes, one of the bigger challenges, besides the licensing/purchasing issues, will be the workflow and data normalization processes.
On the other hand, the benefits can be huge. I see libraries as institutions who do collection, preservation, organization, and dissemination of data, information, and knowledge for specific audiences. By aggregating the metadata -- as opposed to licensing access to it -- libraries can fulfill their goals and provide useful services at the same time.
Actually having possession of the data/metadata opens quite up a number of possibilities. The creation of a unified search interface is just one of them. No information silos. If the data/metadata is all in the same index, then relevancy ranking algorithms and statistical analysis will be much more valid. (Federated searching. Ick.) Once the data is housed locally, it can be more easily integrated into the wider community of the library. It is easier to seamlessly insert it into course management systems. It is easier to create current awareness services. It is easier to distribute. Once the data/metadata is housed locally, then it is easier to apply "digital humanities" computing techniques against it. Create concordances. Extract statistically significant n-grams (one, two, or n-length phrases). There are things called "champion lists" -- sets of words denoting a theme -- that can then be applied a corpus and used to calculate which items are more relevant. With direct acc!
ess to the data/metadata it is easier to literally chart and graph texts thus illustrating similarities between them. Access to the full text allows you to find similar phrases or paragraphs between texts, and this leads to the ability to trace ideas and authors through time.
Finding information is much less of a problem compared to ten or fifteen years ago. Everybody can find. What is needed now are tools that allow you to USE. This, I believe, is a big opportunity for the profession. Having the data/metadata makes such a thing much more possible.
--
Eric Lease Morgan
Head, Digital Access and Information Architecture Department
University of Notre Dame
Received on Tue Feb 16 2010 - 14:26:17 EST