Re: Releasing bibliographic data

From: B.G. Sloan <bgsloan2_at_nyob>
Date: Wed, 24 Mar 2010 08:18:45 -0700
To: NGC4LIB_at_LISTSERV.ND.EDU
 
Jim Weinheimer said: "Before I moved to Rome I was in the midst of compiling lists of transliteration tables for different Cyrillic languages to place in the Slavic Cataloging Manual for use of catalogers, but it turned out to be a huge task and I never managed to finish it."
 
I once managed a project where an outside vendor did data entry for the conversion of the University of Illinois library shelflist. My job was quality control, and part of that was making sure the vendor got the transliterations right. It was obvious that a lot of their data entry folks didn't quite understand the process of transliteration.
 
Bernie Sloan

--- On Wed, 3/24/10, Weinheimer Jim <j.weinheimer_at_AUR.EDU> wrote:


From: Weinheimer Jim <j.weinheimer_at_AUR.EDU>
Subject: Re: [NGC4LIB] Releasing bibliographic data
To: NGC4LIB_at_LISTSERV.ND.EDU
Date: Wednesday, March 24, 2010, 11:10 AM


Based on my experience, I always look at Russian entries. In this case, I found this record at random (I've tried to remove the MARC diacritics because they don't go through email very well):

Titel:      Očetnyj doklad Central'nogo Komiteta KPSS XXVI S-ezdu Kommunističeskoj Partii Sovetskogo Sojuza i očerednye 
             zadači partii v oblasti vnutrennej i vnešnej politiki : 23 fevralja 1981 goda
Sprache:    rus
Jahr:       1981
Verf.:      Brežnev, Leonid I
Umfang:     110 S.
Verlag:     Izd. političeskoj lit.
Ort:        Moskva

And the LC version:
LC Control No.:      81479163
LCCN Permalink:     http://lccn.loc.gov/81479163
Type of Material:     Book (Print, Microform, Electronic, etc.)
Personal Name:     Brezhnev, Leonid Ilich, 1906-1982.
Main Title:     Otchetnyi doklad Tsentralnogo Komiteta KPSS XXVI sezdu Kommunisticheskoi partii Sovetskogo Soiuza i ocherednye zadachi partii v oblasti vnutrenneĭi vneshnei politiki, 23 fevralia 1981 goda / L.I. Brezhnev.
Published/Created:     Moskva : Izd-vo polit. lit-ry, 1981.
Description:     110 p. ; 20 cm.

Lots of differences there.

Before I moved to Rome I was in the midst of compiling lists of transliteration tables for different Cyrillic languages to place in the Slavic Cataloging Manual for use of catalogers, but it turned out to be a huge task and I never managed to finish it.

Somehow these records will have to be reconciled, and now it will have to be done through automatic methods. Now, the author information can be solved through a URI, e.g. http://viaf.org/viaf/7387544 but the remainder will be more difficult. 

This should be possible, by entering all the transliteration tables to allow fuzzy matching. Of course, it wouldn't be perfect, but better than wading through massive numbers of duplicate records. 

James Weinheimer  j.weinheimer_at_aur.edu
Director of Library and Information Services
The American University of Rome
via Pietro Roselli, 4
00153 Rome, Italy
voice- 011 39 06 58330919 ext. 258
fax-011 39 06 58330992


-----Original Message-----
From: Next generation catalogs for libraries [mailto:NGC4LIB_at_LISTSERV.ND.EDU] On Behalf Of Bernhard Eversberg
Sent: Wednesday, March 24, 2010 1:34 PM
To: NGC4LIB_at_LISTSERV.ND.EDU
Subject: Re: [NGC4LIB] Releasing bibliographic data

The data released by Universität zu Köln (Cologne University) last week
stirred up some interest here. While working on a new search software,
I rigged up a database from that stuff just for experimentation. Here it
is, for those who want to learn more about the data:

http://www.biblio.tu-bs.de/db/a30/cologne.htm

The records haven't been exploited in entirety. Specifically, only
personal name authorities have been included, not corporate names,
subjects or other, like copy data. Not all descriptive fields are
included either. Not all special characters have survived although
the software works with UTF-8.

You can do a search or browse several indexes, like personal names,
title words, title phrases, authority forms of names, publishers,
series, ISBNs. (Pulldown menu on the upper right).
See how far you get - the interface is not all English :-(
(It's some 3.2 mio title records and 1.5 mio name authorities. The title
records include the authority id numbers only, not the names, but the
name index does make the real names searchable.)

B.Eversberg
Received on Wed Mar 24 2010 - 11:19:18 EDT