Re: Search/retrieve access is to library data what Gopher was to the web?

From: Karen Coyle <lists_at_nyob> Date: Thu, 21 Aug 2008 10:04:11 -0700 To: NGC4LIB_at_LISTSERV.ND.EDU

Allowing a search engine to index the data shouldn't be a legal problem, 
I wouldn't think. The problem is that the data isn't available in a 
crawlable way -- it's stuck in databases that only speak Z39.50. Where 
the legal problem comes in is that people feel they can't put their data 
out on the open web, right?

I was thinking that one of the barriers is that what we have is this 
highly formalized metadata. The first problem with that is that we have 
it in MARC, which no one other than libraries understands. The second is 
that metadata is highly concentrated -- and web search tends to be on 
full text and takes a shotgun approach rather than the precise approach 
of library catalogs. Because the data is concentrated, keyword searching 
is often unsatisfactory -- and the web thrives on keyword searching.

If we COULD surface all of the library metadata to the web, then I think 
that we'd need to do something other than just treat each record as a 
web page. I think we'd need to create a layer of merged data so that 
each book (manifestation) is represented as few times as possible 
(ideally once, but we know how hard that is), and we'd need a work layer 
as well. And we'd need ways to navigate, not just search. Linking books 
that cite each other (like following urls). I guess that's the other 
problem with our metadata -- no interaction between records, few links 
(I'm thinking of the 'related works' fields). So much to do!

kc

Jonathan Rochkind wrote:
> Sadly, I think much of the barrier is legal/business :  OCLC members 
> are not allowed (or believe they are not allowed) to share their 
> complete records with all and sundry.  Being able to share their 
> complete corpus with all and sundry is what would set the ground for 
> innovation. You never know who is going to provide this, but once you 
> make it possible, somebody will.
>
> Jonathan
>
> Martin Malmsten wrote:
>> Hi all,
>>
>> I am simply going to throw down the gauntlet and say that 
>> search/retrieve access to library data is not good enough. For too 
>> long have library data been trapped within data-silos only accessible 
>> through obscure protocols. Why is access to library data still an 
>> issue? This was solved in a matter of months on the web, when Excite 
>> (or whichever search engine was first) was introduced. Why are there 
>> not at least ten search engines containing the majority of the worlds 
>> bibliographic data?
>>
>> Yes, I am stating/asking the obvious.
>>
>> So, Linked Data for libraries, anyone?
>>
>> Best regards,
>>   Martin, who really wants a discussion about Linked Data
>>
>

-- 
-----------------------------------
Karen Coyle / Digital Library Consultant
kcoyle@kcoyle.net http://www.kcoyle.net
ph.: 510-540-7596   skype: kcoylenet
fx.: 510-848-3913
mo.: 510-435-8234
------------------------------------