Re: internet archive api

From: Eric Lease Morgan <emorgan_at_nyob>
Date: Mon, 18 Sep 2017 16:25:59 -0400
To: CODE4LIB_at_LISTS.CLIR.ORG
Eric wrote:

> Is there an Internet Archive API that will allow me to get the contents of a
> collection as a stream of data and not as a stream of HTML… https://archive.org/details/bplsceep


On Sep 18, 2017, at 3:42 PM, Mark Jordan <mjordan_at_SFU.CA> wrote:

> Eric, yes, there is. I've implemented a simple tool using it, check out https://github.com/mjordan/archiveit_auditor. Still is work in progress but I hope it illustrates the API enough to get you going.


On Sep 18, 2017, at 3:43 PM, Francis Kayiwa <kayiwa_at_POBOX.COM> wrote:

> https://archive.readme.io/docs


Cool! The beginnings of my answer are, first, download/install the cool ia tool [1]. Then, use the tool (in a rather obscure way) to get metadata:

  $ ./ia search 'collection:bplsceep'

The result is a list of identifiers in the collection.

Fun!

[1] tool - http://internetarchive.readthedocs.io/en/latest/cli.html

—
Eric Morgan
Received on Mon Sep 18 2017 - 16:27:05 EDT