Text analysis for MARC data?

From: Alison Clemens <alison.clemens_at_nyob>
Date: Thu, 4 Jan 2024 11:26:42 -0500
To: CODE4LIB_at_LISTS.CLIR.ORG
Hi, all,

Has anyone here done text analysis-type work on MARC data, particularly on
topical subject headings? I work closely with my library's digital
collections, and I am interested in seeing what kinds of topics (as
indicated in our descriptive data) are represented in our
digital collections. So, I have the corresponding MARCXML for the
materials and have extracted the 650s as a string (e.g., *650 $a World War,
1914-1918 $x Territorial questions $v Maps*), but I'm a little stuck on how
to meaningfully analyze the data. I tried feeding the data into Voyant, but
I think it's too large of a corpus to run properly there, and regardless,
the MARC data is (of course) delimited in a specific way.

Any / all perspectives or experience would be welcome -- please do get in
touch directly (at alison.clemens_at_gmail.com), if you'd like.

With thanks,
Alison Clemens
Beinecke Rare Book and Manuscript Library, Yale University
Received on Thu Jan 04 2024 - 10:48:11 EST