Re: Looking for lightweight tool to identify PII

From: Lane, Jennifer (Library) <Jennifer.Lane_at_nyob>
Date: Fri, 19 Apr 2019 17:44:21 +0000
To: CODE4LIB_at_LISTS.CLIR.ORG
Could you use the custom patterns feature in Acrobat's redaction tools, together with regular expressions? http://blogs.adobe.com/acrolaw/2011/05/creating_and_using_custom_redact/
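
To make the regex idea concrete, here is a minimal Python sketch of pattern matching for the two kinds of PII mentioned in the original question. It is not Acrobat's pattern format, and it assumes the page text has already been extracted as a plain string (scanned pages would need OCR first). The names SSN_RE, CARD_RE, luhn_ok, and find_pii are hypothetical, and the patterns are illustrative guesses that real documents may not follow.

```python
import re

# Assumed, illustrative patterns; real documents may format numbers differently.
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
# 13 to 19 digits, each optionally followed by a single space or hyphen.
CARD_RE = re.compile(r"\b(?:\d[ -]?){12,18}\d\b")


def luhn_ok(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def find_pii(text: str) -> dict:
    """Return candidate SSNs and card numbers found in already-extracted text."""
    ssns = SSN_RE.findall(text)
    cards = []
    for match in CARD_RE.finditer(text):
        digits = re.sub(r"\D", "", match.group())
        # The Luhn check filters out many digit runs that are not card numbers.
        if 13 <= len(digits) <= 19 and luhn_ok(digits):
            cards.append(digits)
    return {"ssn": ssns, "card": cards}


if __name__ == "__main__":
    sample = "SSN 123-45-6789, card 4111 1111 1111 1111."
    print(find_pii(sample))
```

Anything this flags is only a candidate for review. The patterns will miss unusual formats and can match numbers that are not PII, so a person should still check the hits before redacting.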


Jenny Lane | NPL | 615-880-1622


-----Original Message-----
From: Code for Libraries [mailto:CODE4LIB_at_LISTS.CLIR.ORG] On Behalf Of Kimberly Kennedy
Sent: Friday, April 19, 2019 12:26 PM
To: CODE4LIB_at_LISTS.CLIR.ORG
Subject: [CODE4LIB] Looking for lightweight tool to identify PII


Hello!

We are beginning a digitization project at my institution that involves
scanning archival documents that may contain personally identifiable
information (PII), such as Social Security numbers or credit card numbers.  I'm
looking for a tool that will examine the PDFs and identify the ones that
may contain PII, so we can then redact them.

I've experimented a bit with Bulk Extractor Viewer but haven't been able to
get it to work on the scanned PDFs I've created.  I talked to a sales rep
at Spirion and that program seems like overkill for our purposes.  Any
suggestions for other things to try would be appreciated!

Thanks,

Kim


Kimberly Kennedy
Digital Production Coordinator
Northeastern University Library
kimberlymkennedy_at_gmail.com
Received on Fri Apr 19 2019 - 13:47:28 EDT