Re: PDF->text extraction

From: Eric Lease Morgan <emorgan_at_nyob>
Date: Tue, 21 Jun 2011 10:28:39 -0400
To: CODE4LIB_at_LISTSERV.ND.EDU
On Jun 21, 2011, at 10:23 AM, Owen Stephens wrote:

> We've tried iText but had issues with quality
> We moved to PDFBox but are having performance issues


I have been satisfied with pdftotext which is a part of the Xpdf suite of tools -- http://bit.ly/kIHD1x

-- 
Eric Lease Morgan
University of Notre Dame

(574) 631-8604
Received on Tue Jun 21 2011 - 10:29:59 EDT