Re: PDF->text extraction

From: Pottinger, Hardy J. <PottingerHJ_at_nyob>
Date: Tue, 21 Jun 2011 14:03:19 -0500
To: CODE4LIB_at_LISTSERV.ND.EDU

On 6/21/11 12:36 PM, "Boheemen, Peter van" <Peter.vanBoheemen_at_WUR.NL>
wrote:

>The most used open source software for this (and many other mime types)
>is tika: http://tika.apache.org/

Thanks for this link. Tika looks great!


--
HARDY POTTINGER <pottingerhj_at_umsystem.edu>
University of Missouri Library Systems
http://lso.umsystem.edu/~pottingerhj/
"No matter how far down the wrong road you've gone,
turn back." --Turkish proverb
Received on Tue Jun 21 2011 - 15:05:12 EDT