Re: web-based ocr

From: Richard Sarvas <Richard.Sarvas_at_nyob>
Date: Wed, 13 Mar 2013 17:55:52 +0000
To: CODE4LIB_at_LISTSERV.ND.EDU
FYI - the Image::OCR::Tesseract install was a real pain for me on RHEL. I kept running into problems getting one of the dependency modules Time::Format (Date::Manip::TZ_Base errors) installed on RHEL. Eventually I had to install Date::Manip via YUM, then do a force install of Time::Format. After that Image::OCR::Tesseract refused to recognice that the Tesseract executable was installed because the source code compile and installed placed the executable in /usr/local/bin and not /usr/bin. Once I moved the Tesseract executable to /usr/bin the Image::OCR::Tesseract module install worked fine (ImageMagick and Leptonica having been previously installed).


Rick


-----Original Message-----
From: Code for Libraries [mailto:CODE4LIB_at_LISTSERV.ND.EDU] On Behalf Of Eric Lease Morgan
Sent: Wednesday, March 13, 2013 8:54 AM
To: CODE4LIB_at_LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] web-based ocr

On Mar 13, 2013, at 8:07 AM, Ben Brumfield <benwbrum_at_GMAIL.COM> wrote:

> https://github.com/idigbio-aocr/RESTAPI/tree/master/doc

Interesting. Printed for future reference. Thank you.

BTW, I did finally get Image::OCR::Tesseract to make, make test, and make install correctly. I did not have the correct/proper libraries installed for Tesseract's supporting Leptonica library. Now I need to find a PDF library similar to libtff and libpng. 

--
Eric Morgan
Received on Wed Mar 13 2013 - 13:56:41 EDT