Re: pdf2txt [encoding]

From: Eric Lease Morgan <emorgan_at_nyob>
Date: Sat, 12 Oct 2013 10:03:28 -0400
To: CODE4LIB_at_LISTSERV.ND.EDU
On Oct 11, 2013, at 6:39 PM, Mark Pernotto <mark.pernotto_at_GMAIL.COM> wrote:

> Just from a curiosity standpoint, what encoding is being utilized?  I know
> nothing about Perl.  It seemed to have no problem parsing a dash (-) if it
> was up against another character (2007-2012), but barfs when it's by itself
> (2007 � 2012). I'm only referring to 'extracted text' mode.


Encoding, another good point. I think I can fix that. Hmmm… --Eric
Received on Sat Oct 12 2013 - 10:03:54 EDT