Re: [swish-e] pdftotext

From: Thomas Dowling <tdowling(at)not-real.ohiolink.edu>
Date: Tue Mar 10 2009 - 10:52:10 GMT
On 03/10/2009 06:23 AM, Michelangelo Rezzonico wrote:
> Hi all,
> 
> I use pdftotext to index pdf-files.
> This works ok.
> The only problem is that in the output of pdftotext there are many spaces.
> 
> If in the pdf-file there is the string "2001", then in the output of
> pdftotext I find "2 0 0 1".
> 

I don't see this behavior with pdftotext 3.02.

The original may actually have space characters as a way to do faux
letter spacing.  What happens if you copy the text from the PDF file and
paste it into a text editor?
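
If the pasted text also shows "2 0 0 1", the spaces are in the PDF itself, and one possible workaround is to post-process the pdftotext output before it is indexed. Below is a minimal sketch in Python, not anything built into pdftotext or swish-e. The function name and the threshold of three characters are hypothetical choices for illustration.

```python
import re

# Matches a run of three or more single non-space characters, each separated
# by exactly one space, e.g. "2 0 0 1". The minimum of three is an arbitrary
# assumption, chosen so that ordinary pairs such as "a b" are left alone.
_SPACED_RUN = re.compile(r"(?<!\S)\S(?: \S){2,}(?!\S)")


def collapse_letter_spacing(text: str) -> str:
    """Join letter-spaced runs like "2 0 0 1" back into "2001"."""
    return _SPACED_RUN.sub(lambda m: m.group(0).replace(" ", ""), text)


if __name__ == "__main__":
    print(collapse_letter_spacing("year 2 0 0 1 report"))
```

This is a heuristic: it would also join genuine sequences of single characters (for example "a b c"), and it cannot tell where one letter-spaced word ends and the next begins, so it is worth checking against a sample of the real output first.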


-- 
Thomas Dowling
tdowling@ohiolink.edu

Received on Tue Mar 10 06:52:13 2009