On 03/10/2009 06:23 AM, Michelangelo Rezzonico wrote:
> Hi all,
>
> I use pdftotext to index pdf-files.
> This works ok.
> The only problem is that in the output of pdftotext there are many spaces.
>
> If in the pdf-file there is the string "2001", then in the output of
> pdftotext I find "2 0 0 1".
>
I don't see this behavior with pdftotext 3.02.
The original may actually have space characters as a way to do faux
letter spacing. What happens if you copy the text from the PDF file and
paste it into a text editor?
--
Thomas Dowling
tdowling@ohiolink.edu
_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Tue Mar 10 06:52:13 2009