Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] problem indexing pdf

From: Manasa Kandula <m.kandula(at)not-real.RUG.nl>
Date: Thu Jul 03 2008 - 09:30:33 GMT
Hey Bill,
I tried your suggestion. It works!
Thanks
----- Original Message ----- 
From: "Bill Moseley" <moseley@hank.org>
To: "Swish-e Users Discussion List" <users@lists.swish-e.org>
Sent: Wednesday, July 02, 2008 4:59 PM
Subject: Re: [swish-e] problem indexing pdf


> On Wed, Jul 02, 2008 at 12:42:39PM +0200, Manasa Kandula wrote:
>> The pdf file in the website has been successfully converted to the html 
>> format.
>> But, once I index the output of the spider
>> (swish-e -f index.swish-e -c swish.config -S prog -i stdin < output1.txt)
>> , the part whose pathname ends with the pdf extention do not get indexed. 
>> (in this example it is the entire document that doesn't get indexed).
>
> What happens if you don't specify the config file?
>
>    swish-e -S prog -i stdin < output1.txt
>
>
> -- 
> Bill Moseley
> moseley@hank.org
>
> Unsubscribe from or help with the swish-e list:
>   http://swish-e.org/Discussion/
>
> Help with Swish-e:
>   http://swish-e.org/current/docs
>
> _______________________________________________
> Users mailing list
> Users@lists.swish-e.org
> http://lists.swish-e.org/listinfo/users
> 

_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Fri Jul 4 05:00:30 2008