[swish-e] Indexing pdf

From: Manasa Kandula <m.kandula(at)not-real.RUG.nl>
Date: Wed Jul 02 2008 - 10:28:25 GMT
Hello,
I am currently using SWISH-E version 2.4.5.
I used the Swish-e spider.pl script to crawl some websites, with the default configuration settings.
The PDF files on the websites were successfully converted to HTML.
However, when I index the spider's output, the documents whose path names end with the .pdf extension are not indexed.
How can I get these documents indexed?
Manasa



Received on Wed Jul 2 06:21:40 2008