I am currently using the SWISH-E 2.4.5 version.
I have used swish spider.pl to crawl some websites. I have used the default configuration setting.
The pdf files in the website have been successfully converted to the html format.
But, once I index the output of the spider, the parts whose pathnames end with the pdf extention do not get indexed.
How can I index these documents?
Users mailing list
Received on Wed Jul 2 06:21:40 2008