Dear experts,
I got one question on http indexing, below is my swish.conf:
IndexDir spider.pl
SwishProgParameters spider.conf
IndexOnly .htm .html .txt .pdf .doc .ppt .xml .tex .eps .ps .log .jpg .cc
.cxx .cpp .h
IndexContents TXT* .txt
DefaultContents HTML*
ParserWarnLevel 9
The index was generated by:
/usr/local/bin/swish-e -c swish.conf -S prog
but it seems that swish-e indexing all the files in the webpage which I do
not want them to be indexed, such as *.root, *.gz etc.
Any help? Thanks.
Best Regards,
Xinchun
_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Wed Sep 3 22:03:23 2008