OK, let's start over...
I want to index the site.
Only .htm and .html
I don't want to index directories containing .htaccess
I don't want to index documents beginning with "dsc_".
Swish-e version: 2.4.5
Current run string: swish-e -S prog -c swish.conf
# Swish-e config
SwishProgParameters default http://nottherealsitename.com/
Metanames swishtitle swishdocpath
IndexOnly .htm .html
IgnoreWords File: /usr/local/swish-e-2.4.5/conf/stopwords/english.txt
StoreDescription TXT* 10000
StoreDescription HTML* <body> 10000
Need some help.
Bill Moseley wrote:
> On Wed, May 23, 2007 at 10:35:47PM -0400, Frank Hunt wrote:
>> this fails:
>> IndexDir spider.pl
>> SwishProgParameters default http://website.com/
>> FileRules directory contains ^\.htaccess
>> run string: swish-e -S prog -c swish.conf2
> -S prog means you are not reading from the file system -- FileRules is
> only for reading from the file system.
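
Going by the point quoted above, FileRules would only take effect in a file system run. Below is a minimal sketch of such a config, assuming the site's document root is readable on the indexing machine. The path /path/to/docroot is a placeholder, not a real location, and the ReplaceRules line is optional and untested here.

```
# Swish-e config, file system method (a sketch, not a tested setup)
# /path/to/docroot is a hypothetical path; substitute the real document root
IndexDir /path/to/docroot
IndexOnly .htm .html

# Skip any directory that contains a .htaccess file
FileRules directory contains ^\.htaccess

# Skip files whose names begin with dsc_
FileRules filename contains ^dsc_

# Optional: turn stored file paths back into URLs (assumed mapping)
ReplaceRules replace "/path/to/docroot" "http://nottherealsitename.com"
```

This would be run as swish-e -S fs -c swish.conf instead of with -S prog. The trade-off is that a spider only sees what the web server serves, which normally does not include .htaccess files, so the .htaccess rule seems to need file system access.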
confused linux admin
part time windows(r) washer
rochester hills, mi
Received on Thu May 24 07:33:13 2007