Skip to main content.
home | support | download

Back to List Archive

Re: not ignoring content (leave those files alone!)

From: David L Norris <dave(at)not-real.webaugur.com>
Date: Sun Jun 11 2006 - 02:11:14 GMT
On Sat, 2006-06-10 at 18:44 -0700, Linda W. (that's swishey, not
squishey!) wrote:
> My Money Backup.mbf
> 
> It's a 2.2meg file and it seems just to hang at this point --
> actually it does eventually come back after about 5 minutes...with
> 7 words indexed....(sigh).
> 
> Any idea why it isn't skipping this file?

Well, NoContents doesn't tell it to skip the file.  If you do want to
skip the file you can create some FileRules in the config.  Or if you
want to index only files with certain extensions then list them with
IndexOnly.
  http://www.swish-e.org/docs/swish-config.html#filerules
  http://www.swish-e.org/docs/swish-config.html#indexonly

> I'm not sure if it is properly skipping the other file types
> I have set to "nocontent", it says it is running the HTML2 parser
> on them and comes back with 5 words on files like:
> 
>    Normal3.fgs - Using DEFAULT (HTML2) parser -  (5 words)
>    Default.fbt - Using DEFAULT (HTML2) parser -  (5 words)
>    default.fgs - Using DEFAULT (HTML2) parser -  (5 words)
> ---
> 
> All of those types are in the "nocontent" section...

It will still add the file to the index if there's a NoContents rule.  I
don't recall exactly what gets added.  You can use -T debug options to
see what is added to the index.

-- 
 David L Norris
  http://webaugur.com/
  ICQ - 412039
Received on Sat Jun 10 19:11:15 2006