Skip to main content.
home | support | download

Back to List Archive

Re: Swish.config question regarding ExtractPath

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Fri Apr 02 2004 - 16:21:57 GMT
On Fri, Apr 02, 2004 at 07:04:24AM -0800, Jody Cleveland wrote:
> Hello,
> 
> I just setup swish-e 2.4.2 on a redhat 3 es server. I'm setting up
> swish-e to spider sites I have on windows boxes. I've got one:
> http://www.oshkoshpubliclibrary.org/citydirs/
> 
> That I want to spider. When I put that url in for SwishProgParameters,
> it ends up indexing not only the citydirs directory, but also
> directories above that in the tree. I'd like it to stick with citydirs,
> and it's subfolders.

That's a common question:

   test_url => sub { return $_[0]->path =~ m[^/citydirs/] },

> I was looking at the config documentation, and came across ExtractPath.
> Is this what I should use to do that?

No.  It's used for extracting part of the path out of the path and
indexing that text -- it's so you can limit *searches* to a subset of
files based on some text in the path name.

-- 
Bill Moseley
moseley@hank.org
Received on Fri Apr 2 08:21:57 2004