Skip to main content.
home | support | download

Back to List Archive

Re: SWISH-E index limits

From: Gerald Klaas <gklaas(at)not-real.arb.ca.gov>
Date: Mon Apr 22 2002 - 19:28:19 GMT
Bill Moseley wrote:
> And for that I believe in test_url you could even do
> 
>      $uri->path( lc $uri->path );
> 
> to "normalize" all the paths.
> 

Sure enough, Bill, since some of my documents are on NT and
some on linux, you helped me come up with this () to 
"normalize" only the MS-host ones (based on the doc path).
Again, works like a charm!  :-)

---snip from spider.pl test_url---
my $NT_dirs = join '|', qw/ aaqm ab2588 vapor videos vbb /; 
    $url->path( lc $url->path ) if $url->path =~ m!^/($NT_dirs)/!i; 
    return 1; 
---end snip---

Gerald
Received on Mon Apr 22 19:28:26 2002