adivey1@cox.net wrote on 06/17/2004 11:59 AM:
> So would the following work?
>
It looks like it to me, from my reading of the docs, but only you can
tell us for sure... :)
> (delete the DefaultContents line)
IndexOnly HTML* .htm .html .cfm .doc .pdf .ppt
NoContents .doc .pdf .ppt
>
> The Docs, under NoContents, say that having different file types in
> each property won't work. So from what I am understanding, using
> spider.pl and this config file, IndexOnly means the spider will only
> index the files in which I specify the extension, and then the
> contents of those files will be indexed, EXCEPT for the file types in
> NoContents.
>
> Is this a correct assumption? I'm not sure I understand how using
> IndexOnly and DefaultContents would work.
>
From what I can tell, DefaultContents assigns which *parser* to use for
documents that are not explicitly named in IndexOnly.
> Thanks so much! -Alan
>
>
>> From: Peter Karman <karman@cray.com> Date: 2004/06/17 Thu PM
>> 12:03:10 EDT To: Multiple recipients of list
>> <swish-e@sunsite3.berkeley.edu> Subject: [SWISH-E] Re: PPT &
>> swish.cgi (trying again)
>>
>> You likely want the IndexOnly config in addition to
>> DefaultContents. Either that or NoContents, which will still index
>> the name of the .ppt file but not the contents.
>>
>> adivey1@cox.net wrote on 06/17/2004 08:58 AM:
>>
>>> No one responded so I'm assuming the is because the message
>>> looked ridiculous coming from my webmail app.
>>>
>>> Here's a link of my message... I know this is an extra step but I
>>> would really appreciate the help.
>>>
>>> http://members.cox.net/adivey1/swishhelp.txt
>>>
>>> Thanks, Alan
>>
>> -- Peter Karman - Software Publications Programmer - Cray Inc
>> phone: 651-605-9009 - mailto:karman@cray.com
>>
>>
--
Peter Karman - Software Publications Programmer - Cray Inc
phone: 651-605-9009 - mailto:karman@cray.com
Received on Thu Jun 17 17:05:25 2004