Thanks for the reply. I managed to get it to work when I add:
IndexContents HTML .html .htm
My understanding was that this line was only necessary if I
wanted to tell swish-e to index files with non-default
extensions, eg. in association with FilterFile. Anyway I am
getting descriptions indexed, but unfortunately I have another
issue.
The lines I have in my config file are:
StoreDescription TXT 150
StoreDescription HTML <BODY> 150
StoreDescription XML <desc> 150
IndexComments 0
# because of doc2txt.pm
IndexContents TXT .doc
# because of pdf2xml.pm
IndexContents XML .pdf
IndexContents HTML .htm .html .php .php3
If I use 2.1-dev-20 then the description picks up some javascript
from the head of the html file, which has been inserted by
Dreamweaver. (The javascript uses < and >.) However, if I use
the latest cvs version then the description is correctly extracted
from the BODY tag.
So I would like to use 2.1-dev-21 but ... sorry there had to be a
but.
I have found two issues:
1) I used this command line:
swish-e -f tt.full_index -w "gavin" -x "<swishreccount> | <swishtitle> |
<swishrank> | <swishdocpath> | <swishdocsize> | <swishlastmodified> |
<swishdescription> | <swishstartpos> | <swishdbfile> \n"
and get err:(unknown datatype <swishdocsize>) and
err:(unknown datatype <swishstartpos>) from an HTML file that returns
this data with swish-e 2.1-dev-20
2) Also I was going to use the perl module with my webpages but I will
need 2.1-dev-21 perl module to do this and when I run make on the perl
module I get this:
UNIX> make
mkdir blib
mkdir blib/lib
mkdir blib/arch
mkdir blib/arch/auto
mkdir blib/arch/auto/SWISHE
mkdir blib/lib/auto
mkdir blib/lib/auto/SWISHE
cp SWISHE.pm blib/lib/SWISHE.pm
/tools/bin/perl -I/apps/perl5.6.0/lib/5.6.0/sun4-solaris
-I/apps/perl5.6.0/lib/5.6.0 /apps/perl5.6.0/lib/5.6.0/ExtUtils/xsubpp
-typemap /apps/perl5.6.0/lib/5.6.0/ExtUtils/typemap SWISHE.xs >
SWISHE.xsc && mv SWISHE.xsc SWISHE.c
gcc -c -fno-strict-aliasing -I/usr/local/include -O
-DVERSION=\"0.01\" -DXS_VERSION=\"0.01\" -fPIC
-I/apps/perl5.6.0/lib/5.6.0/sun4-solaris/CORE SWISHE.c
SWISHE.xs: In function `XS_SWISHE_SwishNext':
SWISHE.xs:64: structure has no member named `Prop'
make: *** [SWISHE.o] Error 1
Sorry to be a pain. If you or anybody else can help with any of
this I would be grateful.
Thanks in advance
Gavin.
Bill Moseley wrote:
>
> At 01:06 PM 06/15/01 -0700, Gavin Walker wrote:
> >I would be grateful if somebody could confirm that I can use the
> >-S prog method and the StoreDescription configuration together.
>
> Yes. All three input methods (http, fs, prog) all feed into the different
> parsers (html, xml, txt), which is where the description is pulled out.
>
> So yes, it shouldn't matter which input method you use.
>
> Bill Moseley
> mailto:moseley@hank.org
Received on Fri Jun 15 22:16:32 2001