Re: Comparison of indexing between stable and dev versions of SWISH-E

From: <m.chartoire(at)not-real.ipnl.in2p3.fr>
Date: Thu Aug 22 2002 - 07:43:31 GMT
On 21 Aug, Bill Moseley wrote:
> At 07:10 AM 08/21/02 -0700, m.chartoire@ipnl.in2p3.fr wrote:
>>But for the number of indexed files, it seems that 2.1-dev-25 does not index
>>equivalent servers. As you can see in the config files, I have the same directives:
>>
>>EquivalentServer http://lyoinfo.in2p3.fr http://snovae.in2p3.fr/ipnl
>>EquivalentServer http://lyoinfo.in2p3.fr http://doc.in2p3.fr/delphi/ipnl
> 
> You may specify more than one "equivalent" server on the same line, if you
> like.
> 
> Here's the quick hack patch:
> 
> Index: http.c
> ===================================================================
> RCS file: /cvsroot/swishe/swish-e/src/http.c,v
> retrieving revision 1.26
> diff -u -u -r1.26 http.c
> --- http.c      15 Aug 2002 05:32:16 -0000      1.26
> +++ http.c      21 Aug 2002 15:43:27 -0000
> @@ -183,7 +183,7 @@
>                  /* Add a new entry to this list */
>                  slist = (struct swline *) emalloc(sizeof(struct swline));
>  
> -                slist->line = sl->word[i];
> +                slist->line = estrdup( sl->word[i] );
>                  slist->next = list->list;
>                  list->list = slist;
>              }
> 
  With the patch, the "EquivalentServer" hosts are now indexed.
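
  The thread does not say why copying the string matters. One plausible
  reading (an assumption, not confirmed here) is that the parsed word
  array is freed or reused once the directive has been processed, which
  would leave the list pointing at stale memory. A minimal sketch of the
  idea, using a hypothetical "struct node" and plain malloc in place of
  swish-e's "struct swline", emalloc and estrdup:

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for swish-e's struct swline:
 * a singly linked list of strings. */
struct node {
    char *line;
    struct node *next;
};

/* Push a private copy of `word` onto the front of the list, so the
 * entry stays valid even after the caller frees or reuses its buffer.
 * Storing `word` itself would only alias the caller's memory. */
struct node *push_copy(struct node *head, const char *word)
{
    struct node *n = malloc(sizeof *n);
    assert(n != NULL);

    n->line = malloc(strlen(word) + 1);
    assert(n->line != NULL);
    strcpy(n->line, word);

    n->next = head;
    return n;
}

/* Release every node together with the string it owns. */
void free_list(struct node *head)
{
    while (head != NULL) {
        struct node *next = head->next;
        free(head->line);
        free(head);
        head = next;
    }
}
```

  Because each node owns its string, the list has to free it as well,
  which is what free_list does in this sketch.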

 When indexing my stopwords file, I see that 2.1-dev indexes single
 characters like "t" (from words like can't, didn't ...), because "'"
 is not in "WordCharacters" in the config files.

 My stopwords file is a mix of French and English words, and in French
 we use a lot of forms like l', d', c', s'. Perhaps the difference in
 the number of unique indexed words comes from this.
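
 To illustrate the effect, here is a small sketch of a tokenizer that
 keeps only characters from a given set. It is not swish-e's real
 tokenizer; the function name split_words and the fixed 32-byte token
 size are my own assumptions for the example.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Split `text` into tokens made only of characters found in `wordchars`.
 * Stores up to `max` tokens in `out`, each truncated to 31 characters,
 * and returns the number of tokens stored.
 * Hypothetical helper, not swish-e's actual tokenizer. */
size_t split_words(const char *text, const char *wordchars,
                   char out[][32], size_t max)
{
    size_t count = 0;   /* tokens completed so far */
    size_t len = 0;     /* length of the token being built */

    assert(text != NULL && wordchars != NULL);

    for (const char *p = text; ; p++) {
        /* strchr would match the terminator, so test for it first */
        int is_word = (*p != '\0') && (strchr(wordchars, *p) != NULL);

        if (is_word) {
            if (count < max && len < 31)
                out[count][len++] = *p;
        } else {
            if (len > 0) {          /* a non-word character ends the token */
                out[count][len] = '\0';
                count++;
                len = 0;
            }
            if (*p == '\0')
                break;
        }
    }
    return count;
}
```

 With only a-z as word characters, "can't" splits into "can" and "t".
 Adding "'" to the set keeps "can't" as one word, and the same applies
 to French forms such as l' and d'.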


-- 
Martial Chartoire, Service Informatique | E-mail: m.chartoire@ipnl.in2p3.fr
Institut de Physique Nucleaire de Lyon  | phone : +33 472 448 430
43, BD du 11 Novembre 1918              | fax   : +33 472 448 004
F 69622 Villeurbanne Cedex              |
Received on Thu Aug 22 07:47:11 2002