On 21 Aug, Bill Moseley wrote:
> At 07:10 AM 08/21/02 -0700, m.chartoire@ipnl.in2p3.fr wrote:
>>But as for the number of indexed files, it seems that 2.1-dev-25 does not index
>>equivalent servers. As you can see in the config files, I have the same directives:
>>
>>EquivalentServer http://lyoinfo.in2p3.fr http://snovae.in2p3.fr/ipnl
>>EquivalentServer http://lyoinfo.in2p3.fr http://doc.in2p3.fr/delphi/ipnl
>
> You may specify more than one "equivalent" server on the same line, if you
> like.
>
> Here's the quick hack patch:
>
> Index: http.c
> ===================================================================
> RCS file: /cvsroot/swishe/swish-e/src/http.c,v
> retrieving revision 1.26
> diff -u -u -r1.26 http.c
> --- http.c 15 Aug 2002 05:32:16 -0000 1.26
> +++ http.c 21 Aug 2002 15:43:27 -0000
> @@ -183,7 +183,7 @@
> /* Add a new entry to this list */
> slist = (struct swline *) emalloc(sizeof(struct swline));
>
> - slist->line = sl->word[i];
> + slist->line = estrdup( sl->word[i] );
> slist->next = list->list;
> list->list = slist;
> }
>
With the patch, the "EquivalentServer" entries are now indexed.
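
For anyone reading this in the archive, here is a rough sketch of why
copying the string can matter. It is not swish-e code: "struct node",
"push" and "survives_reuse" are hypothetical names, plain malloc/strcpy
stand in for estrdup, and I am only assuming that the parsed words
(sl->word[i]) get reused or released after the directive is read.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a linked list of strings (not swish-e's struct swline). */
struct node {
    char *line;
    struct node *next;
};

/* Push a word onto the front of the list.
 * copy != 0: the node owns a private duplicate (the idea behind the patch).
 * copy == 0: the node only borrows the caller's pointer (pre-patch idea). */
static struct node *push(struct node *head, char *word, int copy)
{
    struct node *n = malloc(sizeof *n);
    assert(n != NULL);
    if (copy) {
        n->line = malloc(strlen(word) + 1);
        assert(n->line != NULL);
        strcpy(n->line, word);
    } else {
        n->line = word;
    }
    n->next = head;
    return n;
}

/* Store a URL, then overwrite the caller's buffer (assumed here to model
 * the parser reusing its word storage). Returns 1 if the stored entry
 * still holds the original URL, 0 otherwise. */
static int survives_reuse(int copy)
{
    char buf[64];
    strcpy(buf, "http://snovae.in2p3.fr/ipnl");
    struct node *list = push(NULL, buf, copy);

    strcpy(buf, "overwritten by the next config line");

    int ok = strcmp(list->line, "http://snovae.in2p3.fr/ipnl") == 0;
    if (copy)
        free(list->line);
    free(list);
    return ok;
}
```

Under that assumption, survives_reuse(1) keeps the URL while
survives_reuse(0) sees the overwritten text, which would match the
symptom of the equivalent servers not being recognised.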
While indexing my stopwords file I noticed that in 2.1-dev single
characters such as "t" were indexed (from words like can't, didn't ...)
because "'" is not in "WordCharacters" in my config files.
My stopwords file is a mix of French and English words, and in French
we use a lot of elided forms: l', d', c', s'. Perhaps the difference in
the number of unique indexed words comes from this.
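
To illustrate the splitting I mean, here is a small sketch. It is only a
toy tokenizer under my own assumptions, not how swish-e actually parses
words: "split_words" is a hypothetical helper that treats every byte
listed in "wordchars" as part of a word and everything else as a
separator.

```c
#include <stddef.h>
#include <string.h>

/* Count the words in `text`, where a word is a run of bytes that all
 * appear in `wordchars`. The last word found is copied into `last`
 * (truncated to fit `last_size`, always NUL-terminated if last_size > 0). */
static int split_words(const char *text, const char *wordchars,
                       char *last, size_t last_size)
{
    int count = 0;
    size_t len = 0;               /* length of the word being read */

    if (last_size > 0)
        last[0] = '\0';

    for (const char *p = text; ; p++) {
        /* The explicit '\0' check matters: strchr would match the terminator. */
        if (*p != '\0' && strchr(wordchars, *p) != NULL) {
            if (len == 0)
                count++;          /* first byte of a new word */
            if (len + 1 < last_size) {
                last[len] = *p;
                last[len + 1] = '\0';
            }
            len++;
        } else {
            len = 0;              /* separator or end of text ends the word */
            if (*p == '\0')
                break;
        }
    }
    return count;
}
```

With wordchars set to the lowercase letters only, "can't" comes out as
two words, "can" and "t"; with "'" appended to wordchars it stays a
single word, and the same would apply to l', d', c', s'.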
--
Martial Chartoire, Service Informatique | E-mail: m.chartoire@ipnl.in2p3.fr
Institut de Physique Nucleaire de Lyon | phone : +33 472 448 430
43, BD du 11 Novembre 1918 | fax : +33 472 448 004
F 69622 Villeurbanne Cedex |
Received on Thu Aug 22 07:47:11 2002