Skip to main content.
home | support | download

Back to List Archive

Re: http access with swish-e 2.0.2

From: <jmruiz(at)not-real.boe.es>
Date: Thu Sep 28 2000 - 15:29:46 GMT
Hi Bryan,

can you make
On 28 Sep 2000, at 7:41, Bryan Heidorn wrote:

> Yes, I had that problem too and Jose sent this patch.
> http indexing seems to work fine now but I have a new problem. The 
> resulting index contains the words that I need but search can not find them.
> I can make the index with http and run a
> swish-e2 -D -f index.file | grep abstract
> and I find a line for abstract
> 
> abstract: 2 1 1 9 1 138 2 7 1 9 1 4670 2 9 1 73 1.......
> 
> but then if I search
> swish-e2 -w "abstract" -f index.file
> I get
> ....
> # SWISH format 2.0
> # Search words: abstract
> err: no results
> 
> Bryan
> At 03:09 AM 9/25/00 -0700, jmruiz@boe.es wrote:
> >Hi Bryan,
> >
> >Sorry, it was my fault!!
> >Just change line 251 of http.c:
> >
> >From:
> >
> >fgets(buffer, sizeof(buffer), fp);
> >
> >To:
> >
> >fgets(buffer, lenbuffer, fp);
> >
> >Explanation:
> >I changed buffer from a internal function variable to a char pointer
> >that is allocated dinamically. In this case, sizeof(buffer) is always 4
> >(the size of a pointer) instead of the true length of the buffer. For this
> >reason the contenttype was never read properly and the contents
> >was not indexed.
> >
> >Sorry for the inconvenience. Let me know if this fix the problem.
> >
> >cu
> >Jose
> At 04:38 AM 9/28/00 -0700, Stephan Engelke wrote:
> >Hi everyone,
> >
> >I tried to index a site by using hte http access method using swish-e
> >version 2.0.2.  Unfortunatly the only words which ended up in the
> >index were the components of the filenames.  The swishspider script
> >returns the contents of the pages correctly.
> >
> >I am using the configuration file which came with the distribution and
> >modified only the relevant paths.
> >
> >Has anyone else experienced this, too?  Or is it a known bug?
> >
> >Regards -- Stephan
> >--
> >Stephan Engelke                                                engelke@gmx.net
> >               *** I.R.S.: We've got what it takes to take what you've 
> > got! ***
> 
> --
> --------------------------------------------------------------------
>    P. Bryan Heidorn    Graduate School of Library and Information Science
>    pheidorn@uiuc.edu   University of Illinois at Urbana-Champaign MC-493
>    (V)217/ 244-7792    501 East Daniel St., Champaign, IL  61820-6212
>    (F)217/ 244-3302    http://alexia.lis.uiuc.edu/~heidorn
> 
> 
Received on Thu Sep 28 15:30:15 2000