Skip to main content.
home | support | download

Back to List Archive

Re: Re: Bug in stemmer.c

From: Bill Moseley <moseley(at)>
Date: Tue Oct 19 1999 - 17:20:25 GMT
At 09:53 AM 10/19/99 -0700, Roy Tennant wrote:
>All patches are put in the Patch directory at:
>(Bill's is "stemmer.c" for example)

Note that the above Patch only contains a bug fix.  The version of
stemmer.c I use has other "adjustments."

For example, I don't stem any words that stem to one or less characters. 
I also call Stem() in a loop until a word no longer stems.  

If this isn't done then, for example,

  "playing" stems to "play"
  "play" stems to "plai"

So searching for "playing" fails to find "play" (and "plays", "played").

I guess it's debatable if that's a bug or not.

Bill Moseley
Received on Tue Oct 19 10:21:23 1999