Skip to main content.
home | support | download

Back to List Archive

Re: Problems indexing german umlauts

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Fri May 07 2004 - 14:39:46 GMT
On Fri, May 07, 2004 at 07:17:35AM -0700, Sven Schupp wrote:
> Hi,
> 
> Bill Moseley wrote:
> > On Sun, Apr 25, 2004 at 09:12:22PM -0700, Jignesh Jani wrote:
> > 
> >>Hello Experts,
> >>
> >>I am facing problem in indexing German Umlauts, problem is swish-e =
> >>converts all the characters in 7-bit ASCII format. Hence when i try to =
> >>search a word containing umlauts i am not able to find it all the time. =
> >>I know about Translate Chars. But i need to do something more with it. =
> >>Is it possible to make swish-e work with 8-bit encoding.
> > 
> > 
> > Probably.  Add the Umlauts to WordCharacters (and to BeginCharacters and
> > EndCharacters if you expect it at the start or end of a word).
> 
> I am facing a problem only if an umlaut is the first char of a word.

Did you add it to BeginCharacters, too?

> 
> As an example, we have a word "Überbrückungsgeld".
> 
> If I try to search for "Überbrückungsgeld" swish-e gives me no results. 
> But if I search for "überbrückungsgeld" it'll give me a list with all 
> hits. Surprisingly all occurrances of this word have an big "Ü" as the
> first char!

Did you add both characters to your config?

-- 
Bill Moseley
moseley@hank.org
Received on Fri May 7 07:39:46 2004