Hi Rainer,
I have been testing a page like this:
<title>City of München</title>
Bla, bla, München, bla, bla
Looking for München in title works fine (option -t)
Looking for München in the file also works fine.
But, you are right, when I change ü for ü
I could not find München because HTML entities
are not properly decoded.
Look at
http://sunsite.berkeley.edu/SWISH-E/Patches/parseTitle
This is an old partial patch to this problem. I have not applied
it yet because is limited to only the first 40 chars.
cu
Jose Ruiz
jmruiz@boe.es
Received on Tue Apr 25 10:20:26 2000