Skip to main content.
home | support | download

Back to List Archive

Indexing UTF-8 IIS Pages

From: <Mammitzsch.T(at)not-real.zdf.de>
Date: Wed Aug 04 2004 - 11:51:18 GMT
Hi everybody,

i try to spider an IIS 6.0 which delivers pages with utf-8 in the
http-header. As far as i understood the manual, swish-e converts utf-8 to
iso-8859-1 if i use libxml2 (html2-parser). Unfortunately special chars like
german umlauts are not recognized if i search through the swish.cgi
frontend. Also results with umlauts are not displayed correctly. swish-e
runs on a sun e450 with solaris 5.8. Any ideas?

best regards,

_______________________________________ 

Thomas Mammitzsch
Received on Wed Aug 4 04:51:31 2004