Is there such an animal? (freeware + html-aware)
At present, Swish-E does not work with arbitrary 8-bit data
(or specific alphabets), at least not right out of the box.
So, does anyone know of any other solutions for indexing
and searching sites rich in 8-bit text?
The --presumably commercial-- SFgate search engine at the
Polish newspaper of record "Rzeczpospolita" can serve here
as an example, as it allows searching of the indices using
several encodings of the Polish alphabet (a legacy of the
lack of standards AND Microsoft market dominance), as well
as using a symbolic, meta-characterized representation of
diacriticals (i.e., aogonek = /a; cacute = /c; etc.).
The search engine itself is accessible via
http://www.rzeczpospolita.pl/cgi-bin/SFgate.cgi
and the front end resides at
http://www.rzeczpospolita.pl/archiwa/index.html
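To make the idea concrete, here is a rough sketch of how a front end might
fold queries typed in different encodings, or in the slash notation, into
one canonical form before they reach the index. This is only a guess at the
general approach, not how SFgate actually does it. The function names are
made up for illustration, the three encodings are just common Polish ones
that Python happens to ship codecs for, and only the /a and /c mappings come
from the site; the other table entries are assumed by analogy.

```python
import re

# Slash notation as described above: "/a" = aogonek, "/c" = cacute.
# Entries beyond those two are assumptions added by analogy.
SLASH_TO_LETTER = {
    "a": "ą",  # given
    "c": "ć",  # given
    "e": "ę",  # assumed
    "n": "ń",  # assumed
    "s": "ś",  # assumed
}


def expand_slash_notation(text: str) -> str:
    """Replace "/x" with the matching Polish letter, preserving case."""
    def repl(match: re.Match) -> str:
        letter = match.group(1)
        mapped = SLASH_TO_LETTER.get(letter.lower())
        if mapped is None:
            return match.group(0)  # unknown pair: leave it untouched
        return mapped.upper() if letter.isupper() else mapped

    return re.sub(r"/([A-Za-z])", repl, text)


def normalize_query(raw: bytes, encoding: str) -> str:
    """Decode a query from the encoding the user declared, then expand
    any slash notation, so every variant yields the same search string."""
    return expand_slash_notation(raw.decode(encoding))


if __name__ == "__main__":
    # The same word typed four different ways ends up identical.
    for enc in ("iso8859_2", "cp1250", "cp852"):
        print(enc, normalize_query("mąka".encode(enc), enc))
    print("slash", normalize_query(b"m/aka", "ascii"))
```

The sketch relies on the user saying which encoding the query is in, as the
Rzeczpospolita form appears to do; guessing the encoding from the bytes
alone would be much less reliable.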
__Ian
Received on Fri Apr 24 13:58:01 1998