Thanks Bill.
The encoding is in Windows CP1251, which is 8-bit. I'll be working with
Macedonian texts, and there is already one swish-e search site in
Macedonian.
George.
Foreign Languages tel. 334-844-6376
6030 Haley Center fax. 334-844-6378
Auburn University
Auburn, AL 36849
home: www.auburn.edu/~mitrege
>>> Bill Moseley <moseley@hank.org> 11/12/05 4:03 PM >>>
On Sat, Nov 12, 2005 at 12:37:26PM -0800, George Mitrevski wrote:
> Hi folks.
>
> I am a new subscriber to the list, so please bear with me. I am
> currently working on constructing a text corpus for an East European
> language from texts that I have downloaded from the internet.
Are the texts in an encoding other than ISO-8859-1? I ask because swish-e
uses that encoding internally (it works only with 8-bit characters).
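For reference, a minimal Python sketch of what "8-bit" means for the CP1251 texts mentioned in the reply above. The sample word is a hypothetical example, not taken from the corpus, and the sketch only illustrates the byte-level property; it does not show how swish-e itself tokenizes or case-folds those bytes.

```python
# Hypothetical sample: a Macedonian word ("language"), chosen for illustration.
sample = "јазик"

# CP1251 is a single-byte encoding: each character becomes exactly one byte.
cp1251_bytes = sample.encode("cp1251")

# UTF-8, by contrast, uses two bytes for each of these Cyrillic letters.
utf8_bytes = sample.encode("utf-8")

print(len(sample), len(cp1251_bytes), len(utf8_bytes))

# The same bytes can be read as ISO-8859-1 without error, but they then
# appear as Latin-1 characters rather than Cyrillic ones.
misread = cp1251_bytes.decode("iso-8859-1")
print(misread)

# No information is lost: re-encoding as ISO-8859-1 and decoding as CP1251
# recovers the original text.
round_trip = misread.encode("iso-8859-1").decode("cp1251")
print(round_trip == sample)
```

So a tool that handles only 8-bit characters can carry CP1251 bytes through unchanged, although anything that depends on character meaning (case folding, word characters) may still need to be checked for the Cyrillic range.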
> Is it possible to get a copy of the script that runs the Swish-e
> List Archive (or a similar one) search interface?
The one on the list archive is the example code from the swish-e
distribution, basically. Go to http://swish-e.org and follow links
(at the top) to CVS and you can see the entire website's source.
--
Bill Moseley
moseley@hank.org
Received on Sat Nov 12 16:23:16 2005