We spent many hours trying to install Lucene and the whole development
environment, and never got it running (we don't have much java expertise
here). In contrast, we had swish-e running very quickly, and have been
quite pleased with the performance. We have about 6 million documents,
some with just titles and authors, others with the full text, averaging
about 10 pages of text.
You're welcome to test the search speed at http://tobaccodocuments.org,
though the speed is somewhat masked by the processing time in the
highlighting.
Tac
-----Original Message-----
From: swish-e@sunsite3.berkeley.edu
[mailto:swish-e@sunsite3.berkeley.edu] On Behalf Of Venkatraju
Sent: Monday, November 15, 2004 8:01 AM
To: Multiple recipients of list
Subject: [SWISH-E] Swish-e scalability, performance
Hi,
I am evaluating open source search engines for use in a project where
the data to be indexed could get pretty big (a few GB of documents, each
of about 10-20KB). I would like to hear the experiences of anybody who
has used Swish-E in such a scenario. Any hints/tips or caveats to be
aware of? What kind of search performance can I expect (given that I run
the search on a recent machine with lots of RAM)?
Also, has anyone compared Swish-E vs. Lucene in terms of scalability and
performance?
Thanks in advance,
Venkat
Received on Mon Nov 15 08:27:03 2004