RE: How many documents can SWISH index?

Date: Tue Jul 11 2000 - 09:10:01 GMT

I remember from the code of version 1.3.2 that the number of different 
words is stored in 16-bit values, which would mean that probably 64k (or 
possibly 32k, if the values are signed) different words can be indexed.
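
As a quick sanity check of those figures, here is a small sketch of the
arithmetic. It is an illustration only and is not taken from the SWISH
source; the variable names are made up, and whether SWISH's counter is
signed or unsigned is an assumption I have not verified.

```python
import ctypes

BITS = 16  # assumed width of the word counter (from memory of 1.3.2)

# Number of distinct values an unsigned 16-bit counter can hold.
unsigned_limit = 2 ** BITS
# Number of non-negative values a signed 16-bit counter can hold.
signed_limit = 2 ** (BITS - 1)

# ctypes integer types do no overflow checking, so they show the
# wraparound that happens one step past each limit.
unsigned_wrapped = ctypes.c_uint16(unsigned_limit).value
signed_wrapped = ctypes.c_int16(signed_limit).value

print(unsigned_limit, signed_limit)      # 65536 32768
print(unsigned_wrapped, signed_wrapped)  # 0 -32768
```

So a counter of that width would wrap to 0 (unsigned) or go negative
(signed) once the number of different words passes the limit.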

I do not know if this limit still exists in version 2.0. 


IMO it is not the number of docs that is relevant, but the length/word
count of each document.

Currently we are indexing about 17,000 docs on a Solaris server (html, txt,
pdf and doc files). The PDFs range in length from 10 to 100 printed pages.

- rainer

I have been asked to pass this question on to the SWISH discussion group:

How many documents can SWISH index?

By this they meant: does SWISH fall over after n documents?
Are there any limits beyond basic memory and disk space?

What would be really useful is if people could say how many documents they
are indexing and by which method, i.e. FILESYSTEM or HTTP.

I have heard people mention that they are indexing up to 20,000 documents.
Is that correct? Has anyone indexed more?
I am especially interested in people indexing using HTTP.

