Does not count files

From: Paul Thomas <paul(at)not-real.cuenet.com>
Date: Mon Mar 04 2002 - 04:30:29 GMT
Hi,

I index email archives for searching with Swish-e. When I set up
a new archive that has only a few emails in it, Swish-e does
not count the words. Once enough emails accumulate, Swish-e
counts the words as well as the files.

The following is the indexing output and the start of the index
file from an archive where the words are not counted. I am running
Swish-e on Linux.

Any comments appreciated.

Thanks,

--Paul

Removing very common words...
380 words removed.
44 words removed not in common words array:
s, pt, 2, 13, 45, 49, 3, 02, 5, 09, 40, am, tc, i, 4, 6, 11, ns, 10, 75,
9, 00, 01, 12, gi, es, 14, 08, 38, 0, u, en, rv, go, b, 22, 54, 46, hi, p,
m, 26, 03, t, 
Writing main index...
Computing hash table ...
Writing header ...
Writing index entries ...
Writing stopwords ...
no unique words indexed.
Writing file index...
Writing file list ...
Writing file offsets ...
Writing MetaNames ...
Writing offsets (2)...
6 files indexed.
Running time: 1 second.
Indexing done!

# SWISH format 2.0
# Swish-e format 2.0
# Name: PTN-Support index
# Saved as: index
# Counts: 6 files
# Indexed on: 03/03/2002 20:20:48 PST
# Description: This is an index of the PTN-Support Mailing List.
# Pointer: (no pointer)
# Maintained by: (no maintainer)
# DocumentProperties: Enabled
# Stemming Applied: 0
# Soundex Applied: 0
# WordCharacters: #&0123456789;abcdefghijklmnopqrstuvwxyz
# MinWordLimit: 3
# MaxWordLimit: 30
# BeginCharacters: "&'(0123456789abcdefghijklmnopqrstuvwxyz
# EndCharacters: "'),.0123456789\abcdefghijklmnopqrstuvwxyz
# IgnoreFirstChar: "'(
# IgnoreLastChar: "'),.;
0500
11months
20011019
2002
21mo
about
action
address
addy
agent
alternative
angle
anomalies
anus
archive
archives
...etc
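
To spot affected archives without reading each index by hand, the "# Counts:" header line can be checked for a word count. The following is only a rough sketch: the function name parse_counts is made up for illustration, and it assumes that a healthy index header reads something like "# Counts: 120 words, 6 files", which is not shown in the output above.

```python
import re


def parse_counts(header_lines):
    """Return the numbers found on the '# Counts:' header line as a dict.

    Hypothetical helper: the 'N words, M files' layout of a healthy
    header is an assumption, not something taken from the output above.
    """
    for line in header_lines:
        if line.startswith("# Counts:"):
            # Collect every "<number> <label>" pair after the colon.
            pairs = re.findall(r"(\d+)\s+(\w+)", line.split(":", 1)[1])
            return {label: int(number) for number, label in pairs}
    return {}


# Header lines copied from the index shown above.
header = [
    "# Name: PTN-Support index",
    "# Saved as: index",
    "# Counts: 6 files",
]

counts = parse_counts(header)
if "words" not in counts:
    print("index has no word count; files counted:", counts.get("files"))
```

Run against the header above, the check reports the missing word count, since only the file count appears on that line.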

--
Kilgore Trout: "The universe is a big place, perhaps the biggest."
Received on Mon Mar 4 04:31:06 2002