I've included the following swish.conf:
=20
IndexName "Hardware Datasheets"
IndexDescription "This is an index of hardware datasheets from external
sources."
IndexPointer C:\"Program Files"\\SWISH-E
IndexAdmin "Swish-e Configuration Admin (holly.caruso@tenix.com)"
IndexOnly .pdf
FileFilter .pdf C:\"Program
Files"\\SWISH-E\\share\\doc\\swish-e\\filter-bin\\_pdf2html.pl
=20
MetaNames title subject author swishdocpath
=20
UndefinedMetaTags ignore
=20
WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-#,\/=3D+:
=20
IndexReport 3
=20
IgnoreWords of or and the a to i
=20
TranslateCharacters :ascii7:
=20
BumpPositionCounterCharacters |.
=20
StoreDescription TXT* 10000
StoreDescription HTML* <body> 10000
=20
With the following command: C:\Program Files\SWISH-E>swish-e -i
AM29LV128.pdf -c swish.conf -T indexed_words
The output is as follows:
=20
Indexing Data Source: "File-System"
Indexing "AM29LV128.pdf"
=20
Checking file "AM29LV128.pdf"...
AM29LV128.pdf - Using DEFAULT (HTML2) parser -
Adding:[1:swishdocpath(13)]
'am29lv128.pdf' Pos:2 Stuct:0x1 ( FILE )
Usage: C:\Program
Files\SWISH-E\share\doc\swish-e\filter-bin\_pdf2html.pl <filen
ame>
(no words indexed)
=20
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 1 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
1 unique word indexed.
5 properties sorted.
1 file indexed. 652,348 total bytes. 1 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!
=20
So its still not indexing a single file. Any help?
=20
-----Original Message-----
From: Peter Karman [mailto:peter@peknet.com]=20
Sent: Thursday, 6 July 2006 10:34 PM
To: CARUSO Holly
Cc: Multiple recipients of list
Subject: Re: [SWISH-E]
=20
=20
=20
CARUSO Holly scribbled on 7/6/06 3:54 AM:
=20
=20
> I have done what is suggested, running the index on a single file with
the =3D
> following command:
>=20
> C:\Program Files\SWISH-E>swish-e -i AM29LV128.pdf -T indexed_words
>=20
> =3D20
>=20
> I presume this commands doesn't use the swish.conf... some of the
output fr=3D
> om this commands is as follows:
>=20
=20
=20
the swish-e command doesn't know how to index .pdf files without a=20
config file to tell it how. So yes, you are correct in presuming that=20
swish.conf is not used. You need to use it in order for swish-e to=20
filter the .pdf file through the appropriate xpdf filters.
=20
--=20
Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
Disclaimer :
The contents of this e-mail including any attachments are intended only
for the person or entity to which this e-mail is addressed. If you are not,
or believe you may not be, the intended recipient, please advise the sender
immediately by return e-mail, delete this e-mail and destroy any copies.
Tenix does not warrant nor guarantee that this email communication is free
from errors, virus, interception or interference.
*********************************************************************
Due to deletion of content types excluded from this list by policy,
this multipart message was reduced to a single part, and from there
to a plain text message.
*********************************************************************
Received on Thu Jul 6 17:34:00 2006