Re: Error Message: Index file error: Could not open

From: Peter Karman <karman(at)not-real.cray.com>
Date: Mon Jun 14 2004 - 16:07:09 GMT
Just so we're all on the same page:

1. are you running on *nix or Windows?
2. do you have a pdf->txt converter installed (like xpdf or the like)?

pek


Kaplan, Andrew H. wrote on 6/14/04 10:45 AM:

> I've continued work on trying to get Swish-e to be able to index the pdf
> files. I went through the motions of setting up the swish.conf file
> according to the instructions listed on the website. Here is what the file
> text looks like:
> 
> # Swish-e config to index the /www directory
> #
> # Use spider.pl for indexing (location of spider.pl set at installation time)
> IndexDir spider.pl
> 
> # Use spider.pl's default configuration and specify the URL to spider
> SwishProgParameters default http://localhost/www
> 
> # Allow extra searching by title, path
> Metanames swishtitle swishdocpath
> # Set StoreDescription for each parser to display context with search results
> StoreDescription HTML* <body> 200000
> StoreDescription TXT* <body> 200000
> 
> I ran the command swish-e -S prog -c swish.conf and the result was the
> following:
> 
> Indexing Data Source: "External-Program"
> Indexing "spider.pl"
> External Program found: /usr/local/lib/swish-e/spider.pl
> Removing very common words...
> no words removed.
> Writing main index...
> err: No unique words indexed!
> .
> 
> I have had no luck in resolving this issue. I am at the point where I am
> ready to install a pdf to word converter program that will change all the
> pdf files to .doc and/or .rtf files. Unless there is something else that I
> have missed, I have run out of ideas.
> 
> -----Original Message-----
> From: Bill Moseley [mailto:moseley@hank.org]
> Sent: Thursday, June 10, 2004 6:00 PM
> To: Kaplan, Andrew H.
> Subject: Re: [SWISH-E] Re: Error Message: Index file error: Could not open
> 
> 
> Please keep your questions to the list in the future.
> 
> On Thu, Jun 10, 2004 at 05:48:17PM -0400, Kaplan, Andrew H. wrote:
> 
>>Hi there --
>>
>>I'm not trying to index MP3 tags. The reason the MP3 package is installed is
>>due to my following the installation instructions.
>>Will my removing the MP3::Tag package and running the script again resolve
>>that issue?
> 
> 
> Yes, or following the instructions I sent on updating Filter.pm.
> 
> 
> 
>>If the indexing method that I mentioned does not automatically index the pdf
>>files, what else do I need to configure to get the pdf files to appear on
>>the list? When the indexing takes place, it appears that swish-e is reading
>>the pdf files and creating an index based on them.
> 
> 
> Swish-e doesn't know how to index PDF files without using a helper
> program.  There are a few ways to use a helper program:
> 
> 1) use FileFilter -- swish-e will pass the content through the filter
> during indexing.
> 
> 2) use spider.pl in the default config setup and it will attempt to use
> a Perl module called SWISH::Filter and automatically filter PDF and MS
> Word files.
> 
> 3) other ways you don't need to worry about right now.
> 
> I've explained all of this before.  The documentation explains this.
> I also pointed you to instructions on how to ask questions that will
> help get your problem solved.  Please review all of that again.
> 
> Thanks,
> 
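
For reference, option 1 in the quoted message (FileFilter) might look
roughly like the config below. This is only a sketch: it assumes pdftotext
(from xpdf) is installed and on the PATH, that indexing is done from the
file system rather than through spider.pl, and that /www is the directory
to index. None of those details come from the thread, so check the
directives against the SWISH-CONFIG documentation for your version.

```
# Sketch only: index PDF files from the file system through a filter.
# /www is an assumed path; replace it with the real document directory.
IndexDir /www
IndexOnly .pdf

# Pass each .pdf through pdftotext. %p is the path of the file being
# indexed, and "-" makes pdftotext write the converted text to stdout.
FileFilter .pdf pdftotext "'%p' -"

# Parse the filtered output with the plain-text parser.
IndexContents TXT* .pdf
```

With a config like this, indexing would be run as "swish-e -c swish.conf",
using the default file system access method instead of "-S prog".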

-- 
Peter Karman - Software Publications Engineer - Cray Inc
phone: 651-605-9009 - mailto:karman@cray.com
Received on Mon Jun 14 16:07:15 2004