Skip to main content.
home | support | download

Back to List Archive

Problem with indexing xml with "prog" option

From: Cristiano Corsani <cristiano.corsani(at)not-real.bncf.firenze.sbn.it>
Date: Thu Dec 27 2001 - 11:38:34 GMT
It is the very first time I use swish-e.

I'm testing it with ms-access on w2000, but
the final version will run on linux mysql.

I wrote a script in vbs that query the database
and produce:
----------------------------------------------------------------------------
Path-Name: ANA0003056.xml
Content-Length: 128
Last-Mtime: 41194
<bid>ANA0003056</bid><author></author><title>arazzi rubensiani e tessuti
preziosi dei musei diocesani di ancona e osimo</title>
Path-Name: ANA0002686.xml
Content-Length: 111
Last-Mtime: 41194
<bid>ANA0002686</bid><author>anselmi sergio</author><title>immagini delle
marche negli archivi alinari</title>
----------------------------------------------------------------------------

I launch swish-e with parameters:
swish-e -S prog -c myconf.config

where myconf is:
----------------------------------------------------------------------------
IndexReport 4
ParserWarnLevel 3
IndexContents XML .xml
DefaultContents XML
MetaNames bid title
IndexFile /progetti/swish/myIndex.index
EnableAltSearchSyntax yes
IndexDir /winnt/system32/cscript.exe
SwishProgParameters //Nologo myScript.vbs
----------------------------------------------------------------------------

Well...the scripts works but the nswerto my index
process is:
----------------------------------------------------------------------------
Indexing Data Source: "External-Program"
Indexing "/winnt/system32/cscript.exe"

Warning: Unknown header line:
'<bid>ANA0003056</bid><author></author><title>arazzi rubensiani e tessuti
preziosi dei musei diocesani di ancona e osimo</title>' from program
/winnt/system32/cscript.exe

Warning: Unknown header line: '<bid>ANA0002686</bid><author>anselmi
sergio</author><title>immagini delle marche negli archivi alinari</title>'
from program /winnt/system32/cscript.exe

Removing very common words...
no words removed.
Writing main index...
err: No unique words indexed!
.
----------------------------------------------------------------------------

I don't really understand what's the problem.
Can anyone help me? Please!!!!!

Cristiano Corsani
----------------------------------------
Biblioteca Nazionale Centrale di Firenze
Piazza Cavalleggeri 1
50122 Firenze
Tel.: +39 055 24919 220
mailto:cristiano.corsani@bncf.firenze.sbn.it
http://www.bncf.firenze.sbn.it
Received on Thu Dec 27 11:40:07 2001