Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] multiple Warnings: 'could not be encoded to charset 'ISO-8859-1'

From: Dr Michael Daly <"Dr>
Date: Thu, 15 Mar 2012 23:56:35 +1100 (EST)
for 'http://localhost:104/_docs/test3, via browser I just see the
previously described two .doc files.

I deleted a no. of html files from the ./_docs (parent) directory, and now
I get this output (and it just goes on & on & on trying to index .xls
files in the ./_docs dir, even though that dir is not configured to be
indexed

(only http://localhost:104/_docs/test3/Reception-duties.doc  and
http://localhost:104/_docs/test3 are to be indexed):

swish-e -S prog -c /share/MD0_DATA/swish-e-files/swish-e-conf/web_1.conf
Indexing Data Source: "External-Program"
Indexing "spider.pl"
External Program found: /opt/lib/swish-e/spider.pl
Missing argument in sprintf at /opt/lib/swish-e/spider.pl line 38.
Missing argument in sprintf at /opt/lib/swish-e/spider.pl line 38.
/opt/lib/swish-e/spider.pl: Reading parameters from 'default'

Summary for: http://localhost:104/_docs/test3/Reception-duties.doc
             Connection: Close:     1  (1.0/sec)
                   Total Bytes: 1,217  (1217.0/sec)
                    Total Docs:     1  (1.0/sec)
                   Unique URLs:     1  (1.0/sec)
application/msword->text/plain:     1  (1.0/sec)
Warning: document 'http://localhost:104/_docs/test3/' could not be encoded
to charset 'ISO-8859-1'
Warning: document 'http://localhost:104/_docs/' could not be encoded to
charset 'ISO-8859-1'
Warning: document 'http://localhost:104/' could not be encoded to charset
'ISO-8859-1'
http://localhost:104/_docs/2008%20CASH%20FLOW%20ESTIMATES.xls:317: error:
Unexpected end tag : table
</table>
        ^
http://localhost:104/_docs/2008%20CASH%20FLOW%20ESTIMATES.xls:318: error:
Unexpected end tag : table
</table>
        ^
Warning: document 'http://localhost:104/_docs/21st_aug/' could not be
encoded to charset 'ISO-8859-1'
http://localhost:104/_docs/xxx%20sims%20st.xls:396: error: Unexpected end
tag : table
</table>
        ^
http://localhost:104/_docs/xxx%20thom%20st.xls:191: error: Unexpected end
tag : table
</table>
        ^
http://localhost:104/_docs/xxx%20thom%20st.xls:192: error: Unexpected end
tag : table
</table>
        ^
Syntax Error: Couldn't read xref table
Syntax Warning: PDF file is damaged - attempting to reconstruct xref table...
http://localhost:104/_docs/Book1.xls:14648: error: Unexpected end tag : table
</table>
        ^
http://localhost:104/_docs/CASH%20FLOW%20AIMA%202007.xls:213: error:
Unexpected end tag : table
</table>
        ^
http://localhost:104/_docs/CASH%20FLOW%20AIMA%202007.xls:214: error:
Unexpected end tag : table
</table>
        ^
http://localhost:104/_docs/Contractor%20May%202011.xls:108: error:
Unexpected end tag : table




Dr Michael Daly wrote on 3/14/12 8:51 PM:
> Hi
> The error report seems to be related to the *directory name* itself.

[snip]


> Warning: document 'http://localhost:104/_docs/test3/' could not be
> encoded
> to charset 'ISO-8859-1'

what content does your webserver return for that URL?


--
Peter Karman  .  http://peknet.com/  .  peter(at)not-real.peknet.com
_______________________________________________
Users mailing list
Users(at)not-real.lists.swish-e.org
http://lists.swish-e.org/listinfo/users



Thanks

Michael
_______________________________________________
Users mailing list
Users(at)not-real.lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Thu Mar 15 2012 - 13:06:42 GMT