Skip to main content.
home | support | download

Back to List Archive

Re: XML2 parser error?

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Thu Jul 29 2004 - 18:48:52 GMT
On Thu, Jul 29, 2004 at 10:14:43AM +0100, Jonas Wolf wrote:
> I did some more testing, and indeed swish-e is doing everything correctly. 
> The XML parser recognises &#64; sequences fine, but breaks down on 
> characters below 32, such as &#4;, which is also correct behaviour. (As a 
> side note, this never generates an error message, it just stops indexing 
> the document at that point - Can you force error messages?).

ParserWarnLevel.  

I'm not 100% of the behavior but I think libxml2 will just abort
processing.


> The problem seems to be HTML::Entities::encode_entities, which
> generates these invalid character sequences.

Can you post a complete example?

-- 
Bill Moseley
moseley@hank.org

Unsubscribe from or help with the swish-e list: 
   http://swish-e.org/Discussion/

Help with Swish-e:
   http://swish-e.org/current/docs
   swish-e@sunsite.berkeley.edu
Received on Thu Jul 29 11:49:18 2004