Bohl, Thomas (StBA Krumbach) wrote on 6/25/07 10:58 AM:
> Hello everybody,
>
> A few hours ago I updated from 2.4.2 to 2.4.5. Now, when I create the index, I get hundreds of error messages:
>
>
> Indexing Data Source: "HTTP-Crawler"
> Indexing "http://bkru-intranet.stmi.bayern.de/search-index.php"
> http://bkru-intranet.stmi.bayern.de/list.php?tn=1:38: error: htmlParseEntityRef: expecting ';'
> <li><a href=nachricht.php?tn=1;&ID=326>Verbindungsabbrüche in Outlook</a><br>
> ^
> http://bkru-intranet.stmi.bayern.de/list.php?tn=2:39: error: htmlParseEntityRef: expecting ';'
> <li><a href=nachricht.php?tn=2;&ID=3>Einheitliche Datenablagestruktur und Dateib
> ^
> http://bkru-intranet.stmi.bayern.de/list.php?tn=4:39: error: htmlParseEntityRef: expecting ';'
> <li><a href=nachricht.php?tn=4;&ID=10>Informationen zum Datenaustausch mit GE-Of
> ^
> ...and so on.
>
>
> I don't get the point! Why are semicolons expected there?
>
> The index is fine, but my error log is full of these messages.
Sorry this thread is so old, Thomas. You may have figured out the issue. But for
the archives:
In 2.4.5 the libxml2 parser warnings were turned on by default. You can set the
level of warnings with the -W option at the command line, or in the config file
with ParserWarnLevel (note that the config option overrides the -W flag if present).
The warning you are getting is likely because you have a bare & in your URLs
instead of &amp;. The latter is XML-compliant, while the former is not: the
parser reads &ID as the start of an entity reference, which must end with a ';'.
In fact, it looks like you have 2 delimiters: the ';' and the '&'. Only one is
necessary, and the ';' is preferred because it is XML compliant (and shorter).
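
As a rough illustration of both fixes, here is a minimal Python sketch. Your
pages are generated by PHP, so this only shows the idea and is not a drop-in
patch. The helper name build_link and its parameters are made up for this
example. It builds the href with a single delimiter and escapes it before
writing it into the HTML. Note that whether ';' works as a delimiter depends
on the script that receives the request, so check that before switching.

```python
from html import escape
from urllib.parse import quote_plus


def build_link(path, params, text, separator="&"):
    """Return an <a> element whose href a strict HTML/XML parser accepts.

    build_link is a hypothetical helper for illustration only.
    """
    # Join the query parameters with exactly one delimiter.
    pairs = (f"{quote_plus(str(k))}={quote_plus(str(v))}" for k, v in params.items())
    url = f"{path}?{separator.join(pairs)}"
    # escape() turns a bare "&" into "&amp;" and quotes the attribute value safely.
    return f'<a href="{escape(url, quote=True)}">{escape(text)}</a>'


# Keep "&" as the delimiter, but write it as an entity in the markup.
print(build_link("nachricht.php", {"tn": 1, "ID": 326}, "Verbindungsabbrüche in Outlook"))

# Or use ";" as the only delimiter, which needs no escaping.
print(build_link("nachricht.php", {"tn": 1, "ID": 326}, "Verbindungsabbrüche in Outlook", separator=";"))
```

Either form should make the htmlParseEntityRef messages go away, since the
parser no longer sees an unterminated entity reference.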
--
Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
Received on Fri Aug 17 00:03:20 2007