[Twisted-Python] XML parsing on twisted

Amaury Forgeot d'Arc amauryfa at gmail.com
Sat Oct 3 18:42:06 EDT 2009


Hi,

2009/10/4 gary clark <burslem2001 at yahoo.com>:
> Hello,
>
> I went to use the SAX library to perform parsing. However I hit an unusual
> problem.
>
> Its having a problem parsing a token that has a '&' it just crashes. It certainly woke me up when I witnessed this.
>
> <books> Jim & Nancy </books>
>
> are passed to the ContentHandler class. When characters(self,chars) member gets called I perform self.book+=chars to reassemble the string.
>
> I dump out the self.book string as its being assembled. If I replace the '&' character with "and" then its Ok. Of course I dont want to have to replace '&' with 'and'.
>
> Not good. So has anybody else seen this problem. I'm assuming it thinks its a reference and becomes confused when I attempt to add a reference.
>
> Much appreciated,

the & character is a special character in XML. You are parsing an
invalid XML string.
This page says it well:
http://www.devx.com/tips/Tip/14068
(a 10 years old "tip of the day"!)

-- 
Amaury Forgeot d'Arc



More information about the Twisted-Python mailing list