I have encountered this as well.  It seems that Xerces incorrectly allows 
characters to be formatted that it and other XML parsers will fail to parse.  
We've had to add a pre-filter to the text before it is added to the XML 
document, perhaps there is a better approach?
john

-----Original Message-----
From: Mosa Yeung [mailto:[email protected]] 
Sent: Monday, June 28, 2010 8:32 AM
To: [email protected]
Subject: Exceptions due to control characters

Hi guys,

I've encountered a situation where I have control characters (in this case, 
&#xF) being written to xml and read back out for output. XMLFormatter was able 
to write it to the xml file without escaping it, but XMLSanner emits an 
'XMLErrs::InvalidCharacterRef' on attempting to parse it. I actually don't care 
to have the control character pass through correctly, I just want it not to 
cause the whole parse to fail.

What's the "correct" way to deal with these control characters?

Mosa

Reply via email to