[
https://issues.apache.org/jira/browse/XERCESC-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Scott Cantor updated XERCESC-2120:
----------------------------------
Affects Version/s: 3.2.3
3.2.2
3.2.1
3.2.4
> DOM Serialization does not correctly validate Surrogate Pairs
> -------------------------------------------------------------
>
> Key: XERCESC-2120
> URL: https://issues.apache.org/jira/browse/XERCESC-2120
> Project: Xerces-C++
> Issue Type: Bug
> Components: DOM
> Affects Versions: 3.2.0, 3.2.1, 3.2.2, 3.2.3, 3.2.4
> Reporter: Andrew Blackton
> Priority: Major
> Attachments: DOMCharacterValidationTest.cpp, DomStringValidation.patch
>
>
> When attempting to write an xml document containing valid UTF-16 surrogate
> pairs an error occurs during validation. This causes the write to fail.
> It appears as though this issue was introduced with
> https://issues.apache.org/jira/browse/XERCESC-1854 in the following commit
> http://svn.apache.org/viewvc/xerces/c/trunk/src/xercesc/dom/impl/DOMLSSerializerImpl.cpp?r1=768978&r2=1226891.
> I have supplied a reproducible and a potential patch. The string validator
> should be responsible for determining if the codepoint is part of a surrogate
> pair. However, I may also like to make the argument that this may not be the
> right location to be doing the string validation. As it will leave the output
> document in an inconsistent (half-written) state.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]