[
https://issues.apache.org/jira/browse/XERCESC-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Scott Cantor updated XERCESC-1663:
----------------------------------
Fix Version/s: (was: 2.9.0)
> IconvGNU and IconvFBSD based transcoders assume UCS-2 as XMLCh encoding
> -----------------------------------------------------------------------
>
> Key: XERCESC-1663
> URL: https://issues.apache.org/jira/browse/XERCESC-1663
> Project: Xerces-C++
> Issue Type: Bug
> Components: Utilities
> Affects Versions: 2.7.0
> Environment: any
> Reporter: Boris Kolpackov
> Assignee: Alberto Massari
> Priority: Blocker
> Fix For: 3.0.0
>
>
> I was studying the code in IconvGNU and IconvFBSD transcoders and it appears
> that they assume UCS-2 is the encoding for XMLCh when it's actually UTF-16. I
> believe this can result in the loss of data.
> The encoding that is used for XMLCh is stored in the fUnicodeCP variable
> which is initialized in the Iconv{GNU,FBSD}TransServices c-tor. The
> initialization code just tries all encodings from the
> gIconv{GNU,FBSD}Encodings array which for GNU contains only UCS-2 and for
> FreeBSD contains UCS-2 and UCS-4 encodings.
> I tried to add a UTF-16LE to this array (as a first item) and it works fine
> for GNU (I double checked that UTF-16LE ends up in fUnicodeCP). Does anybody
> knows what's going on here? Should we add UTF-16 to these arrays?
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]