From: jon at hiveminds dot net Operating system: Windows 2000 / SP4 PHP version: 5.0.0 PHP Bug Type: Strings related Bug description: htmlentities() failure
Description: ------------ This string function fails on the "š" character (and other Latin-2 characters). Test: echo htmlentities("Ketšua"); Returns: Ketšua Reproduce code: --------------- Affected code: $value = htmlentities($value); preg_match_all("/&([^;]*);/", $value, $matches); $parts = preg_split("/&|;/", $value, -1, PREG_SPLIT_NO_EMPTY); foreach($parts as $part) $td->appendChild( in_array($part, $matches[1]) ? $doc->createEntityReference($part) : $doc->createTextNode($part) ); Point of failure: the string "Ketšua" Expected result: ---------------- Character should be converted to Š (upper) / š (lower) per HTML spec (see special-1.ent listing). Failing this, should it not be possible to create a TextNode containing this character when specifying a Latin-2 or Unicode charset? Actual result: -------------- Function returns the literal "š" character with no conversion. It is not possible to create a TextNode containing this character using DOMDocument::createTextNode(). I have written a workaround which replaces the Lat-2 chars with their entity equivalents using str_replace(), but even so PHP still issues several warnings when sending output to the browser using DOMDocument::saveHTML(): Warning: output conversion failed due to conv error in O:\webs\mysqli\oop-multi-select-with-dom.php on line 172 Warning: Bytes: 0x92 0x49 0x76 0x6F in O:\webs\mysqli\oop-multi-select-with-dom.php on line 172 Warning: xmlOutputBufferWrite: encoder error in O:\webs\mysqli\oop-multi-select-with-dom.php on line 172 The correct entity (š) does get sent, but I have to suppress the error with an "@", which I don't like doing. -- Edit bug report at http://bugs.php.net/?id=29609&edit=1 -- Try a CVS snapshot (php4): http://bugs.php.net/fix.php?id=29609&r=trysnapshot4 Try a CVS snapshot (php5): http://bugs.php.net/fix.php?id=29609&r=trysnapshot5 Fixed in CVS: http://bugs.php.net/fix.php?id=29609&r=fixedcvs Fixed in release: http://bugs.php.net/fix.php?id=29609&r=alreadyfixed Need backtrace: http://bugs.php.net/fix.php?id=29609&r=needtrace Need Reproduce Script: http://bugs.php.net/fix.php?id=29609&r=needscript Try newer version: http://bugs.php.net/fix.php?id=29609&r=oldversion Not developer issue: http://bugs.php.net/fix.php?id=29609&r=support Expected behavior: http://bugs.php.net/fix.php?id=29609&r=notwrong Not enough info: http://bugs.php.net/fix.php?id=29609&r=notenoughinfo Submitted twice: http://bugs.php.net/fix.php?id=29609&r=submittedtwice register_globals: http://bugs.php.net/fix.php?id=29609&r=globals PHP 3 support discontinued: http://bugs.php.net/fix.php?id=29609&r=php3 Daylight Savings: http://bugs.php.net/fix.php?id=29609&r=dst IIS Stability: http://bugs.php.net/fix.php?id=29609&r=isapi Install GNU Sed: http://bugs.php.net/fix.php?id=29609&r=gnused Floating point limitations: http://bugs.php.net/fix.php?id=29609&r=float