ID: 49660 Updated by: ka...@php.net Reported By: sta at netimage dot dk -Status: Open +Status: Feedback Bug Type: XML related Operating System: FreeBSD 7.1 PHP Version: 5.2.11 New Comment:
I guess we could expose the constant value from ext/libxml if available like: Index: libxml.c =================================================================== --- libxml.c (revision 288659) +++ libxml.c (working copy) @@ -622,6 +622,9 @@ REGISTER_LONG_CONSTANT("LIBXML_COMPACT", XML_PARSE_COMPACT, CONST_CS | CONST_PERSISTENT); REGISTER_LONG_CONSTANT("LIBXML_NOXMLDECL", XML_SAVE_NO_DECL, CONST_CS | CONST_PERSISTENT); #endif +#if LIBXML_VERSION >= 20703 + REGISTER_LONG_CONSTANT("LIBXML_PARSEHUGE", XML_PARSE_HUGE, CONST_CS | CONST_PERSISTENT); +#endif REGISTER_LONG_CONSTANT("LIBXML_NOEMPTYTAG", LIBXML_SAVE_NOEMPTYTAG, CONST_CS | CONST_PERSISTENT); /* Error levels */ Does this work for you when passing it to SimpleXML's $option parameter? (Patch made against PHP_5_3, but is just a 3 line c/p to other branches) Previous Comments: ------------------------------------------------------------------------ [2009-09-24 15:37:58] sta at netimage dot dk Description: ------------ Since version 2.7.3 libxml limits the maximum size of a single text node to 10MB. The limit can be removed with a new option, XML_PARSE_HUGE. PHP has no way to specify this option to libxml. I found the bug when making af SOAP-request where the reply contained a 20MB string. SoapClient->__call() threw an exception: 'looks like we got no XML document' Using libxml_use_internal_errors(true) and libxml_get_errors() I could narrow it down to a LibXMLError, code 5, 'Extra content at the end of the document' - but the specified line and column was in the middle of a large text node. Using SoapClient->__getLastResponse() I saved the response to a file. The xmllib program xmllint then revealed the cause: > xmllint --noout soap_response.txt soap_response.txt:111834: error: xmlSAX2Characters: huge text node: out of memory We need a way to specify the XML_PARSE_HUGE option to libxml - perhaps something like a new function: libxml_parse_huge(true). Reproduce code: --------------- <?php $xml = "<?xml version='1.0' encoding='utf-8' standalone='yes' ?><test>" . str_repeat('A', 12000000) . "</test>"; file_put_contents('file.xml', $xml); $sxe = simplexml_load_file('file.xml'); if ($sxe instanceof SimpleXMLElement) { echo 'OK\n'; } else { var_dump($sxe); } Expected result: ---------------- OK Actual result: -------------- PHP Warning: simplexml_load_file(): file.xml:1: error: xmlSAX2Characters: huge text node: out of memory in /usr/dana/data/developers/holst/mobilmap/cron/xml.php on line 5 PHP Warning: simplexml_load_file(): AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA in /usr/dana/data/developers/holst/mobilmap/cron/xml.php on line 5 PHP Warning: simplexml_load_file(): ^ in /usr/dana/data/developers/holst/mobilmap/cron/xml.php on line 5 bool(false) ------------------------------------------------------------------------ -- Edit this bug report at http://bugs.php.net/?id=49660&edit=1