ID:               49660
 Updated by:       ka...@php.net
 Reported By:      sta at netimage dot dk
-Status:           Open
+Status:           Feedback
 Bug Type:         XML related
 Operating System: FreeBSD 7.1
 PHP Version:      5.2.11
 New Comment:

I guess we could expose the constant value from ext/libxml if available
like:
Index: libxml.c
===================================================================
--- libxml.c    (revision 288659)
+++ libxml.c    (working copy)
@@ -622,6 +622,9 @@
        REGISTER_LONG_CONSTANT("LIBXML_COMPACT",        XML_PARSE_COMPACT,      CONST_CS | CONST_PERSISTENT);
        REGISTER_LONG_CONSTANT("LIBXML_NOXMLDECL",      XML_SAVE_NO_DECL,       CONST_CS | CONST_PERSISTENT);
 #endif
+#if LIBXML_VERSION >= 20703
+       REGISTER_LONG_CONSTANT("LIBXML_PARSEHUGE",      XML_PARSE_HUGE,         CONST_CS | CONST_PERSISTENT);
+#endif
        REGISTER_LONG_CONSTANT("LIBXML_NOEMPTYTAG",     LIBXML_SAVE_NOEMPTYTAG, CONST_CS | CONST_PERSISTENT);
 
        /* Error levels */


Does this work for you when passing it to SimpleXML's $options
parameter?

(Patch made against PHP_5_3, but it is just a 3-line copy/paste to
other branches)
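
A minimal sketch of how the constant might be used from userland,
assuming the patch above is applied and PHP is built against libxml
2.7.3 or later. The helper name load_huge_xml() is hypothetical; the
defined() guard falls back to no extra options on builds where the
constant is not registered.

```php
<?php
// Hypothetical helper: load an XML file, lifting libxml's text-node size
// limit when the LIBXML_PARSEHUGE constant from the patch is available.
function load_huge_xml($path)
{
    // Assumption: the constant is only registered when
    // LIBXML_VERSION >= 20703, so guard against older builds.
    $options = defined('LIBXML_PARSEHUGE') ? LIBXML_PARSEHUGE : 0;

    // The third argument of simplexml_load_file() is the libxml
    // options bitmask.
    return simplexml_load_file($path, 'SimpleXMLElement', $options);
}
```

With the reproduce script below, this would be called as
load_huge_xml('file.xml') in place of simplexml_load_file('file.xml').");
$sxe = load_huge_xml($tmp);
assert($sxe instanceof SimpleXMLElement);
assert(strlen((string) $sxe) === 1000);
unlink($tmp);
</test>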


Previous Comments:
------------------------------------------------------------------------

[2009-09-24 15:37:58] sta at netimage dot dk

Description:
------------
Since version 2.7.3 libxml limits the maximum size of a single text
node to 10MB.
The limit can be removed with a new option, XML_PARSE_HUGE.
PHP has no way to specify this option to libxml.

I found the bug when making a SOAP request where the reply contained a
20MB string.
SoapClient->__call() threw an exception: 'looks like we got no XML
document'

Using libxml_use_internal_errors(true) and libxml_get_errors() I could
narrow it down to a LibXMLError, code 5, 'Extra content at the end of
the document' - but the specified line and column was in the middle of a
large text node.
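
The diagnostic step described above can be sketched roughly as
follows. The helper name collect_libxml_errors() is hypothetical, and
the sketch parses a string with simplexml_load_string() rather than a
SOAP response; libxml_use_internal_errors(), libxml_get_errors() and
libxml_clear_errors() are the standard libxml functions.

```php
<?php
// Hypothetical helper: parse $xml and return libxml's errors as strings
// instead of letting them surface as PHP warnings.
function collect_libxml_errors($xml)
{
    // Buffer libxml errors internally and remember the previous setting.
    $previous = libxml_use_internal_errors(true);
    libxml_clear_errors();

    simplexml_load_string($xml);

    $messages = array();
    foreach (libxml_get_errors() as $error) {
        // Each entry is a LibXMLError with code, message, line and column.
        $messages[] = sprintf(
            'code %d at %d:%d: %s',
            $error->code,
            $error->line,
            $error->column,
            trim($error->message)
        );
    }

    // Restore the caller's error handling state.
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    return $messages;
}
```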

Using SoapClient->__getLastResponse() I saved the response to a file.
The libxml program xmllint then revealed the cause:
> xmllint --noout soap_response.txt 
soap_response.txt:111834: error: xmlSAX2Characters: huge text node: out
of memory

We need a way to specify the XML_PARSE_HUGE option to libxml - perhaps
something like a new function: libxml_parse_huge(true).


Reproduce code:
---------------
<?php
$xml = "<?xml version='1.0' encoding='utf-8' standalone='yes' ?><test>"
. str_repeat('A', 12000000) . "</test>";
file_put_contents('file.xml', $xml);
$sxe = simplexml_load_file('file.xml');
if ($sxe instanceof SimpleXMLElement) {
        echo "OK\n";
}
else {
        var_dump($sxe);
}


Expected result:
----------------
OK

Actual result:
--------------
PHP Warning:  simplexml_load_file(): file.xml:1: error:
xmlSAX2Characters: huge text node: out of memory in
/usr/dana/data/developers/holst/mobilmap/cron/xml.php on line 5
PHP Warning:  simplexml_load_file():
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
in /usr/dana/data/developers/holst/mobilmap/cron/xml.php on line 5
PHP Warning:  simplexml_load_file():                                   
                                            ^ in
/usr/dana/data/developers/holst/mobilmap/cron/xml.php on line 5
bool(false)


------------------------------------------------------------------------


