ID:               22108
Comment by:       jaanus at heeringson dot com
Reported By:      bugzilla at jellycan dot com
Status:           Assigned
Bug Type:         Feature/Change Request
Operating System: Any
PHP Version:      All (as of the current implementation)
Assigned To:      moriyoshi
New Comment:

There are some consequences that are not mentioned in the above comments.
More than one BOM renders a perfectly valid XHTML file invalid. Apparently
the XHTML specification allows BOM characters, but only ONE, not several,
which is what you get if you include other UTF-8 files (include, require).
This in turn means the XML declaration may not be read, because it is not
found where it is expected. This is the case with the W3C validator.


Previous Comments:
------------------------------------------------------------------------

[2003-06-04 03:11:51] [EMAIL PROTECTED]

That script appears to be written in UTF-16. As for UTF-16, it could
actually be a parser problem as well, but this report addresses the issue
related to UTF-8.

------------------------------------------------------------------------

[2003-06-04 02:59:33] [EMAIL PROTECTED]

Actually, not totally. A friend mailed me a PHP script, which had the
annoying BOM AND the whole file was in double byte (saved by Notepad)...
which definitely makes it a parser problem too (\0 < \0 ? \0 p doesn't
match "<?p", for example).

------------------------------------------------------------------------

[2003-06-04 02:56:01] [EMAIL PROTECTED]

And just for clarification, this is a scanner problem, irrelevant to the
parser.

------------------------------------------------------------------------

[2003-06-04 02:45:43] [EMAIL PROTECTED]

It wasn't assigned, just set to open (and I didn't notice your name in the
"Assign to" field).

------------------------------------------------------------------------

[2003-06-04 02:40:30] [EMAIL PROTECTED]

Derick,

Please do not change the status of a bug that is already assigned to
someone.

There is no basis for saying that PHP can only handle ASCII documents,
because if you want to use German in PHP, for example, you have to use at
least ISO-8859-1 or ISO-8859-15, neither of which is ASCII.

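
The two byte-level effects described in the comments above can be shown with a minimal Python sketch. It is an illustration only, not PHP's scanner: the helper names (`count_boms`, `strip_leading_bom`) and the sample byte strings are hypothetical, and the "include" is modelled simply as concatenating the bytes each file would emit.

```python
import codecs

BOM = codecs.BOM_UTF8  # the three bytes EF BB BF


def count_boms(data: bytes) -> int:
    """Count UTF-8 BOM byte sequences anywhere in the data (hypothetical helper)."""
    return data.count(BOM)


def strip_leading_bom(data: bytes) -> bytes:
    """Remove a single BOM from the start of the data, if present (hypothetical helper)."""
    return data[len(BOM):] if data.startswith(BOM) else data


# Assumed scenario: the main script and one included file were both saved
# with a BOM, and each BOM is echoed to the output unchanged.
main_script_bom = BOM
included_file_bom = BOM
body = b'<?xml version="1.0" encoding="utf-8"?>\n<html></html>'
output = main_script_bom + included_file_bom + body

print(count_boms(output))  # two BOMs end up in the output

# A consumer that tolerates exactly one leading BOM still does not find
# the XML declaration at the start of the document.
print(strip_leading_bom(output).startswith(b"<?xml"))

# The UTF-16 case: the bytes "\0 < \0 ? \0 p" never contain the ASCII
# sequence "<?p" that a byte-oriented scanner looks for.
utf16_tag = "<?p".encode("utf-16-be")
print(utf16_tag, b"<?p" in utf16_tag)
```

The same check could be run against real files by reading them in binary mode; stripping the BOM from each included file before it is emitted would leave at most one BOM in the output.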
------------------------------------------------------------------------

The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at
    http://bugs.php.net/22108

-- 
Edit this bug report at http://bugs.php.net/?id=22108&edit=1