From:             php at richardneill dot org
Operating system: Linux
PHP version:      6CVS-2009-07-07 (CVS)
PHP Bug Type:     Performance problem
Bug description:  strtotime() seems surprisingly slow (3x worse than 
preg_match).

Description:
------------
I've just been troubleshooting a script whose job is to fix 
logfiles. Each line of the logfile has a timestamp, but only
HH:MM:SS and it's missing the date. My script has to fix this.

For a 500,000 line file, this takes about 2 minutes, so I 
investigated further. I found that strtotime() was really quite a 
lot slower than I expected, and that using preg_match() instead 
doubled the script execution speed.

Is this really to be expected, or could strtotime() be optimised?


Reproduce code:
---------------
#!/usr/bin/php
<?

//This demonstrates that strtotime() is really rather slow. The 
//example is quite contrived, but surely it should be possible to 
//make strtotime() much faster than an entire regular-expression
//match, rather than 3 times slower?

$time1=microtime(1);

$timestamp1="05:03:02";
$timestamp2="03:05:07";

for ($i=0;$i<1E6;$i++){
        $seconds = strtotime($timestamp1)  - strtotime($timestamp2);
        #echo "$seconds\n"; //Do something with $seconds
}

$time2=microtime(1);

for ($i=0;$i<1E6;$i++){
        //preg_match lets us both check that this IS a valid
        //timestamp and split it into parts.
        preg_match('/^(\d\d):(\d\d):(\d\d)(\.\d+)?$/', $timestamp1, $matches1);
        preg_match('/^(\d\d):(\d\d):(\d\d)(\.\d+)?$/', $timestamp2, $matches2);

        $seconds = ( 3600 * intval($matches1[1]) + 60 *
        intval($matches1[2]) + intval($matches1[3]) ) -
        ( 3600 * intval($matches2[1]) + 60 * 
        intval($matches2[2]) + intval($matches2[3])  );
        #echo "$seconds\n";  //Do something with $seconds
}

$time3=microtime(1);

# 42.8 on 2.4GHz CPU
echo "Each pair of calls to strtotime() took:".($time2 - $time1 )."
microseconds \n";
   
# 16.6.
echo "Each pair of calls to preg_match() etc took:".($time3 - $time2 )."
microseconds \n";
?>

Expected result:
----------------
I expect strtotime(), which is designed specifically for the 
purpose, to be faster than preg_match, which is more 
general-purpose.

Actual result:
--------------
strtotime() takes about 3 x longer than the brute-force approach 
using a regular-expression + array maths. Is that really expected?
date() isn't very fast either.

42us means 100,000 CPU cycles!


-- 
Edit bug report at http://bugs.php.net/?id=48828&edit=1
-- 
Try a CVS snapshot (PHP 5.2):        
http://bugs.php.net/fix.php?id=48828&r=trysnapshot52
Try a CVS snapshot (PHP 5.3):        
http://bugs.php.net/fix.php?id=48828&r=trysnapshot53
Try a CVS snapshot (PHP 6.0):        
http://bugs.php.net/fix.php?id=48828&r=trysnapshot60
Fixed in CVS:                        
http://bugs.php.net/fix.php?id=48828&r=fixedcvs
Fixed in CVS and need be documented: 
http://bugs.php.net/fix.php?id=48828&r=needdocs
Fixed in release:                    
http://bugs.php.net/fix.php?id=48828&r=alreadyfixed
Need backtrace:                      
http://bugs.php.net/fix.php?id=48828&r=needtrace
Need Reproduce Script:               
http://bugs.php.net/fix.php?id=48828&r=needscript
Try newer version:                   
http://bugs.php.net/fix.php?id=48828&r=oldversion
Not developer issue:                 
http://bugs.php.net/fix.php?id=48828&r=support
Expected behavior:                   
http://bugs.php.net/fix.php?id=48828&r=notwrong
Not enough info:                     
http://bugs.php.net/fix.php?id=48828&r=notenoughinfo
Submitted twice:                     
http://bugs.php.net/fix.php?id=48828&r=submittedtwice
register_globals:                    
http://bugs.php.net/fix.php?id=48828&r=globals
PHP 4 support discontinued:          http://bugs.php.net/fix.php?id=48828&r=php4
Daylight Savings:                    http://bugs.php.net/fix.php?id=48828&r=dst
IIS Stability:                       
http://bugs.php.net/fix.php?id=48828&r=isapi
Install GNU Sed:                     
http://bugs.php.net/fix.php?id=48828&r=gnused
Floating point limitations:          
http://bugs.php.net/fix.php?id=48828&r=float
No Zend Extensions:                  
http://bugs.php.net/fix.php?id=48828&r=nozend
MySQL Configuration Error:           
http://bugs.php.net/fix.php?id=48828&r=mysqlcfg

Reply via email to