http://gcc.gnu.org/bugzilla/show_bug.cgi?id=52459
--- Comment #1 from M8R-ynb11d at mailinator dot com 2012-03-02 07:11:47 UTC --- Similar (but much slower) results when not using SSE and using the libgcc library version of __builtin_popcount: -O0: 22.55 secs -O1: 20.57 secs -O2: 22.48 secs -Os: 22.81 secs -O3: 45.17 secs