I was looking at the spec 456.hmmer benchmark and this email string from Jeff Law and Micheal Matz:
https://gcc.gnu.org/ml/gcc-patches/2015-11/msg01970.html and was wondering if anyone was looking at what more it would take for GCC to vectorize the loop in P7Viterbi. There is a big performance win to be had here if it can be done but the alias checking needed seems rather extensive. Steve Ellcey sell...@cavium.com