------- Comment #5 from laurent at guerby dot net 2006-10-22 12:04 ------- Eric, if the type is Long_Float your solution will be much worse since the alignement on the stack of the matrices will be 4-bytes so you have 50% chance the code will be N times slower (when not 8-bytes aligned).
So in practice you have to go malloc in order to get proper alignment for Long_Float, and there is no reason the proposed code should perform badly. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29543