On Jul 29, 2014, at 7:56 AM, Peter Bergner <berg...@vnet.ibm.com> wrote: > Currently, the IBM long double routines in libgcc use a union to construct > a long double from two double values. This causes horrific code generation > that copies the two double from the FP registers over to GPRs and back > again, giving us two loads and two stores, which leads to two load-hit-store > hazzards.
Gosh, it’s too bad we don’t have any sort of technology to optimize moving data around.