Notes on mixing D16/D32 code

2010-11-24 Thread Michael Hope
It's a bit of a newbie question, but I've been wondering if you can intermix hard float VFPv3-D16 code with VFPv3-D32 code. You can as: According to the ABI: * d0-d15 are used for floating point parameters, no matter if you are D16 or D32 * d0-d15 are not preserved across function calls * d16-

Re: microoptimising atomic memory ops

2010-11-24 Thread Peter Maydell
On 24 November 2010 21:18, Ken Werner wrote: > On Wednesday, November 24, 2010 8:29:35 pm Peter Maydell wrote: >>         strex   r0, r2, [r3] # store r2 (new) into  memory pointed to >> by r3 (mem) (Apologies for the linewrap damage, by the way -- blame google mail.) > Initially I thought r2 co

Re: microoptimising atomic memory ops

2010-11-24 Thread Ken Werner
On Wednesday, November 24, 2010 8:29:35 pm Peter Maydell wrote: > This wiki page came up during the toolchain call: > https://wiki.linaro.org/Internal/People/KenWerner/AtomicMemoryOperations/ The page was just moved to: https://wiki.linaro.org/WorkingGroups/ToolChain/AtomicMemoryOperations >

Re: microoptimising atomic memory ops

2010-11-24 Thread Michael Hope
(I've logged this as a potential speed improvement at LP: #681138 so we don't lose it) -- Michael On Thu, Nov 25, 2010 at 8:29 AM, Peter Maydell wrote: > This wiki page came up during the toolchain call: > https://wiki.linaro.org/Internal/People/KenWerner/AtomicMemoryOperations/ > > It gives the

[ACTIVITY] report week 47

2010-11-24 Thread Peter Maydell
(short week: only three days) RAG: Red: Amber: Green: qemu: initial pull req sent; vfp-in-sighandlers patchset sent Milestones: | Planned| Estimate | Actual | finish virtio-system | 2010-08-27 | postponed || get valgrind into linaro PPA

microoptimising atomic memory ops

2010-11-24 Thread Peter Maydell
This wiki page came up during the toolchain call: https://wiki.linaro.org/Internal/People/KenWerner/AtomicMemoryOperations/ It gives the code generated for __sync_val_compare_and_swap as including a push {r4} / pop {r4} pair because it uses too many temporaries to fit them all in callee-saves regi

Re: __sync barriers

2010-11-24 Thread Ken Werner
On Monday, November 22, 2010 12:34:04 pm Richard Sandiford wrote: > For the record, the thing I half-remembered on the call was: > > http://gcc.gnu.org/ml/gcc-patches/2009-08/msg00697.html > and: > http://gcc.gnu.org/ml/gcc-patches/2009-09/msg02112.html > > The problem is that all __sync