On 10/8/25 6:32 PM, Vineet Gupta wrote:
On 10/8/25 11:54, Vineet Gupta wrote:
We are seeing about 100 billion dynamic instructions saved on cactuBSSN, which
is about 5%, and a 2% improvement in performance on the BPI.

For Cactu, QEMU dynamic icounts went down from 2,010,916,571,027 to
1,894,197,931,855
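
As a quick sanity check on those figures, here is a minimal sketch that uses only the two icounts quoted above; the variable names are illustrative and not from the original thread:

```python
# QEMU dynamic instruction counts for cactuBSSN, as quoted above.
before = 2_010_916_571_027
after = 1_894_197_931_855

# Absolute and relative reduction in dynamic instructions.
saved = before - after
pct = 100 * saved / before

print(f"saved: {saved:,} instructions")
print(f"reduction: {pct:.2f}%")
```

That works out to roughly 117 billion fewer instructions, a bit under 6%, which is consistent with the rounded "about 100 billion" and "about 5%" figures.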
Right in line with expectations. Note that on an out-of-order (OOO) core performance may not improve: the code in question is dominated by FP loads and FP ops, so there are plenty of spare scalar integer execution resources available to hide the inefficiencies in the old code.



This is just awesome. Thx Shreya for doing this.
Agreed. Interestingly enough, Philip R. is looking at this exact same issue right now on the LLVM side.

jeff
