Hahnfeld added a comment.

In https://reviews.llvm.org/D52434#1248974, @gtbercea wrote:

> One big problem your code has is that the trip count is incredibly small, 
> especially for STREAM and especially on GPUs. You need a much larger loop 
> size otherwise the timings will be dominated by OpenMP setups costs.


Sure, I'm not that dump. The real code has larger loops, this was just for 
demonstration purposes. I don't expect the register count to change based on 
loop size - is that too optimistic?


Repository:
  rC Clang

https://reviews.llvm.org/D52434



_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to