Hahnfeld added a comment.

In https://reviews.llvm.org/D52434#1248974, @gtbercea wrote:

> One big problem your code has is that the trip count is incredibly small,
> especially for STREAM and especially on GPUs. You need a much larger loop
> size, otherwise the timings will be dominated by OpenMP setup costs.

Sure, I'm not that dumb. The real code has larger loops, this was just for demonstration purposes. I don't expect the register count to change based on loop size - is that too optimistic?

Repository:
  rC Clang

https://reviews.llvm.org/D52434

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits