Re: [PATCH] D20493: [CUDA] Add -fcuda-approx-transcendentals flag.

2016-05-23 Thread Justin Lebar via cfe-commits
This revision was automatically updated to reflect the committed changes. Closed by commit rL270484: [CUDA] Add -fcuda-approx-transcendentals flag. (authored by jlebar). Changed prior to commit: http://reviews.llvm.org/D20493?vs=58123&id=58145#toc Repository: rL LLVM http://reviews.llvm.org

Re: [PATCH] D20493: [CUDA] Add -fcuda-approx-transcendentals flag.

2016-05-23 Thread Artem Belevich via cfe-commits
tra added a comment. LGTM. http://reviews.llvm.org/D20493 ___ cfe-commits mailing list cfe-commits@lists.llvm.org http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Re: [PATCH] D20493: [CUDA] Add -fcuda-approx-transcendentals flag.

2016-05-23 Thread Justin Lebar via cfe-commits
jlebar updated this revision to Diff 58123. jlebar added a comment. More tightly scope the __USE_FAST_MATH__ macro. tra pointed out that device_functions.hpp uses __USE_FAST_MATH__ for its own purposes. For this CL, we only want to define __USE_FAST_MATH__ around math_functions.hpp. http://rev

[PATCH] D20493: [CUDA] Add -fcuda-approx-transcendentals flag.

2016-05-20 Thread Justin Lebar via cfe-commits
jlebar created this revision. jlebar added a reviewer: rnk. jlebar added subscribers: cfe-commits, tra. This lets us emit e.g. sin.approx.f32. See http://docs.nvidia.com/cuda/parallel-thread-execution/#floating-point-instructions-sin http://reviews.llvm.org/D20493 Files: include/clang/Basic/L