This revision was automatically updated to reflect the committed changes.
Closed by commit rL270484: [CUDA] Add -fcuda-approx-transcendentals flag.
(authored by jlebar).
Changed prior to commit:
http://reviews.llvm.org/D20493?vs=58123&id=58145#toc
Repository:
rL LLVM
http://reviews.llvm.org
tra added a comment.
LGTM.
http://reviews.llvm.org/D20493
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
jlebar updated this revision to Diff 58123.
jlebar added a comment.
More tightly scope the __USE_FAST_MATH__ macro.
tra pointed out that device_functions.hpp uses __USE_FAST_MATH__ for its own
purposes. For this CL, we only want to define __USE_FAST_MATH__ around
math_functions.hpp.
http://rev
jlebar created this revision.
jlebar added a reviewer: rnk.
jlebar added subscribers: cfe-commits, tra.
This lets us emit e.g. sin.approx.f32. See
http://docs.nvidia.com/cuda/parallel-thread-execution/#floating-point-instructions-sin
http://reviews.llvm.org/D20493
Files:
include/clang/Basic/L