This revision was automatically updated to reflect the committed changes.
Closed by commit rL312441: [OpenCL] Do not use vararg in emitted functions for
enqueue_kernel (authored by yaxunl).
Changed prior to commit:
https://reviews.llvm.org/D36678?vs=113483&id=113694#toc
Repository:
rL LLVM
Anastasia accepted this revision.
Anastasia added a comment.
This revision is now accepted and ready to land.
LGTM. Thanks!
https://reviews.llvm.org/D36678
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman
yaxunl updated this revision to Diff 113483.
yaxunl marked 3 inline comments as done.
yaxunl added a comment.
update tests.
https://reviews.llvm.org/D36678
Files:
lib/CodeGen/CGBuiltin.cpp
test/CodeGenOpenCL/cl20-device-side-enqueue.cl
Index: test/CodeGenOpenCL/cl20-device-side-enqueue.cl
Anastasia added inline comments.
Comment at: test/CodeGenOpenCL/cl20-device-side-enqueue.cl:116
+ // B32: store i32 4, i32* %[[TMP3]], align 4
+ // B32: call i32 @__enqueue_kernel_vaargs(%opencl.queue_t{{.*}}* [[DEF_Q]],
i32 [[FLAGS]], %struct.ndrange_t* [[NDR]]{{(.[0-9]+)?}},
yaxunl marked an inline comment as done.
yaxunl added inline comments.
Comment at: test/CodeGenOpenCL/cl20-device-side-enqueue.cl:116
+ // B32: store i32 4, i32* %[[TMP3]], align 4
+ // B32: call i32 @__enqueue_kernel_vaargs(%opencl.queue_t{{.*}}* [[DEF_Q]],
i32 [[FLAGS]], %st
Anastasia added inline comments.
Comment at: test/CodeGenOpenCL/cl20-device-side-enqueue.cl:116
+ // B32: store i32 4, i32* %[[TMP3]], align 4
+ // B32: call i32 @__enqueue_kernel_vaargs(%opencl.queue_t{{.*}}* [[DEF_Q]],
i32 [[FLAGS]], %struct.ndrange_t* [[NDR]]{{(.[0-9]+)?}},
yaxunl updated this revision to Diff 111302.
yaxunl marked an inline comment as done.
yaxunl added a comment.
Revised by Anastasia's comments.
https://reviews.llvm.org/D36678
Files:
lib/CodeGen/CGBuiltin.cpp
test/CodeGenOpenCL/cl20-device-side-enqueue.cl
Index: test/CodeGenOpenCL/cl20-devi
yaxunl marked an inline comment as done.
yaxunl added inline comments.
Comment at: test/CodeGenOpenCL/cl20-device-side-enqueue.cl:64
+ // B32: call i32 @__enqueue_kernel_vaargs(%opencl.queue_t{{.*}}* [[DEF_Q]],
i32 [[FLAGS]], %struct.ndrange_t* [[NDR]]{{(.[0-9]+)?}}, i8 addrspa
Anastasia added inline comments.
Comment at: test/CodeGenOpenCL/cl20-device-side-enqueue.cl:64
+ // B32: call i32 @__enqueue_kernel_vaargs(%opencl.queue_t{{.*}}* [[DEF_Q]],
i32 [[FLAGS]], %struct.ndrange_t* [[NDR]]{{(.[0-9]+)?}}, i8 addrspace(4)*
addrspacecast (i8 addrspace(1)
yaxunl created this revision.
Herald added a subscriber: tpr.
Not all targets support vararg (e.g. amdgpu). Instead of using vararg in the
emitted functions for enqueue_kernel,
this patch creates a temporary array of size_t, stores the size arguments in
the temporary array
and passes it to the e
10 matches
Mail list logo