yaxunl added a comment.

In D56411#1349364 <https://reviews.llvm.org/D56411#1349364>, @jlebar wrote:

> Without reading the patch in detail (sorry) but looking mainly at the 
> testcase: It looks like we're not checking how overloading and `__host__ 
> __device__` functions play into this.  Maybe there are some additional 
> edge-cases to explore/check.


will add test for `__host__` `__device__`.

> Just some examples:
> 
> Will we DTRT and parse `bar` call as calling the `device` overload of `bar` in
> 
>   __host__ void bar() {}
>   __device__ int bar() { return 0; }
>   __host__ __device__ void foo() { int x = bar(); }
>   template <void (*devF)()> __global__ void kernel() { devF();}
>   
>   kernel<foo>();
> 
> 
> ?  Also will we know that we don't have to codegen `foo` for host (so `foo` 
> is actually able to do things that only device functions can)?

we DTRT for this case. Here `__host__` bar needs to return int since foo() 
expects that. will add a test for that.

> Another one: How should the following template be instantiated?
> 
>   __host__ constexpr int n() { return 0; }
>   __device__ constexpr int n() { return 1; }
>   template <int> __global__ void kernel() {}
>   
>   kernel<n()>
> 
> 
> Presumably the call to `n` should be the host one?  That seems correct to me, 
> but then it's pretty odd that a function pointer template argument would 
> point to a *device* function.  Maybe that's the right thing, but I bet I can 
> come up with something weird, like:

I think n() should be resolved in the containing function context. n itself is 
not template argument. the result of n() is.

> 
> 
>   __host__ void bar() {}
>   __device__ int bar() { return 0; }
>   __device__ auto baz() -> decltype(foo<n()>()) {} // which n() does it call? 
>  Presumably host, but:
>   __device__ auto baz() -> decltype(bar()) {}  // does baz return void or 
> int?  Presumably...the device one, int?
> 
> 
> Now mix in templates and sizeof and...yeah.  Rife for opportunities.  :)

I think this example is different from the issue which this patch tries to 
address. In the case of function type template parameter, it is less 
controversial about host/device resolution. The function argument is supposed 
to be called by the function template, therefore its host/device attribute 
should be consistent with the function template. Whereas in the above decltype 
example, such reqirement does not exist. Therefore I tend to suggest we keep 
things as they are, i.e., bar is host/device resolved in its containing 
function context.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D56411/new/

https://reviews.llvm.org/D56411



_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to