On 7/12/20 9:38 PM, Roger Sayle wrote:
>
> The following patch adds support for PTX's rcp.rn.f32 and rcp.rn.f64
> instructions. Note that the "rcp.rn" forms of this instruction
> calculate the fully IEEE compliant result for the reciprocal, unlike
> the rcp.approx variants that just provide fast
The following patch adds support for PTX's rcp.rn.f32 and rcp.rn.f64
instructions. Note that the "rcp.rn" forms of this instruction
calculate the fully IEEE compliant result for the reciprocal, unlike
the rcp.approx variants that just provide fast approximations.
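
For illustration only, here is a minimal C sketch of what "fully IEEE
compliant" means for a reciprocal. It assumes an IEEE 754 host and no
-ffast-math. The helper names (recip_f32, recip_ref, count_mismatches)
are hypothetical and are not part of the patch, and whether the compiler
actually maps the 1.0f / x expression to rcp.rn.f32 is an assumption here.
The sketch only checks that the single-precision result agrees with a
reference computed in double and rounded once to float.

```c
#include <assert.h>

/* Hypothetical helper: the kind of source expression that a backend
   could plausibly implement with a correctly rounded reciprocal. */
static float recip_f32(float x)
{
    return 1.0f / x;
}

/* Reference value: divide in double precision, then round to float.
   For float operands, double carries enough extra bits that this
   second rounding does not change the result of the division. */
static float recip_ref(float x)
{
    return (float)(1.0 / (double)x);
}

/* Count inputs in 1..1000 where the two results differ. */
static int count_mismatches(void)
{
    int bad = 0;
    for (int i = 1; i <= 1000; i++) {
        float x = (float)i;
        if (recip_f32(x) != recip_ref(x))
            bad++;
    }
    return bad;
}

/* Self-check: a correctly rounded reciprocal never disagrees. */
static void self_check(void)
{
    assert(recip_f32(4.0f) == 0.25f);
    assert(count_mismatches() == 0);
}
```

An approximate reciprocal, by contrast, would be allowed to differ from
recip_ref in the low-order bits for some inputs, so a check like
count_mismatches() == 0 would not be expected to hold for it.
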
I'm undecided as to whether to