Re: [PATCH] nvptx: Support floating point reciprocal instructions.

2020-07-29 Thread Tom de Vries
On 7/12/20 9:38 PM, Roger Sayle wrote: > > The following patch addds support for PTX's rcp.rn.f32 and rcp.rn.f64 > instructions. Note that the "rcp.rn" forms of this instruction > calculate the fully IEEE compliant result for the reciprocal, unlike > the rcp.approx variants that just provide fast

[PATCH] nvptx: Support floating point reciprocal instructions.

2020-07-12 Thread Roger Sayle
The following patch addds support for PTX's rcp.rn.f32 and rcp.rn.f64 instructions. Note that the "rcp.rn" forms of this instruction calculate the fully IEEE compliant result for the reciprocal, unlike the rcp.approx variants that just provide fast approximations. I'm undecided as to whether to