Hi,
Znver5 has an addss cost of 2 cycles, while other common floating point SSE
operations cost 3 cycles. We currently have only one entry in the cost tables,
which makes it impossible to model this. This patch adds sse_fp_op, which is used for
other common FP operations (basically conversions) and updates
> On Thu, 17 Apr 2025, Jan Hubicka wrote:
>
> > Hi,
> > Znver5 has an addss cost of 2 cycles, while other common floating point SSE
> > operations cost 3 cycles. We currently have only one entry in the cost tables,
> > which makes it impossible to model this. This patch adds sse_fp_op, which is used
> > f
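
To make the modelling problem in the quoted description concrete, here is a minimal sketch of a cost table with two separate entries. The struct, enum, and function names are hypothetical and heavily simplified; this is not GCC's actual cost structure or the patch itself. Only the field name sse_fp_op and the cycle counts (2 for addss, 3 for other common FP operations) come from the description.

```cpp
#include <cassert>

// Hypothetical, simplified stand-in for a per-CPU cost table.
// This is an illustration only, not GCC's real data structure.
struct fp_costs {
  int addss;      // cost of a scalar FP add, in cycles
  int sse_fp_op;  // cost of other common FP SSE operations (e.g. conversions)
};

// Hypothetical classification of an FP operation for costing purposes.
enum class fp_kind { add, other };

// Values taken from the description: addss is 2, other common ops are 3.
constexpr fp_costs znver5_sketch = {2, 3};

// With two entries, an add and a conversion can be costed differently.
// With a single shared entry, both would have to return the same number.
constexpr int fp_cost(const fp_costs &c, fp_kind k) {
  return k == fp_kind::add ? c.addss : c.sse_fp_op;
}
```

With a single shared field, the table would have to under-cost conversions or over-cost adds on this CPU; the second entry removes that trade-off.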