Hi Tom!
On 2022-02-15T11:52:29+0100, Tom de Vries <[email protected]> wrote:
> On 2/15/22 08:34, Thomas Schwinge wrote:
>> For my understanding:
Thanks for your explanations!
>> It is expected that this changes, for example (similar elsewhere)
>> 'nvptx-none/libatomic/store_4_.o', to use (a) 'atom.exch' (with a new
>> dummy register allocated)
>
> Yes.
>
> We could do slightly better by emitting that as:
> ...
> membar.sys;
> { .reg .u32 dummy;
> atom.exch.b32 dummy,[%r22],%r23;
> }
> membar.sys;
> ...
> which could improve register pressure.
Or, use the "bit bucket" operand -- assuming that's applicable here?
atom.exch.b32 _,[%r22],%r23;
For example, see PTX 3.1, 8.2 "PTX Instructions".
Grüße
Thomas
> I just wrote a patch for that (attached, ftr), but using a scratch
> register, and it seems that this similar code:
> ...
> void
> foo (U_4 *mptr, U_4 newval)
> {
> __atomic_exchange_n (mptr, newval, 5);
> }
> ...
> still maps to:
> ...
> .reg .u32 %r24;
> membar.sys;
> atom.exch.b32 %r24,[%r22],%r23;
> membar.sys;
> ...
> so that may not be the right way to do it.
> --- a/gcc/config/nvptx/nvptx.md
> +++ b/gcc/config/nvptx/nvptx.md
> @@ -89,9 +89,10 @@
> ;; only literal constants, which differ from the generic ones, which
> ;; permit subregs and symbolc constants (as appropriate)
> (define_predicate "nvptx_register_operand"
> - (match_code "reg")
> + (match_code "reg,scratch")
> {
> - return register_operand (op, mode);
> + return (register_operand (op, mode)
> + || (GET_CODE (op) == SCRATCH && GET_MODE (op) == mode));
> })
>
> (define_predicate "nvptx_nonimmediate_operand"
> @@ -188,7 +189,7 @@
>
> (define_constraint "R"
> "A pseudo register."
> - (match_code "reg"))
> + (ior (match_code "reg") (match_code "scratch")))
>
> (define_constraint "Ia"
> "Any integer constant."
> @@ -2036,6 +2037,7 @@
> (match_operand:SDIM 2 "nvptx_nonmemory_operand" "Ri"))] ;; input
> ""
> {
> + bool scratch_dst_p = GET_CODE (operands[0]) == SCRATCH;
> if (nvptx_mem_local_p (operands[1]))
> {
> output_asm_insn ("{", NULL);
> @@ -2047,7 +2049,9 @@
> return "";
> }
> const char *t
> - = "%.\tatom%A1.exch.b%T0\t%0, %1, %2;";
> + = (scratch_dst_p
> + ? "{ .reg.u%T0 dummy; %.\tatom%A1.exch.b%T0\t dummy,%1, %2; }"
> + : "%.\tatom%A1.exch.b%T0\t%0, %1, %2;");
> return nvptx_output_atomic_insn (t, operands, 1, 3);
> }
> [(set_attr "atomic" "true")])
> @@ -2079,7 +2083,7 @@
> /* Fall back to expand_atomic_store. */
> FAIL;
>
> - rtx tmpreg = gen_reg_rtx (<MODE>mode);
> + rtx tmpreg = gen_rtx_SCRATCH (<MODE>mode);
> emit_insn (gen_atomic_exchange<mode> (tmpreg, operands[0], operands[1],
> operands[2]));
> DONE;
-----------------
Siemens Electronic Design Automation GmbH; Anschrift: Arnulfstraße 201, 80634
München; Gesellschaft mit beschränkter Haftung; Geschäftsführer: Thomas
Heurung, Frank Thürauf; Sitz der Gesellschaft: München; Registergericht
München, HRB 106955