Kewen:
On 6/3/24 22:59, Kewen.Lin wrote:
> Hi,
>
> on 2024/5/30 00:16, Carl Love wrote:
>> This was patch 13 from the previous series. Note the previous series patch
>> 12 was dropped. This patch is the same as the previous version. The
>> additional work to remove __builtin_vec_set_v1ti, __builtin_vec_set_v2di,
>> __builtin_vec_set_v2d per the feedback comments with equivalent gimple code
>> is being deferred to a future patch. The goal of this series was simply to
>> remove duplicated built-ins, extending overloaded built-ins as needed.
>> Adding the needed gimple code to remove the additional built-ins is beyond
>> the goal of this patch series.
>>
>> Carl
>> -------------------------------------------------------
>>
>> rs6000, remove vector set and vector init built-ins.
>>
>> The vector init built-ins:
>>
>> __builtin_vec_init_v16qi, __builtin_vec_init_v8hi,
>> __builtin_vec_init_v4si, __builtin_vec_init_v4sf,
>> __builtin_vec_init_v2di, __builtin_vec_init_v2df,
>> __builtin_vec_set_v1ti
>
> Typo here, s/__builtin_vec_set_v1ti/__builtin_vec_init_v1ti/
Fixed.
>
>>
>> perform the same operation as initializing the vector in C code. For
>> example:
>>
>> result_v4si = __builtin_vec_init_v4si (1, 2, 3, 4);
>> result_v4si = {1, 2, 3, 4};
>>
>> These two constructs were tested and verified they generate identical
>> assembly instructions with no optimization and -O3 optimization.
>>
>> The vector set built-ins:
>>
>> __builtin_vec_set_v16qi, __builtin_vec_set_v8hi.
>> __builtin_vec_set_v4si, __builtin_vec_set_v4sf
>
> Please also add the reserved ones (...v1ti/v2di/v2df), as they are the
> same too, temporarily reserving them for the uses in resolve_vec_insert()
> doesn't affect this.
Added the three additional built-ins to the list.
>
>>
>> perform the same operation as setting a specific element in the vector in
>> C code. For example:
>>
>> src_v4si = __builtin_vec_set_v4si (src_v4si, int_val, index);
>> src_v4si[index] = int_val;
>>
>> The built-in actually generates more instructions than the inline C code
>> with no optimization but is identical with -O3 optimizations.
>>
>> All of the above built-ins that are removed do not have test cases and
>> are not documented.
>>
>> Built-ins __builtin_vec_set_v1ti __builtin_vec_set_v2di,
>> __builtin_vec_set_v2df are not removed as they are used in function
>> resolve_vec_insert() in file rs6000-c.cc.
>>
>> The built-ins are removed as they don't provide any benefit over just
>> using C code.
>>
>> gcc/ChangeLog:
>> * config/rs6000/rs6000-builtins.def (__builtin_vec_init_v16qi,
>> __builtin_vec_init_v8hi, __builtin_vec_init_v4si,
>> __builtin_vec_init_v4sf, __builtin_vec_init_v2di,
>> __builtin_vec_init_v2df, __builtin_vec_set_v1ti,
>
> Typo, s/__builtin_vec_set_v1ti/__builtin_vec_init_v1ti/
Fixed
>
>> __builtin_vec_set_v16qi, __builtin_vec_set_v8hi.
>> __builtin_vec_set_v4si, __builtin_vec_set_v4sf,
>> __builtin_vec_set_v2di, __builtin_vec_set_v2df,
>> __builtin_vec_set_v1ti): Remove built-in definitions.
>
> The last three ones are not actually removed.
OK, fixed.
>
>> ---
>> gcc/config/rs6000/rs6000-builtins.def | 42 ++-------------------------
>> 1 file changed, 2 insertions(+), 40 deletions(-)
>>
>> diff --git a/gcc/config/rs6000/rs6000-builtins.def
>> b/gcc/config/rs6000/rs6000-builtins.def
>> index 48ebc018a8d..8349d45169f 100644
>> --- a/gcc/config/rs6000/rs6000-builtins.def
>> +++ b/gcc/config/rs6000/rs6000-builtins.def
>> @@ -1118,37 +1118,6 @@
>> const signed short __builtin_vec_ext_v8hi (vss, signed int);
>> VEC_EXT_V8HI nothing {extract}
>>
>> - const vsc __builtin_vec_init_v16qi (signed char, signed char, signed
>> char, \
>> - signed char, signed char, signed char, signed char, signed
>> char, \
>> - signed char, signed char, signed char, signed char, signed
>> char, \
>> - signed char, signed char, signed char);
>> - VEC_INIT_V16QI nothing {init}
>> -
>> - const vf __builtin_vec_init_v4sf (float, float, float, float);
>> - VEC_INIT_V4SF nothing {init}
>> -
>> - const vsi __builtin_vec_init_v4si (signed int, signed int, signed int, \
>> - signed int);
>> - VEC_INIT_V4SI nothing {init}
>> -
>> - const vss __builtin_vec_init_v8hi (signed short, signed short, signed
>> short,\
>> - signed short, signed short, signed short, signed short, \
>> - signed short);
>> - VEC_INIT_V8HI nothing {init}
>> -
>> - const vsc __builtin_vec_set_v16qi (vsc, signed char, const int<4>);
>> - VEC_SET_V16QI nothing {set}
>> -
>> - const vf __builtin_vec_set_v4sf (vf, float, const int<2>);
>> - VEC_SET_V4SF nothing {set}
>> -
>> - const vsi __builtin_vec_set_v4si (vsi, signed int, const int<2>);
>> - VEC_SET_V4SI nothing {set}
>> -
>> - const vss __builtin_vec_set_v8hi (vss, signed short, const int<3>);
>> - VEC_SET_V8HI nothing {set}
>> -
>> -
>> ; Cell builtins.
>> [cell]
>> pure vsc __builtin_altivec_lvlx (signed long, const void *);
>> @@ -1295,15 +1264,8 @@
>> const signed long long __builtin_vec_ext_v2di (vsll, signed int);
>> VEC_EXT_V2DI nothing {extract}
>>
>> - const vsq __builtin_vec_init_v1ti (signed __int128);
>> - VEC_INIT_V1TI nothing {init}
>> -
>> - const vd __builtin_vec_init_v2df (double, double);
>> - VEC_INIT_V2DF nothing {init}
>> -
>> - const vsll __builtin_vec_init_v2di (signed long long, signed long long);
>> - VEC_INIT_V2DI nothing {init}
>> -
>> +;; VEC_SET_V1TI, VEC_SET_V2DF and VEC_SET_V2DI are used in
>> +;; resolve_vec_insert(), rs6000-c.cc
>
> It would be good to place one TODO here, something like:
Added comment.
>
> ;; TODO: Remove VEC_SET_V1TI, VEC_SET_V2DF and VEC_SET_V2DI once the uses
> ;; in resolve_vec_insert are replaced by the equivalent gimple statements.
>
>> const vsq __builtin_vec_set_v1ti (vsq, signed __int128, const int<0,0>);
>> VEC_SET_V1TI nothing {set}
>>
>
> BR,
> Kewen
>