https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110751
--- Comment #8 from rguenther at suse dot de <rguenther at suse dot de> --- On Thu, 20 Jul 2023, juzhe.zhong at rivai dot ai wrote: > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110751 > > --- Comment #6 from JuzheZhong <juzhe.zhong at rivai dot ai> --- > (In reply to rguent...@suse.de from comment #5) > > On Thu, 20 Jul 2023, kito at gcc dot gnu.org wrote: > > > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110751 > > > > > > --- Comment #4 from Kito Cheng <kito at gcc dot gnu.org> --- > > > > OK, so TA is either merge or all-ones. > > > > > > Yes, your understand is correct, just few more detail is that can be > > > mixing > > > with either merge or all-ones. > > > > > > e.g. > > > > > > An 4 x i32 vector with mask 1 0 1 0 > > > > > > Op = | a | b | c | d | > > > Mask = | 1 | 0 | 1 | 0 | > > > > > > the result could be: > > > | a | b | c | d | > > > | a | all-1 | c | d | > > > | a | all-1 | c | all-1 | > > > | a | all-1 | c | d | > > > > > > > > > > Not sure how you can use MA at the moment since you specify an existing > > > > operand in your target hook. As far as > > > > I can see there's no value the target hook can provide that matches any > > > of the implementation semantics? > > > > > > That's the key point - we don't know how to return an undefined value > > > there, we > > > have intrinsic can generate undefined value, but it seems impossible to > > > generate that within the hook. > > > > Well, neither *A nor *U can be specified currently. As said for 'merge' > > we would need another operand. And since 'unspecified' is either merge > > or all-ones we can't express that either. It's not really 'undefined' > > either. > > > > Note this also means the proposal to define a .MASK_LOAD as zeroing > > masked elements is not going to work for RISC-V, instead we'd need > > an explicit 'else' value there as well. > > > > In fact we could follow .MASK_LOAD for .COND_* and simply omit > > the 'else' operand for the case of 'unspecified', no? GIMPLE would > > be fine omitting it, not sure whether there's precedent for > > optabs with optional operands? > > For RVV auto-vectorization, we define COND_LEN_* has else value in the > arguments. But the else value is not always the real value we need to > care about, this is the code from vectorizable_operation: > > if (reduc_idx >= 0) > { > /* Perform the operation on active elements only and take > inactive elements from the reduction chain input. */ > gcc_assert (!vop2); > vops.quick_push (reduc_idx == 1 ? vop1 : vop0); > } > else > { > auto else_value = targetm.preferred_else_value > (cond_fn, vectype, vops.length () - 1, &vops[1]); > vops.quick_push (else_value); > } > > > You can see for reduction operations, the else value is the real value we > need to depend on, we should use "TU" (Undisturbed or merge value) in RVV. > Meaning the inactive elements should remain the "old" value that's why we > use "TU". Sure. For the above case that's obviously correct. > However, for single binary operations for example, division, we just only > need to forbid the division operations of the inactive elements in the > hardware, we don't care the value of the inactive elements value. so in > this case, we want to use "TA". In this case, we want the else value be > a meaningless placeholder in Gimple IR (similar to "undef" or "poison" in > LLVM). > > Such meaningless placeholder in the argument of Gimple IR, can be beneficail > for RVV for 2 following reasons: > 1. allow us use "TA". > 2. Doesn't consume a register. > > I am not sure whether we can represent such placeholder in Gimple IR. As said, just drop the 'else' operand and assign 'unspecified' to its semantics? Like we do for .LEN_MASK_LOAD where there isn't any 'else' value and I presume you'll use 'TA' as well there?