Re: [DISCUSS] Disable merge for failing pull requests

Jacob Barrett Tue, 04 Jun 2019 16:42:06 -0700

I’m still not interested until there is a solution for all community members to 
retrigger failed jobs. Also not excited about not having a way to override if 
there is a known issue that prevents a job from going green. My last PR needed 
multiple reruns because of a known flakey test with an active ticket. While 
mine eventually went green on its own it would have been annoying to keep 
hitting a retrigger.


-jake


> On Jun 4, 2019, at 4:06 PM, Owen Nichols <onich...@pivotal.io> wrote:
> 
> I’d like to follow up on this discussion from late last year.  Six months 
> ago, Kirk wrote:
> 
>> After we get it more consistently GREEN, I would be willing to change my 
>> vote to +1.
> 
> While we’re still not perfect, I have noticed that PR checks go green much 
> more consistently now than they did six months ago.  Also, Ryan wrote:
> 
>>>>>>>> I think we'd need clear guidelines on what to do if your PR fails due 
>>>>>>>> to something seemingly unrelated.
> 
> 
> If you still encounter a flaky failure occasionally, you can either 
> re-trigger all checks with an empty commit, or just ask on the dev list and 
> someone will be happy to help you re-trigger only your failed check.
> 
> 
> The above concerns were commonly cited as reasons for not moving ahead with 
> the proposal to enable GitHub policy to disable merge button until checks 
> have passed.  Even with these addressed, there is still a “people over 
> process” argument to be made for not imposing an enforced process (see recent 
> thread that rejected imposing a requirement of >0 reviews before allowing 
> merge).
> 
> 
> So, is there any interest at all in tightening GitHub controls?  Or maybe a 
> better way to approach the question is: are we Very Happy with our current 
> source control practices?
> 
> -Owen
> 
>> On Dec 26, 2018, at 4:03 PM, Kirk Lund <kl...@apache.org> wrote:
>> 
>> I'm changing my vote to -1 for disallowing merge until precheck passes.
>> 
>> The reason is that it's rare for me to see a 100% clean precheckin for any
>> of my PRs. There seems to always be some failure unrelated to my PR. For
>> example PR #3042 just hit GEODE-6008. If we make the change to disable the
>> merge button, then my PR could potentially be blocked indefinitely.
>> 
>> After we get it more consistently GREEN, I would be willing to change my
>> vote to +1.
>> 
>>> On Fri, Dec 21, 2018 at 10:36 AM Kirk Lund <kl...@apache.org> wrote:
>>> 
>>> I was responding to Udo's comment:
>>> 
>>> "Could one not configure the button that the owner of the PR cannot merge
>>> the PR?"
>>> 
>>> I'm +1 for disallowing merge until precheck passes.
>>> I'm -1 for disallowing the owner of the PR to merge it.
>>> 
>>>> On Fri, Dec 21, 2018 at 9:28 AM Helena Bales <hba...@pivotal.io> wrote:
>>>> 
>>>> Kirk, this change would not require you to get someone to merge it. It
>>>> would just require that your PR pass CI before it can be merged.
>>>> 
>>>>> On Thu, Dec 20, 2018 at 2:38 PM Kirk Lund <kl...@apache.org> wrote:
>>>>> 
>>>>> I have enough trouble just getting other developers to review my PR. I
>>>>> don't want to have to struggle to find someone to merge it for me, too.
>>>>> 
>>>>>> On Mon, Nov 19, 2018 at 4:09 PM Udo Kohlmeyer <u...@apache.org> wrote:
>>>>>> 
>>>>>> I don't believe "name and shame" is a hammer we should wield, but if
>>>> we
>>>>>> have use it... use it wisely
>>>>>> 
>>>>>> Could one not configure the button that the owner of the PR cannot
>>>> merge
>>>>>> the PR?
>>>>>> 
>>>>>> --Udo
>>>>>> 
>>>>>> 
>>>>>>> On 11/19/18 16:03, Dan Smith wrote:
>>>>>>> Closing the loop on this thread - I don't feel like there was enough
>>>>>>> agreement to go forwards with disabling the merge button, so I'm
>>>> going
>>>>> to
>>>>>>> drop this for now.
>>>>>>> 
>>>>>>> I would like to see everyone make sure that they only merge green
>>>> PRs.
>>>>>>> Maybe we can go with a name and shame approach? As in, we shouldn't
>>>> see
>>>>>> any
>>>>>>> new PRs show up in this query:
>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>> 
>>>> https://github.com/apache/geode/pulls?utf8=%E2%9C%93&q=is%3Apr+is%3Amerged+status%3Afailure
>>>>>>> 
>>>>>>> -Dan
>>>>>>> 
>>>>>>> On Tue, Nov 13, 2018 at 10:19 AM Ryan McMahon <rmcma...@pivotal.io>
>>>>>> wrote:
>>>>>>> 
>>>>>>>> +1 I like this idea, but I recognize that it will be a challenge
>>>> when
>>>>>> there
>>>>>>>> is still some flakiness to the pipeline.  I think we'd need clear
>>>>>>>> guidelines on what to do if your PR fails due to something
>>>> seemingly
>>>>>>>> unrelated.  For instance, we ran into GEODE-5943 (flaky
>>>>>> EvictionDUnitTest)
>>>>>>>> in our last PR, and saw that there was already an open ticket for
>>>> the
>>>>>>>> flakiness (we even reverted our change and reproduced to be
>>>> sure).  So
>>>>>> we
>>>>>>>> triggered another PR pipeline and it passed the second time.  Is
>>>>>> rerunning
>>>>>>>> the pipeline again sufficient in this case?  Or should we have
>>>> stopped
>>>>>> what
>>>>>>>> we were doing and took up GEODE-5943, assuming it wasn't assigned
>>>> to
>>>>>>>> someone?  If it was already assigned to someone, do we wait until
>>>> that
>>>>>> bug
>>>>>>>> is fixed before we run through the PR pipeline again?
>>>>>>>> 
>>>>>>>> I think if anything this will help us treat bugs that are causing a
>>>>> red
>>>>>>>> pipeline as critical, and I think that is a good thing.  So I'm
>>>> still
>>>>> +1
>>>>>>>> despite the flakiness.  Just curious what people think about how we
>>>>>> should
>>>>>>>> handle cases where there is a known failure and it is indeed
>>>> unrelated
>>>>>> to
>>>>>>>> our PR.
>>>>>>>> 
>>>>>>>> Ryan
>>>>>>>> 
>>>>>>>> 
>>>>>>>> On Mon, Nov 12, 2018 at 2:49 PM Jinmei Liao <jil...@pivotal.io>
>>>>> wrote:
>>>>>>>> 
>>>>>>>>> Just to clarify, that flaky EvictionDUnitTest is old flaky. The
>>>> PR to
>>>>>>>>> refactor the test passed all checks, even the repeatTest as well.
>>>> I
>>>>>> had a
>>>>>>>>> closed PR that just touched the "un-refactored"
>>>> EvictionDUnitTest, it
>>>>>>>>> wouldn't even pass the repeatTest at all.
>>>>>>>>> 
>>>>>>>>> On Mon, Nov 12, 2018 at 2:04 PM Dan Smith <dsm...@pivotal.io>
>>>> wrote:
>>>>>>>>> 
>>>>>>>>>> To be clear, I don't think we have an issue of untrustworthy
>>>>>> committers
>>>>>>>>>> pushing code they know is broken to develop.
>>>>>>>>>> 
>>>>>>>>>> The problem is that it is all to easy to look at a PR with some
>>>>>>>> failures
>>>>>>>>>> and *assume* your code didn't cause the failures and merge it
>>>>> anyway.
>>>>>> I
>>>>>>>>>> think we should all be at least rerunning the tests and not
>>>> merging
>>>>>> the
>>>>>>>>> PR
>>>>>>>>>> until everything passes. If the merge button is greyed out, that
>>>>> might
>>>>>>>>> help
>>>>>>>>>> communicate that standard to everyone.
>>>>>>>>>> 
>>>>>>>>>> Looking at the OpenJDK 8 metrics, it looks to me like most of the
>>>>>>>> issues
>>>>>>>>>> are recently introduced (builds 81-84 and the EvictionDUnitTest),
>>>>> not
>>>>>>>> old
>>>>>>>>>> flaky tests. So I think we were a little more disciplined always
>>>>>>>>> listening
>>>>>>>>>> to what the checks are telling us we would have less noise in the
>>>>> long
>>>>>>>>> run.
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>> 
>>>>>> 
>>>>> 
>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_teams_main_pipelines_apache-2Ddevelop-2Dmetrics_jobs_GeodeDistributedTestOpenJDK8Metrics_builds_23&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=s8zALi1UpxiUlTfCkFIvTI7Yi4EtlpqAZ68hQ4JDyaI&m=EBW_QlDSlKgshy50KztUi566idyTMguNUkj6wc1jLXo&s=tgtdFeHVZtk1hlNfH-VTlrV9-WkUt_tWv_yx7MjSUdo&e=
>>>>>>>>>> -Dan
>>>>>>>>>> 
>>>>>>>>>> On Mon, Nov 12, 2018 at 11:21 AM Udo Kohlmeyer <u...@apache.org>
>>>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>>> 0
>>>>>>>>>>> 
>>>>>>>>>>> Patrick does make a point. The committers on the project have
>>>> been
>>>>>>>>> voted
>>>>>>>>>>> in as "responsible, trustworthy and best of breed" and rights
>>>> and
>>>>>>>>>>> privileges according to those beliefs have been bestowed.
>>>>>>>>>>> 
>>>>>>>>>>> We should live those words and trust our committers. In the
>>>> end..
>>>>> If
>>>>>>>>>>> there is a "rotten" apple in the mix this should be addressed,
>>>>> maybe
>>>>>>>>> not
>>>>>>>>>>> as public flogging, but with stern communications.
>>>>>>>>>>> 
>>>>>>>>>>> On the other side, I've also seen the model where the submitter
>>>> of
>>>>> PR
>>>>>>>>> is
>>>>>>>>>>> not allowed to merge + commit their own PR's. That befalls to
>>>>> another
>>>>>>>>>>> committer to complete this task, avoiding the whole, "I'll just
>>>>>>>> quickly
>>>>>>>>>>> commit without due diligence".
>>>>>>>>>>> 
>>>>>>>>>>> --Udo
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>>> On 11/12/18 10:23, Patrick Rhomberg wrote:
>>>>>>>>>>>> -1
>>>>>>>>>>>> 
>>>>>>>>>>>> I really don't think this needs to be codified.  If people are
>>>>>>>>> merging
>>>>>>>>>>> red
>>>>>>>>>>>> PRs, that is a failing as a responsible developer.  Defensive
>>>>>>>>>> programming
>>>>>>>>>>>> is all well and good, but this seems like it's a bit beyond the
>>>>>>>> pale
>>>>>>>>> in
>>>>>>>>>>>> that regard.  I foresee it making the correction of a red main
>>>>>>>>> pipeline
>>>>>>>>>>>> must more difficult that it needs to be.  And while it's much
>>>>>>>> better
>>>>>>>>>> than
>>>>>>>>>>>> it had been, I am (anecdotally) still seeing plenty of
>>>> flakiness
>>>>> in
>>>>>>>>> my
>>>>>>>>>>> PRs,
>>>>>>>>>>>> either resulting from infra failures or test classes that need
>>>> to
>>>>>>>> be
>>>>>>>>>>>> refactored or reevaluated.
>>>>>>>>>>>> 
>>>>>>>>>>>> If someone is merging truly broken code that has failed
>>>>> precheckin,
>>>>>>>>>> that
>>>>>>>>>>>> seems to me to be a discussion to have with that person.  <s>
>>>> If
>>>>> it
>>>>>>>>>>>> persists, perhaps a public flogging would be in order. </s>
>>>> But at
>>>>>>>>> the
>>>>>>>>>>> end
>>>>>>>>>>>> of the day, the onus is on us to be responsible developers,
>>>> and no
>>>>>>>>>> amount
>>>>>>>>>>>> of gatekeeping is going to be a substitute for that.
>>>>>>>>>>>> 
>>>>>>>>>>>> On Mon, Nov 12, 2018 at 9:38 AM, Galen O'Sullivan <
>>>>>>>>>> gosulli...@pivotal.io
>>>>>>>>>>>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>> I'm in favor of this change, but only if we have a way to
>>>> restart
>>>>>>>>>>> failing
>>>>>>>>>>>>> PR builds without being the PR submitter. Any committer
>>>> should be
>>>>>>>>> able
>>>>>>>>>>> to
>>>>>>>>>>>>> restart the build. The pipeline is still flaky enough and I
>>>> want
>>>>>>>> to
>>>>>>>>>>> avoid
>>>>>>>>>>>>> the situation where a new contributor is asked repeatedly to
>>>> push
>>>>>>>>>> empty
>>>>>>>>>>>>> commits to trigger a PR build (in between actual PR review)
>>>> and
>>>>>>>>> their
>>>>>>>>>> PR
>>>>>>>>>>>>> gets delayed by days if not weeks. That's a real bad
>>>> experience
>>>>>>>> for
>>>>>>>>>> the
>>>>>>>>>>>>> people we want to be inviting in.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> On Mon, Nov 12, 2018 at 9:23 AM Alexander Murmann <
>>>>>>>>>> amurm...@pivotal.io>
>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> What's the general consensus on flakiness of the pipeline for
>>>>>>>> this
>>>>>>>>>>>>> purpose?
>>>>>>>>>>>>>> If there is consensus that it's still too flaky to disable
>>>> the
>>>>>>>>> merge
>>>>>>>>>>>>> button
>>>>>>>>>>>>>> on failure, we should probably consider doubling down on that
>>>>>>>> issue
>>>>>>>>>>>>> again.
>>>>>>>>>>>>>> It's hard to tell from just looking at the dev pipeline
>>>> because
>>>>>>>> you
>>>>>>>>>>>>> cannot
>>>>>>>>>>>>>> see easily what failures were legitimate.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> On Mon, Nov 12, 2018 at 8:47 AM Bruce Schuchardt <
>>>>>>>>>>> bschucha...@pivotal.io
>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> I'm in favor of this.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Several times over the years we've made a big push to get
>>>>>>>>> precheckin
>>>>>>>>>>> to
>>>>>>>>>>>>>>> reliably only to see rapid degradation due to crappy code
>>>> being
>>>>>>>>>> pushed
>>>>>>>>>>>>>>> in the face of precheckin failures.  We've just invested
>>>>> another
>>>>>>>>>> half
>>>>>>>>>>>>>>> year doing it again.  Are we going to do things differently
>>>>> now?
>>>>>>>>>>>>>>> Disabling the Merge button on test failure might be a good
>>>>>>>> start.
>>>>>>>>>>>>>>>> On 11/9/18 12:55 PM, Dan Smith wrote:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Hi all,
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Kirks emails reminded me - I think we are at the point now
>>>>> with
>>>>>>>>> our
>>>>>>>>>>>>> PR
>>>>>>>>>>>>>>>> checks where we should not be merging anything to develop
>>>> that
>>>>>>>>>>>>> doesn't
>>>>>>>>>>>>>>> pass
>>>>>>>>>>>>>>>> all of the PR checks.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> I propose we disable the merge button unless a PR is
>>>> passing
>>>>>>>> all
>>>>>>>>> of
>>>>>>>>>>>>> the
>>>>>>>>>>>>>>>> checks. If we are in agreement I'll follow up with infra to
>>>>> see
>>>>>>>>> how
>>>>>>>>>>>>> to
>>>>>>>>>>>>>>> make
>>>>>>>>>>>>>>>> that happen.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> This would not completely prevent pushing directly to
>>>> develop
>>>>>>>>> from
>>>>>>>>>>>>> the
>>>>>>>>>>>>>>>> command line, but since most developers seem to be using
>>>> the
>>>>>>>>> github
>>>>>>>>>>>>>> UI, I
>>>>>>>>>>>>>>>> hope that it will steer people towards getting the PRs
>>>> passing
>>>>>>>>>>>>> instead
>>>>>>>>>>>>>> of
>>>>>>>>>>>>>>>> using the command line.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Thoughts?
>>>>>>>>>>>>>>>> -Dan
>>>>>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> --
>>>>>>>>> Cheers
>>>>>>>>> 
>>>>>>>>> Jinmei
>>>>>>>>> 
>>>>>> 
>>>>>> 
>>>>> 
>>>> 
>>> 
>

Re: [DISCUSS] Disable merge for failing pull requests

Reply via email to