This PR changes 17 tests. At 50 repeats each, that’s 850 tests. I’m not sure if StressNew does all 850 serially — if it does, they would have to complete in under 10 seconds per test to duck the concourse timeout (currently set a little over 2 hours).
Approximately how long do you expect each of these tests to take? Would it make sense to break up the PR into 2 PRs with fewer changed tests in each? Or do we just need to increase the timeout for StressNew to a much bigger value (what is reasonable, anyway? 6 hours? 12 hours?). If you’d like to change the timeout for this job, the following lines may be relevant: ci/pipelines/shared/jinja.variables.yml lines 111-118: - name: “StressNew" ... CALL_STACK_TIMEOUT: "7200" execute_test_timeout: 2h15m If we think there are good reasons to keep the 2-hour time limit in place, and there are extenuating reasons why your changes cannot be stressed within this time period, I propose we should be able to substitute additional manual reviewers in place of the imperfect automatic check here to get this approved. -Owen > On Dec 13, 2018, at 2:20 PM, Galen O'Sullivan <gosulli...@pivotal.io> wrote: > > On the PR for https://github.com/apache/geode/pull/2938, the StressNewTest > and (in two different runs of the same code) other jobs fail occasionally. > > I'm inclined to think that the Upgrade and Acceptance test failures were > caused by flaky tests, and I can keep rerunning the PR build. > > The StressNewTest issue is probably because the tests take a long time to > run. We can't see what tests run or how long they took because the > StressNewTests have an archive but no results page. > > If StressTest is going to take too long, should I ignore it and push, or is > there a way to disable it or dial it down for this PR? I know we've had > discussions on the list about not merging if the pipeline is, but I think > the StressTest failures are due to the pipeline not allowing enough time, > and it's meant to fix a test issue. > > StressTest jobs: > https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_builds_24422&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=5pwPNRvtAJAFP7w9SGYR-NUqYcl8RSrvSLXHd5dKU-o&m=q8-azNJP7a-Vb_gLuTJaHlC9VFqSd-uIaW85r2xRriY&s=KbF1dF4tF8e-rmTrJEVwrFjCuqNHZAhtphTogHN4U5k&e= > https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_builds_24423&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=5pwPNRvtAJAFP7w9SGYR-NUqYcl8RSrvSLXHd5dKU-o&m=q8-azNJP7a-Vb_gLuTJaHlC9VFqSd-uIaW85r2xRriY&s=SGe4aOZYEEVTrEMoMK73p5EjzNANF7wtenF7w7vxtAQ&e= > https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_builds_24061&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=5pwPNRvtAJAFP7w9SGYR-NUqYcl8RSrvSLXHd5dKU-o&m=q8-azNJP7a-Vb_gLuTJaHlC9VFqSd-uIaW85r2xRriY&s=T_8qb6cjL_qdcEbUsuITFtIXZnVoH6mKgXF22sYTL-Q&e= > https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_builds_24062&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=5pwPNRvtAJAFP7w9SGYR-NUqYcl8RSrvSLXHd5dKU-o&m=q8-azNJP7a-Vb_gLuTJaHlC9VFqSd-uIaW85r2xRriY&s=bcNQQeaLkfe_i5FL5MXu4eWNkhwjFfxksDLAkXkjV0c&e= > > UpgradeTest: > https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_builds_24060&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=5pwPNRvtAJAFP7w9SGYR-NUqYcl8RSrvSLXHd5dKU-o&m=q8-azNJP7a-Vb_gLuTJaHlC9VFqSd-uIaW85r2xRriY&s=IcWIfcx5y0MagkEI7xC9hgt9RTpYj9k881orD5lPau0&e= > AcceptanceTest: > https://urldefense.proofpoint.com/v2/url?u=https-3A__concourse.apachegeode-2Dci.info_builds_24415&d=DwIBaQ&c=lnl9vOaLMzsy2niBC8-h_K-7QJuNJEsFrzdndhuJ3Sw&r=5pwPNRvtAJAFP7w9SGYR-NUqYcl8RSrvSLXHd5dKU-o&m=q8-azNJP7a-Vb_gLuTJaHlC9VFqSd-uIaW85r2xRriY&s=_2_nAWdcjyyqKQjP9o6riCgrZb3HnQJ8nDog3Lzdoj4&e= > > > Thanks, > Galen