Re: [PATCH 4/6] testsuite: Add expected-fail to psim

Chris Johns Tue, 12 May 2020 02:12:02 -0700

On 12/5/20 5:15 pm, Sebastian Huber wrote:

Hello,
On 09/05/2020 03:30, Gedare Bloom wrote:
Without these tests being tagged this way the user would have no idea where they stand after a build and test run and that would mean we would have to make sure a release has no failures. I consider that as not practical or realistic.
Maybe we need another state, e.g. something-is-broken-please-fix-it.
I do not think so, it is implicit in the failure or the test is broken. The only change is to add unexpected-pass, that will be on master after the 5 branch.
I disagree with this in principle, and it should be reverted after we
branch 5. It's fine for now to get the release state sync'd, but we
should find a long-term solution that distinguishes the cases:
1. we don't expect this test to pass on this bsp
2. we expect this test to pass, but know it doesn't currently

They are two very different things, and I don't like conflating them
into one "expected-fail" case
Originally, I had the same point of view. What I didn't take into account was the perspective of the tester. Now, I think it is perfectly fine to flag these tests as expected failure test states. Right now, due to some known bugs such as https://devel.rtems.org/ticket/3982 and probably also some more issues, these tests fail. On this BSP and this RTEMS version, they will always fail. This is not some sort of random failure. When we change test states to expected failure I think we should make sure that a ticket exists, which captures that there are some test results which indicate issues (expected failure test state). The ticket system is the better place to manage this. We should not use the test states for this. The test states should be used to figure out changes between different test runs. They should also enable a quick check of whether the outcome of a test run yields the expected results for a certain RTEMS version and BSP.

Thanks. It is clear to me we lack documentation on this topic and this is an oversight on my part which I will attempt to correct.

I have reviewed DejaGnu and considered other things like the withdrawn IEEE 1003.3 standard and there are states we have that need to change but I think the original intent is the right path.


The DejaGnu states are documented here:

https://www.gnu.org/software/dejagnu/manual/A-POSIX-Conforming-Test-Framework.html#A-POSIX-Conforming-Test-Framework

And the exit codes are:

https://www.gnu.org/software/dejagnu/manual/Runtest.html#Runtest

For me they define the goal and intent.

The test states are metadata for the tester so it can determine the result of any given set of tests in relation to the expected state of the test when it was built. You need to detach yourself from being a developer and put yourself in the position of a tester whose task is to give an overall pass or fail for a specific build of RTEMS without needing to consider the specifics of any test, bug or feature.

The primary requirement is to allow machine checking of the results to determine regressions. A regression is a failure, pass or unresolved result that was not expected.


My current thinking for the test states is:

PASS:
The test has succeeded and passed without a failure.

UNEXPECTED-PASS:
The test has succeeded when it was expected to fail.

FAIL:

The test has not succeeded and has failed when it was expected to pass. The failure can be a failed assert, unhandled exception, resource constraint, or a faulty test.


EXPECTED-FAIL:
The test has not succeeded and has failed and this is expected.

UNRESOLVED:

The test has not completed and the result cannot be determined. The result can be unresolved because the test did not start or end, the test harness failed, or there were insufficient computing resources for the test harness to function correctly.


EXPECTED-UNRESOLVED:

The test has not completed and the result cannot be determined and this is expected.


INDETERMINATE:

The test has succeeded, has failed or is unresolved. The test is an edge case where the test can pass, can fail, can be unresolved and this is expected.


USER-INPUT:

The test has not completed and the result is unresolved because it requires user intervention that cannot be provided.


BENCHMARK:

The test performs a performance type test. These are currently not supported.


UNTESTED:

The test has not run and is a placeholder for a real test that is not yet provided.


UNSUPPORTED:
The test is not supported for this build of RTEMS, BSP or architecture.
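
As a rough illustration of how a tester could derive these states by machine, here is a minimal Python sketch. It is not the rtems-tools implementation: the function names and the lowercase expectation/outcome strings are hypothetical. It covers only the states that follow from comparing an expected result with an observed one (USER-INPUT, BENCHMARK, UNTESTED and UNSUPPORTED are left out). The handling of a pass against an expected-unresolved test is an assumption, since the list above does not spell that case out.

```python
# Illustrative sketch only; these names are hypothetical, not rtems-tools API.

# Expectation recorded in the test's metadata when it was built.
EXPECTATIONS = ("pass", "fail", "unresolved", "indeterminate")
# What the tester actually observed when running the test.
OUTCOMES = ("pass", "fail", "unresolved")

# States that were not expected, i.e. regressions.
REGRESSION_STATES = {"UNEXPECTED-PASS", "FAIL", "UNRESOLVED"}


def report_state(expected: str, outcome: str) -> str:
    """Map an (expected, observed) pair to a reported test state."""
    if expected not in EXPECTATIONS or outcome not in OUTCOMES:
        raise ValueError(f"unknown expectation/outcome: {expected!r}, {outcome!r}")
    if expected == "indeterminate":
        # Any outcome is acceptable for an edge-case test.
        return "INDETERMINATE"
    if outcome == "pass":
        # Assumption: a pass against an expected-unresolved test is also
        # reported as UNEXPECTED-PASS.
        return "PASS" if expected == "pass" else "UNEXPECTED-PASS"
    if outcome == "fail":
        return "EXPECTED-FAIL" if expected == "fail" else "FAIL"
    return "EXPECTED-UNRESOLVED" if expected == "unresolved" else "UNRESOLVED"


def is_regression(state: str) -> bool:
    """A regression is a pass, failure or unresolved result that was not expected."""
    return state in REGRESSION_STATES
```

With this, a test tagged as an expected failure that still fails reports EXPECTED-FAIL and is not a regression, while the same test suddenly passing reports UNEXPECTED-PASS and is flagged.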

Note:

1. Any expected failures, unresolved, or indeterminate test results are considered faults and require fixing.

2. The nature of a failure cannot be inferred from the test's metadata state.


3. The timeout and invalid states will be merged into UNRESOLVED.

4. The excluded state will be changed to UNSUPPORTED.

5. The metadata is placed in each test because it is an effective way to capture the state. Tests can be run as a group, standalone or at a different location and the test results can determine a regression. The version of the test harness does not need to match the RTEMS build.
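
The renaming in notes 3 and 4 could be expressed as a small lookup table. This is only a sketch: the lowercase spellings of the existing states and the helper name are assumptions, not taken from the tester's source.

```python
# Hypothetical mapping from the existing state names to the proposed ones.
LEGACY_STATE_MAP = {
    "timeout": "UNRESOLVED",    # note 3: merged into UNRESOLVED
    "invalid": "UNRESOLVED",    # note 3: merged into UNRESOLVED
    "excluded": "UNSUPPORTED",  # note 4: changed to UNSUPPORTED
}


def normalize_state(name: str) -> str:
    """Return the proposed state name for an existing or proposed state."""
    key = name.strip().lower()
    # States without a legacy mapping keep their name, upper-cased and
    # with underscores replaced by hyphens.
    return LEGACY_STATE_MAP.get(key, key.upper().replace("_", "-"))
```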

This list of test states accounts for some missing states. It also adds some states I do not see being available until we move to a new build system. For UNTESTED and UNSUPPORTED I see a template test being built and run that does nothing. This is important because it means we get a set of test results that is complete and consistent for all BSPs.

I can attend to this change before releasing 5.1 or it can be done on master and we can determine if it is back ported to 5.2[34..].


The change will come with documentation to explain things a little better.

I hope this addresses the issues we have and I am sorry for creating a disturbance so close to a release.


Chris