Re: DejaGnu unit testing protocol

Jacob Bachmeyer Wed, 22 Jul 2020 19:40:35 -0700

David Malcolm wrote:

On Wed, 2020-07-22 at 17:05 -0500, Jacob Bachmeyer wrote:

[...]

In terms of "other comments":


FWIW within gcc, the jit.dg testsuite for libgccjit has a copy of
host_execute ("fixed_host_execute") to workaround issues I've run into:

https://gcc.gnu.org/git/?p=gcc.git;a=blob;f=gcc/testsuite/jit.dg/jit.exp

It has drifted somewhat from the DejaGnu original; for example it
gained the ability to parse valgrind output and convert leaks into
pass/fail results.

The only differences seem to be support for running the test undervalgrind, an expect_after to raise an error if the Expect matchingbuffer overflows, and some additional code to check the exit status ofthe invoked program.

I have already added a different solution to ensuring that the childprocess has time to finish: instead of immediately closing the spawnhandle when the {^Totals} line is reached, DejaGnu now (after thePR42399 fix) reads to EOF. This is not really a complete solutioneither, as it only works with local testing, and eventually the DejaGnuunit test protocol will need an explicit end marker to allow for remotetesting that returns to a shell prompt. This will need considerablymore planning to account for cases where a unit test executable crashesinstead of returning, including possibly crashing the remote host entirely.

I am unsure if DejaGnu should even recognize the "Totals" line, as itdoes not fit the overall pattern of {^\t[][[:upper:]]+:}. The internalunit tests for unit testing do not generate a "Totals" line at all.

Similarly, the full_buffer handling has been included into the mainexpect call in host_execute to fix that issue. Instead of aborting thetest, upstream DejaGnu will log an ERROR (causing the next test to berecorded as UNRESOLVED) and attempt to resynchronize. I believe thatthe unit testing protocol can support this, although we do not currentlyhave a test for this recovery sequence.

Generally, checking the exit status does not seem to be long-termsupportable, particularly with the future plans for transparent remotetesting, where an exit status may not be available on some hostplatforms. Please do not rely on it.

Future development should make this easier, but running the test undervalgrind should be currently possible with a wrapper instead ofreplacing host_execute.

Overall, 1.6.3 should enable you to replace fixed_host_execute with anew wrap_host_execute that handles using valgrind with a call toDejaGnu's host_execute procedure. (Also fixed as part of tests forPR42399: host_execute no longer insists on running executables from thecurrent directory. This was needed to make a regression test (writtenin Awk) for PR42399 run correctly.)

(I also ran into the issue that dejagnu.h's pass/fail C functions
aren't thread-safe, which I hack around in my testsuite, replacing them
in multi-threaded tests with ones guarded by a mutex).

Looking at dejagnu.h, I see a few problems; the use of a shared staticbuffer is definitely one of them. Would changing those to makedejagnu.h thread-safe be a sufficient concern to do for 1.6.3? If so,please file a bug report at <bug-deja...@gnu.org>; I see a solutionusing flockfile and vprintf that avoids building up a buffer at all andwould also fix the minor issue of truncating long test names. This willbe in 1.6.4 in any case, but if thread-safety in dejagnu.h is importantto you, please file a bug and we will work to land this as a bug fix for1.6.3 instead of an enhancement for 1.6.4.

The internal total counts are a less severe problem, as the DejaGnu testdriver will perform its own count as it reads the results. The C codealready plays fast and loose with counting results: xfail/xpass bumpthe same counters as fail/pass, respectively.



-- Jacob

Re: DejaGnu unit testing protocol

Reply via email to