from:"VandeVondele Joost"

gcc parallel make check

2014-09-03 Thread VandeVondele Joost

I've noticed that make -j -k check-fortran results in a serialized checking, while make -j32 -k check-fortran goes parallel. Somehow the explicit 'N' in -jN seems to be needed for the check target, while the other targets seem to do just fine. Is that a feature, or should I file a PR for that

RE: gcc parallel make check

2014-09-03 Thread VandeVondele Joost

> It is intentional. With -j it is essentially a fork bomb, just don't use it. well, silently ignoring it for just this target did cost me a lot of time, while an eventual fork bomb would have been dealt with much more quickly. >> Somewhat related is there a rule of thumb on how is the gran

RE: gcc parallel make check

2014-09-03 Thread VandeVondele Joost

> What did you expect for -j alone? an error? No, as is standard in gnu make, a new process for any target that can be processed (i.e. unlimited). >> ... check-fortran seems to be limited to about ~5 parallel targets ... > >Running the make with -j8 gives 7 directories gfortran[1-6]? in gcc/tes

RE: gcc parallel make check

2014-09-03 Thread VandeVondele Joost

> I have to admit that I don't know why that's the case. Actually Marc answered that one (I had the wrong mail address for gcc@ so repeat here): https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53155 > See: gcc/fortran/Make-lang.in, which has: I'll have a look and do some testing what the gains/co

RE: gcc parallel make check

2014-09-03 Thread VandeVondele Joost

>> expect -- /usr/share/dejagnu/runtest.exp --tool gcc lto.exp weak.exp tls.exp >> ipa.exp tree-ssa.exp debug.exp >dwarf2.exp fixed-point.exp vxworks.exp >> cilk-plus.exp vmx.exp pch.exp simulate-thread.exp x86_64-costmodel-vect.exp >> i386-costmodel-vect.exp spu-costmodel-vect.exp ppc-costmodel

[PATCH] RE: gcc parallel make check

2014-09-05 Thread VandeVondele Joost

> The splits are in the Makefiles, see check_gcc_parallelize attached is a patch to improve the parallel performance of 'make -jXX -k check-fortran'. For XX=16, this yields ~50% speedup, and even with XX=4 we still have 15%, the measured slowdown at XX=1 (<2%) is in the noise of testing. The pa

RE: [PATCH] RE: gcc parallel make check

2014-09-05 Thread VandeVondele Joost

>> > Please sort the letters (LC_ALL=C sort) and where consecutive, use ranges. >> > Thus \[0-9A-Zhjqvx-z\]* OK, works fine with the attached patch, and looks cleaner in Make-lang.in. Now, with the proper email address for gcc-patches... I wonder how many time I'll be punished for typos. unmodi

RE: [PATCH] RE: gcc parallel make check

2014-09-08 Thread VandeVondele Joost

Attached is an extended version of the patch, it brings a 100% improvement in make -j32 -k check-gcc (down from 20min to <10min) by modification of check_gcc_parallelize. It includes one non-trivial part, namely a split of the target exps. They are now all split using a common choice (based on

RE: [PATCH] RE: gcc parallel make check

2014-09-09 Thread VandeVondele Joost

> +# ls -1 | ../../../contrib/generate_tcl_patterns.sh 300 > "dg.exp=gfortran.dg/" > > How does this work with subdirectories? Can we replace ls with find? The input to the script is general, you can use this to your advantage. For example, I've been using: ls -1 g++.*/* | cut -c5- | ../../.

RE: [PATCH] RE: gcc parallel make check

2014-09-09 Thread VandeVondele Joost

> No. As I wrote earlier, splitting on filenames and test counts only is only > very rough split, all the splits really need to be backed out by real timing > data from popular targets. I'm actually doing quite some testing trying to get a reasonable balance, checking 'completed in' in all *.l

RE: [PATCH] RE: gcc parallel make check

2014-09-09 Thread VandeVondele Joost

> If you get whitespace right, one can provide multiple different wildcards to > a single *.exp file, e.g. > make check-gcc RUNTESTFLAGS="dg.exp='p[0-9A-Za-qs-z]* pr[9A-Za-z]*'" should > cover all tests starting with p other than pr[0-8]*.c (where you could split > say pr[0-2]* into another job, pr

RE: [PATCH] RE: gcc parallel make check

2014-09-09 Thread VandeVondele Joost

Now with gzipped figure.. why do these bounce ? > But if there are jobs that just take 1s to complete, then clearly it doesn't > make sense to split them off as separate job. I think we don't need 100% > even split, but at least roughly is highly desirable. Let me add some data, attached is a gr

RE: [PATCH] RE: gcc parallel make check

2014-09-09 Thread VandeVondele Joost

Attached is a further revision of the patch, now dealing with check-c++. Roughly 50% speedup here at '-j32' (18m vs 12m). For my setup (--enable-languages=c,c++,fortran) I have now improved all targets called in 'make -j32 -k check'. The latter is now 30% faster (15m vs 20m). Note that there ar

RE: [PATCH] RE: gcc parallel make check

2014-09-10 Thread VandeVondele Joost

Thanks for testing. The vect-args.c I explained earlier, and is indeed due to i386.exp hardcoding those. The libstdc++ double counts didn't appear in my testing, but I'll have a look. Note that these patterns are handwritten, so error prone. The long tests in libstdc++ come from (in timing o

RE: [PATCH] RE: gcc parallel make check

2014-09-10 Thread VandeVondele Joost

> You mean enhancing the script to split across arbitrarily long prefixes? > That would be great. I've now a script that does something like that: ~/test$ find /data/vjoost/gnu/gcc_trunk/gcc/gcc/testsuite/gfortran.dg/ -maxdepth 1 -type f -printf "%f\n" | ./generate_patterns.py 500 foo All 3947

RE: [PATCH] RE: gcc parallel make check

2014-09-10 Thread VandeVondele Joost

Jakub, > First of all, the -j2 testing shows more tests tested in gcc and libstdc++: > >-# of expected passes 10133 >+# of expected passes 10152 > >+PASS: 23_containers/set/modifiers/erase/abi_tag.cc (test for excess errors) >[...] > >Not sure where the bug is, could be e.g. in

RE: [PATCH] RE: gcc parallel make check

2014-09-11 Thread VandeVondele Joost

> could it be that the pattern in normal1 should have been '[ab]*/ de*/ > [ep]*/*' ? I've checked that this fixes the bug in the current trunk split. I.e. files are stil tested, but now only once. Consider this change added to the previously submitted patch.

RE: [PATCH] RE: gcc parallel make check

2014-09-11 Thread VandeVondele Joost

>> could it be that the pattern in normal1 should have been '[ab]*/ de*/ >> [ep]*/*' ? > >Yes, we are running these tests multiple times: > >PASS: 23_containers/map/modifiers/erase/abi_tag.cc (test for excess errors) >PASS: 23_containers/multimap/modifiers/erase/abi_tag.cc (test for excess >erro

RE: [PATCH] gcc parallel make check

2014-09-11 Thread VandeVondele Joost

> Here is a patch I'm testing now: Hi Jakub, I also tested your patch to compare timings vs a newer patch (v8) I'll send soon == patch v8 == make -j32 -k == check-fortran 4m58.178s check-c++ ~10m check-c ~10m check 15m29.873s == patch Jakub check-c++ ~20m check-fortran

RE: [PATCH] gcc parallel make check

2014-09-11 Thread VandeVondele Joost

> And these Fortran inter-test dependencies, which Tobias told me is > PR56408. > For PR56408 we need some fix. BTW, is there anything special about Fortran ? There are at least 180 test files that contain 'dg-additional-sources' some in a very non-local way: ./objc.dg/foreach-2.m: /* {

RE: [PATCH] gcc parallel make check

2014-09-11 Thread VandeVondele Joost

>>> >For PR56408 we need some fix. >> BTW, is there anything special about Fortran ? There are at least 180 test >> files that contain 'dg-additional-sources' >some in a very non-local way: >The current scheme comes at its limits in that case. . See the files listed in >the PR for issues. So, w

RE: [PATCH] gcc parallel make check

2014-09-12 Thread VandeVondele Joost

> a newer patch (v8) I'll send soon attached with updated changelog. Compared to the previously posted v6, only the libstdc++-v3/testsuite/Makefile.am has been refined to split a little more the e*/* pattern, and two quickly running goal have been merged, in addition to fixing the pre-exisiting

RE: [PATCH] gcc parallel make check

2014-09-12 Thread VandeVondele Joost

>> Regtested on x86_64-linux, ok for trunk? > >Oh, forgot to say, PR56408 isn't fixed by this patch, but given the >higher granularity (10 tests instead of 1) we don't happen to trigger it >right now. which means that any commit to that dir could trigger it, right ?

RE: [PATCH] gcc parallel make check

2014-09-12 Thread VandeVondele Joost

> So, I’d love to see the numbers for 5 and 20 to double check that 10 is the > right number to pick. This sort of refinement is trivial post checkin. So, some timings with the patch, I think this is great. Doing the testing you suggest, changing the variable doesn't influence things much (at

RE: [PATCH] gcc parallel make check

2014-09-16 Thread VandeVondele Joost

>> > These numbers are useful to try and ensure the overhead (scaling factor) >> > is reasonable, thanks. >> >> A nice improvement indeed. The patched result is 15 times faster >> than the serial unpatched run. So there is room for improvement > > Note, the box used was oldish AMD 16-core, no ht

msan and gcc ?

2014-10-01 Thread VandeVondele Joost

Hi, I've noticed that gcc includes a msan_interface.h file, and I'm wondering if this implies that memory sanitizer is already part of gcc. If not, are there plans to port this useful looking tool to gcc during the current stage 1 ? Cheers, Joost

RE: msan and gcc ?

2014-10-01 Thread VandeVondele Joost

> it was certainly worth it. since I see msan as a kind of valgrind replacement (similar functionality, but ~10x the speed, partially at the cost of more difficult deployment), I did a quick search in gcc bugzilla. 982 PRs mention valgrind, so such functionality is clearly heavily used.

lto and gold

2009-08-15 Thread VandeVondele Joost

I'd like to test lto on a project where objects first go through an archive, and so wanted to follow http://gcc.gnu.org/wiki/LinkTimeOptimization using 'gcc -use-linker-plugin' However, I can't get this to work. gfortran -use-linker-plugin -flto main.f90 test.f90 /data03/vondele/binutils-2

Re: lto and gold

2009-08-16 Thread VandeVondele Joost

I guess this is some configure flag missing, does anybody have a clue? Yes, you must build with --enable-gold --enable-plugins :-) Is that for gcc or for binutils (neither documents this in ./configure --help) ? I used it for both, but only get this to work with binutils CVS, is that c

Re: Reducing fortran testcase with delta.

2009-10-30 Thread VandeVondele Joost

Hi Li, I've attached 'Fortran-aware' delta. I tries to guess cut a Fortran file in more reasonable places (e.g. between subroutine boundaries, after enddos). It works reasonably well, but is a hack. Especially with Fortran90 and modules, iterated delta runs can help a lot (i.e. first runs re

trunk bootstrap failure?

2008-12-17 Thread VandeVondele Joost

Current trunk fails for me with /data04/vondele/gcc_trunk/obj/./gcc/xgcc -B/data04/vondele/gcc_trunk/obj/./gcc/ -B/data04/vondele/gcc_trunk/build/x86_64-unknown-linux-gnu/bin/ -B/data04/vondele/gcc_trunk/build/x86_64-unknown-linux-gnu/lib/ -isystem /data04/vondele/gcc_trunk/build/x86_64-unkno

Re: trunk bootstrap failure?

2008-12-17 Thread VandeVondele Joost

thats is on a standard linux (x86_64) box running opensuse 11.0, and a clean checkout. Is this a known problem? You haven't installed the 32-bit glibc devel package. Many thanks, that fixed it. Would be great if such a thing could be detected at configure time (i.e. like missing mpfr.h head

Re: trunk bootstrap failure?

2008-12-17 Thread VandeVondele Joost

Would be great if such a thing could be detected at configure time (i.e. like missing mpfr.h headers are already detected), with some kind of a gentle error message. It wouldn't be detected until the target libs are built, since that's the first time any 32-bit headers are needed. Patches welco

Re: gfortran / gdb question

2009-02-02 Thread VandeVondele Joost

actually, I just find out that this seems a 4.4 issue, compiled with 4.3 the gdb session just goes fine... I also seem to be able to debug small examples with either 4.3 or 4.4, just CP2K seems to cause troubles (as usual ;-) I've filed PR39073 for this, somehow hope this can be solved before

optimization question

2009-05-16 Thread VandeVondele Joost

the attached code (see contract__sparse) is a kernel which I hope gets optimized well. Unfortunately, compiling (on opteron or core2) it as gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 test.f90 ./a.out Sparse: time[s] 0.66804099 New: time[s] 0.2080

Re: optimization question

2009-05-16 Thread VandeVondele Joost

thanks for the info I think it is useful to have a bugzilla here. will do. I tested 4.4, what did you test? 4.3 4.4 4.5 Joost

Re: optimization question

2009-05-16 Thread VandeVondele Joost

I think it is useful to have a bugzilla here. will do. http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168 Btw, complete unrolling is also hindred by the artificial limit of maximally unrolling 16 iterations. Your inner loops iterate 27 times. Also by the artificial limit of the maximal u

GCC performance with CP2K

2008-04-28 Thread VandeVondele Joost

I've just tested gcc/gfortran with CP2K, which some of you might know from PR29975 and other messages to the list, and observed some very pleasing evolution in the runtime of the code. In each case the set of compilation options is '-O2 -ffast-math -funroll-loops -ftree-vectorize -march=native

bootstrap broken?

2008-08-07 Thread VandeVondele Joost

dwarf2out.c:13496: internal compiler error: in extract_insn, at recog.c:1988 http://gcc.gnu.org/bugzilla/show_bug.cgi?id=37045

CP2K gcc nightly benchmark / wwwdocs patch

2008-08-12 Thread VandeVondele Joost

A nightly tester has been set up to track the performance of the gcc/gfortran compiler (trunk) for typical CP2K runs. Results and code can be found at: http://cp2k.berlios.de/gfortran/ I'll consider your suggestions for improvements. The following patch could be applied to the wwwdocs Inde

vectorizer question

2008-08-18 Thread VandeVondele Joost

The attached testcase yields (on a core2 duo, gcc trunk): gfortran -O3 -ftree-vectorize -ffast-math -march=native test.f90 time ./a.out real0m3.414s ifort -xT -O3 test.f90 time ./a.out real0m1.556s The assembly contains: ifort gfortran mulpd 140 0 mulsd

Re: vectorizer question

2008-08-18 Thread VandeVondele Joost

It would be nice to have a stand-alone testcase for this, so please file a bugreport. I've opened PR37150 for this. Thanks, Joost

42 matches

Mail list logo