Re: [patch 4/3] Header file reduction - Tools for contrib

Andrew MacLeod Tue, 06 Oct 2015 12:20:14 -0700

On 10/06/2015 08:02 AM, Bernd Schmidt wrote:

This sounds like the intention is to move recognized core files (Iassume these are the ones in the "order" array in the tool) to thestart, and leaving everything alone? I was a bit confused about thisat first; I see for example "timevar.h" moving around without beingpresent in the list, but it looks like it gets added implicitlythrough being included by df.h. (Incidentally, this looks like anothercase like the obstack one - a file using timevars should includetimevar.h IMO, even if it also includes df.h).

Ordering the includes is perhaps more complex than you realize. It morecomplex than I realized when I first started it. it took a long and veryfrustrating period to get it working properly.

There are implicit dependencies between some include files. The primaryordering list is to provide a canonical order for key files so thatthose dependencies are automatically taken care of. Until now we'vemanaged it by hand. The problem is that the dependencies are notnecessary always from the main header file.. they may come from one ofthe headers that were included in it. There are lots of dependencies onsymtab.h for instance, which comes from tree.h Some other source filesdon't need tree.h, but they do need symtab.h. If symtab.h isn't in theordering list and the header which uses it is (like cgraph.h) , the toolwould move cgraph.h above symtab.h and the result doesn't work.

The solution is to take that initial canonical list, and fully expand itto include everything that those headers include. This gives a linearcanonical list of close to 100 files. It means things like timevar.h(which is included by df.h) are in this "ordering":

<...>
regset.h
alloc-pool.h
timevar.h
df.h
tm_p.h
gimple-iterator
<...>

A source file which does not include df.h but includes timevar.h muistkeep it in this same relative ordering, or some other header from theordering list which uses timevar.h may no longer compile. (timevar.hwould end up after everything in the canonical list instead of in fromtof the other file)

This means the any of those 100 headers files which occur in a sourcefile should occur in this order. The original version of the tool triedto spell out this exact order, but I realized that was not maintainableas headers change, and it was actually far simply to specify the coreones In the tool, and let it do the expansion based on what is in thecurrent tree.

This also means that taken as a snapshot, you are going to see thingslike timevar.h move around in apparently random fashion... but it is notrandom. It will be in front of any and all headers listed after it inthe ordering. Any headers which don't appear in the canonical list willsimply retain their current order in the source file, but AFTER all theones in the canonical list.

This also made it fairly easy to remove redundant includes that havebeen seen already by including some other header... I just build thelist of headers that have been seen already


There are a couple of specialty cases that are handled..

The 'exclude processing' list are headers which shouldn't be expandedlike above. They can cause irreconcilable problems when expanded ,especially the front end file files. They do need to be ordered sincediagnostics require them to be included first in order to satisfy therequirement that GCC_DIAG_STYLE be defined before diagnostic.h isincluded. Plus most of them include tree.h and/or diagnostic.hthemselves, but we don't want them to impact the ordering for thebackend files.

That list puts those core files in an appropriate place canoncailly, butdoesn't expand into the file because the order we get for the differentfront ends would be different . Finally diagnostic*.h and friends areremoved from the list and put at the end to ensure eveything that mightbe needed by them is available. Again, the front end files would havemade it much earlier than we wanted for the backend files.

I also disagree with the assertion that " a file using timevars shouldinclude timevar.h IMO, even if it also includes df.h" It could, but Idon't see the value, and I doubt anyone really cares much. If someoneever removes the only thing that does bring timevar.h, you simply add itthen. That is just part of updating headers. I'm sure before I runthis patch not every file which uses timevar.h actually physicallyincludes it. This process will set us to a somewhat consistent state.

Its simple enough to remove the ones that are redundant in anautomated way, and very difficult to determine whether they notrequired, but contain content that is used.


The fully expanded canonical list looks something like this:

safe-ctype.h
filenames.h
libiberty.h
hwint.h
system.h
insn-modes.h
machmode.h
signop.h
wide-int.h
double-int.h
real.h
fixed-value.h
statistics.h
gtype-desc.h
ggc.h
vec.h
hashtab.h
inchash.h
mem-stats-traits.h
hash-traits.h
hash-map-traits.h
mem-stats.h
hash-map.h
hash-table.h
hash-set.h
line-map.h
input.h
is-a.h
memory-block.h
coretypes.h
options.h
tm.h
function.h
obstack.h
bitmap.h
sbitmap.h
basic-block.h
dominance.h
cfg.h
backend.h
insn-codes.h
hard-reg-set.h
target.h
genrtl.h
rtl.h
c-target.h
c-target-def.h
symtab.h
tree-core.h
tree-check.h
tree.h
cp-tree.h
c-common.h
c-tree.h
gfortran.h
tree-ssa-alias.h
gimple-expr.h
gimple.h
predict.h
cfghooks.h
regset.h
alloc-pool.h
timevar.hdf.h
tm_p.h
gimple-iterators.h
stringpool.h
tree-ssa-operands.h
gimple-ssa.h
tree-ssanames.h
tree-phinodes.h
ssa-iterators.h
ssa.h
expmed.h
insn-opinit.h
optabs-query.h
optabs-libfuncs.h
insn-config.h
optabs.h
regs.h
emit-rtl.h
ira.h
recog.h
ira-int.h
streamer-hooks.h
plugin-api.h
gcov-iov.h
gcov-io.h
wide-int-print.h
pretty-print.h
bversion.h
lto-streamer.h
data-streamer.h
tree-streamer.h
gimple-streamer.h


Intentionally commented out?

+
+ def process_ii (filen):
+   return process_include_info (filen, False, False)
+
+ def process_ii_macro (filen):
+   return process_include_info (filen, True, False)
+
+ def process_ii_src (filen):
+   return process_include_info (filen, False, True)
+
+ def process_ii_macro_src (filen):
+   return process_include_info (filen, True, True)
+
+ def ii_base (iinfo):
+   return iinfo[0]
+
+ def ii_path (iinfo):
+   return iinfo[1]
+
+ def ii_include_list (iinfo):
+   return iinfo[2]
+
+ def ii_include_list_cond (iinfo):
+   return iinfo[3]
+
+ def ii_include_list_non_cond (iinfo):
+   l = ii_include_list (iinfo)
+   for n in ii_include_list_cond (iinfo):
+     l.remove (n)
+   return l
+
+ def ii_macro_consume (iinfo):
+   return iinfo[4]
+
+ def ii_macro_define (iinfo):
+   return iinfo[5]
+
+ def ii_src (iinfo):
+   return iinfo[6]
+
+ def ii_src_line (iinfo):
+   return iinfo[7]

That's a lot of little functions with pretty much no clue for thereader what's going on. It looks like maybe there's an array where astruct should have been used?

there once was a large comment at the start of process_include_infodescribing the return value vactor... they simply access it. Im notsure where it went. I will find and put the big comment back in.


Andrew

Re: [patch 4/3] Header file reduction - Tools for contrib

Reply via email to