[ https://issues.apache.org/jira/browse/MNG-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17932600#comment-17932600 ]
Guillaume Nodet commented on MNG-7592: -------------------------------------- [~laeubi] here's a list of of fields which are now intern'ed: https://github.com/apache/maven/pull/2083/files#diff-a02184e10bbc00b692f818ad6cc385ed0b428c8d83d163917a0729249fc2a5adR2037-R2047 Basically, groupId, artifactId, versions and various enums. > String deduplication in model building > -------------------------------------- > > Key: MNG-7592 > URL: https://issues.apache.org/jira/browse/MNG-7592 > Project: Maven > Issue Type: Improvement > Reporter: Christoph Läubrich > Assignee: Guillaume Nodet > Priority: Major > Labels: easytask > Fix For: 4.0.0-rc-3 > > > I currently investigate improving memory consumption in m2eclipse (maven ide > extension) and noticed that one problem is that maven model seem to not > deduplicate strings, so for large projects (I used apache camel as an > example), there are a lot of duplicate strings hanging around, e.g. I see > 12.000 instances of "org.apache.maven.plugins" or around 10.000 of > "org.apache.camel" (please note that probably not all related to maven!). > If I look at the Graph of incoming references I see for example that these > are from Model/Artifact groupId. > I know that string deduplication in general is hard and even controversial, > but maybe one could think about such thing at least for the "hotsposts", e,g, > groupId, artifactId and version or even managementKeys seem good candidates > to be considered for such thing as these are used all over the place. -- This message was sent by Atlassian Jira (v8.20.10#820010)