If it's really the interned strings, you could try upgrading the JDK, as newer HotSpot
JVMs put interned strings in the regular heap.
I haven't had a chance to look into the new core auto-discovery code, so I don't know
whether it's implemented with reflection or not. Reflection and dynamic class loading are
another common source of PermGen exceptions, in my experience.
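To make the interning failure mode concrete, here is a minimal sketch (a hypothetical class, not Solr code): run with something like -XX:MaxPermSize=64M on a pre-JDK7 HotSpot JVM, it should eventually die with java.lang.OutOfMemoryError: PermGen space, because intern() copies each distinct string into the PermGen-resident string table; on JDK 7+ the same loop pressures the regular heap instead:

    import java.util.ArrayList;
    import java.util.List;

    public class InternPressure {
        public static void main(String[] args) {
            // Hold references so the interned strings stay reachable,
            // mimicking Lucene keeping a distinct interned field name
            // per dynamic field per core.
            List<String> pinned = new ArrayList<String>();
            long i = 0;
            while (true) {
                // each iteration interns a brand-new, never-seen string
                pinned.add(("field_" + i++).intern());
            }
        }
    }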
I don't see anything wrong with your JVM config, which is very much standard.
Hope this helps,
Tri
On Mar 03, 2014, at 08:52 AM, Josh <jwda...@gmail.com> wrote:
In the user core there are two fields; the database core in question had
40, but in production environments the database core's field set is dynamic. My time
has been pretty crazy trying to get this out the door, and we haven't tried
a standard Solr install yet, but it's on my plate for the test app. I
don't know enough about Solr/Bitnami to know whether they've done any serious
modifications to it.
I had tried doing a dump from VisualVM previously, but it didn't seem to
give me anything useful; then again, I didn't know how to look for
interned strings. This is something I can take another look at in the
coming weeks when I do my test case against a standard Solr install with
SolrJ. The exception with user cores happens after roughly 80 runs, so around 640
user cores, with PermGen set to 64MB. The database core test hit it far
sooner, in the 10-15 run range. As a note, once the PermGen limit is hit,
if we simply restart the service with the same number of cores loaded, the
PermGen usage is minimal, even with the number of user cores being high in
our production environment (500-600).
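If it helps with the VisualVM digging, here is a small, hedged sketch (hypothetical class name, nothing Solr-specific) that polls the JVM's own memory pool beans; run between create/unload cycles, it shows whether unloads actually release PermGen:

    import java.lang.management.ManagementFactory;
    import java.lang.management.MemoryPoolMXBean;

    public class PermGenMonitor {
        public static void main(String[] args) {
            // Iterate the JVM's memory pools and print the PermGen one.
            // With the CMS collector (as in the options quoted below) the
            // pool is named "CMS Perm Gen"; other collectors call it
            // "Perm Gen" or "PS Perm Gen".
            for (MemoryPoolMXBean pool : ManagementFactory.getMemoryPoolMXBeans()) {
                if (pool.getName().contains("Perm")) {
                    System.out.printf("%s: used=%,d max=%,d%n",
                            pool.getName(),
                            pool.getUsage().getUsed(),
                            pool.getUsage().getMax());
                }
            }
        }
    }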
If this does end up being the interning of strings, is there any way it can
be mitigated? Our production environment for our heavier users would see in
the range of 3,200+ user cores created per day.
Thanks for the help.
Josh
On Mon, Mar 3, 2014 at 11:24 AM, Tri Cao <tm...@me.com> wrote:
Hey Josh,

I am not an expert in Java performance, but I would start with dumping the heap
and investigating with VisualVM (the free tool that comes with the JDK). In my
experience, the most common cause of PermGen exceptions is the app creating too
many interned strings. Solr (actually Lucene) interns the field names, so if you
have too many fields, that might be the cause. How many fields in total across
cores did you create before the exception?

Can you reproduce the problem with standard Solr? Is the Bitnami distribution
just Solr, or do they bundle other libraries?

Hope this helps,
Tri

On Mar 03, 2014, at 07:28 AM, Josh <jwda...@gmail.com> wrote:

It's a Windows installation using a Bitnami Solr installer. I incorrectly
put 64M into the configuration for this, as I had copied the test
configuration I was using to recreate the PermGen issue we were seeing on
our production system (that is configured to 512M), since it takes a while
to recreate the issue with larger PermGen values. In the test scenario
there was a small, static 180-document data core with 8 dynamic user
cores that are used to index the unique document ids in the user's view,
which are then merged into a single user core. The final user core contains
the same number of document ids as the data core, and the data core is
queried against with the ids in the final merged user core as the limiter.
The user cores are then unloaded and deleted from the drive, and then the
test is re-run with the user cores re-created.

We are also using core discovery mode to store/find our cores, and the
database data core is using dynamic fields with a mix of single-value and
multi-value fields. The user cores use a static configuration. The data is
indexed from SQL Server using jTDS for both the user and data cores. As a
note, we also reversed the test case I mention above, keeping the user
cores static and dynamically creating the database core, and this created the
same issue, only it leaked faster. We assume this is because the configuration
was larger/loaded more classes than the simpler user core.

When I get the time I'm going to put together a SolrJ test app to recreate
the issue outside of our environment, to see if others see the same issue
we're seeing and to rule out any kind of configuration problem (see the
sketch at the end of this thread). Right now we're interacting with Solr
via the RESTful interface from POCO, and it's not very easy for us to spin
this off into something someone else could use. In the meantime we've made
changes to make the user cores more static; this has slowed the build-up of
PermGen to something that can be managed by a weekly reset.

Sorry about the confusion in my initial email, and I appreciate the
response. Anything about my configuration that you think might be useful,
just let me know and I can provide it. We have a workaround, but it really
hampers what our long-term goals were for our Solr implementation.

Thanks
Josh

On Mon, Mar 3, 2014 at 9:57 AM, Greg Walters <greg.walt...@answers.com> wrote:

Josh,

You've mentioned a couple of times that you've got PermGen set to 512M, but
then you say you're running with -XX:MaxPermSize=64M. These two statements
are contradictory, so are you *sure* that you're running with 512M of
PermGen? Assuming you're on a *nix box, can you provide `ps` output proving
this?

Thanks,
Greg

On Feb 28, 2014, at 5:22 PM, Furkan KAMACI <furkankam...@gmail.com> wrote:

> Hi;
>
> You can also check here:
> http://stackoverflow.com/questions/3717937/cmspermgensweepingenabled-vs-cmsclassunloadingenabled
>
> Thanks;
> Furkan KAMACI
>
> 2014-02-26 22:35 GMT+02:00 Josh <jwda...@gmail.com>:
>
>> Thanks Timothy,
>>
>> I gave these a try and -XX:+CMSPermGenSweepingEnabled seemed to cause the
>> error to happen more quickly. With this option on, it didn't seem to do
>> any intermittent garbage collecting that delayed the issue as with it off.
>> I was already using a max of 512MB, and I can reproduce it with it set this
>> high or even higher. Right now, because of how we have this implemented,
>> increasing it to something high just delays the problem :/
>>
>> Anything else you could suggest I would really appreciate.
>>
>> On Wed, Feb 26, 2014 at 3:19 PM, Tim Potter <tim.pot...@lucidworks.com>
>> wrote:
>>
>>> Hi Josh,
>>>
>>> Try adding: -XX:+CMSPermGenSweepingEnabled as I think for some VM
>>> versions, permgen collection was disabled by default.
>>>
>>> Also, I use: -XX:MaxPermSize=512m -XX:PermSize=256m with Solr, so 64M may
>>> be too small.
>>>
>>> Timothy Potter
>>> Sr. Software Engineer, LucidWorks
>>> www.lucidworks.com
>>>
>>> ________________________________________
>>> From: Josh <jwda...@gmail.com>
>>> Sent: Wednesday, February 26, 2014 12:27 PM
>>> To: solr-user@lucene.apache.org
>>> Subject: Solr Permgen Exceptions when creating/removing cores
>>>
>>> We are using the Bitnami version of Solr 4.6.0-1 on a 64-bit Windows
>>> installation with 64-bit Java 1.7u51 and we are seeing consistent issues
>>> with PermGen exceptions. We have PermGen configured to be 512MB.
>>> Bitnami ships with a 32-bit version of Java for Windows and we are
>>> replacing it with a 64-bit version.
>>>
>>> Passed-in Java options:
>>>
>>> -XX:MaxPermSize=64M
>>> -Xms3072M
>>> -Xmx6144M
>>> -XX:+UseParNewGC
>>> -XX:+UseConcMarkSweepGC
>>> -XX:CMSInitiatingOccupancyFraction=75
>>> -XX:+CMSClassUnloadingEnabled
>>> -XX:NewRatio=3
>>> -XX:MaxTenuringThreshold=8
>>>
>>> This is our use case:
>>>
>>> We have what we call a database core which remains fairly static and
>>> contains the imported contents of a table from SQL Server. We then have
>>> user cores which contain the record ids of results from a text search
>>> outside of Solr. We then query for the data we want from the database core
>>> and limit the results to the content of the user core. This allows us to
>>> combine facet data from Solr with the search results from another engine.
>>> We are creating the user cores on demand and removing them when the user
>>> logs out.
>>>
>>> Our issue is that the constant creation and removal of user cores, combined
>>> with the constant importing, seems to push us over our PermGen limit. The
>>> user cores are removed at the end of every session, and as a test I made an
>>> application that would loop creating a user core, import a set of data to
>>> it, query the database core using it as a limiter, and then remove the user
>>> core. My expectation was that in this scenario all the PermGen associated
>>> with that user core would be freed upon its unload, allowing PermGen to
>>> reclaim that memory during a garbage collection. This was not the case; it
>>> would constantly go up until the application exhausted the memory.
>>>
>>> I also investigated whether there was a connection between the two
>>> cores left behind because I was joining them together in a query, but even
>>> unloading the database core after unloading all the user cores won't
>>> prevent the limit from being hit or any memory from being garbage collected
>>> from Solr.
>>>
>>> Is this a known issue with creating and unloading a large number of cores?
>>> Could it be configuration-based for the core? Is there something other than
>>> unloading that needs to happen to free the references?
>>>
>>> Thanks
>>>
>>> Notes: I've tried using tools to determine if it's a leak within Solr, such
>>> as Plumbr, and my activities turned up nothing.
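For reference, a minimal SolrJ sketch of the create/unload loop described above, assuming a Solr 4.6 server at http://localhost:8983/solr; the class name, the "user_core_template" instanceDir, and the iteration count are placeholders, not part of the original setup:

    import org.apache.solr.client.solrj.SolrServer;
    import org.apache.solr.client.solrj.impl.HttpSolrServer;
    import org.apache.solr.client.solrj.request.CoreAdminRequest;

    public class CoreChurnTest {
        public static void main(String[] args) throws Exception {
            // Admin-level client (SolrJ 4.x API); core-specific requests
            // would target http://localhost:8983/solr/<coreName> instead.
            SolrServer admin = new HttpSolrServer("http://localhost:8983/solr");
            try {
                for (int i = 0; i < 1000; i++) {
                    String name = "user_core_" + i;
                    // "user_core_template" is a hypothetical instanceDir
                    // holding the static user-core config described above
                    CoreAdminRequest.createCore(name, "user_core_template", admin);
                    // ... index ids into the user core and query the database
                    // core with it as the limiter, as in the use case above ...
                    CoreAdminRequest.unloadCore(name, admin);
                    // watch PermGen in VisualVM here; if usage climbs
                    // monotonically across iterations, the leak reproduces
                    // outside the Bitnami stack
                }
            } finally {
                admin.shutdown();
            }
        }
    }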