If your subentities are large, the default DIH Cache probably isn't going to work because it stores all the data in-memory. (This is CachedSQLEntityProcessor for Solr 3.5 or earlier ; cacheImpl="SortedMapBackedCache" for 3.6 or later)
DIH for Solr 3.6 and later supports pluggable caches (see https://issues.apache.org/jira/browse/SOLR-2382), so you have the option of caching to disk. Unfortunately the only good disk-backed cache available here uses Berkley Database, which has an incompatible license and cannot be included with an Apache project. See https://issues.apache.org/jira/browse/SOLR-2613 for the code ; you'll have to download bdb-je from Oracle yourself. We also converted from Endeca, and needed these cache options to replace the Forge Cache feature which we depended on heavily for joins. It was a lot of work to set this up with DIH and getting everything to work correctly but the end result for us is actually a lot faster (and way more flexible) than Forge ever was. By the way, there have been sporatic reports of unexpected behavior using Caching with 3.6. You may want to try 4.0 if you're currently running 3.6. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -----Original Message----- From: mroosendaal [mailto:mroosend...@yahoo.com] Sent: Monday, October 29, 2012 5:06 AM To: solr-user@lucene.apache.org Subject: Re: DIH nested entities don't work Hi, It seems to work without the cache option, the downside is it will takes ages for everything to be indexed and my testset is 20 times smaller than the productset. Indexing just the root item takes 3 minutes (>600K) but every subentity takes more time which is obvious but i would've hoped it would at least be faster. Our current searchengine (Endeca) does the same thing but takes 'only' 1h20m. How can i speed this up, the bottleneck is not the CPU or memory, but simply the databasetime. Thanks, Maarten -- View this message in context: http://lucene.472066.n3.nabble.com/DIH-nested-entities-don-t-work-tp4015514p4016618.html Sent from the Solr - User mailing list archive at Nabble.com.