On Fri, Nov 7, 2008 at 12:49 AM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > Your problem is most likely the time it takes to facet on those > multi-valued fields. > Help is coming within the month I'd estimate, in the form of faster > faceting for multivalued fields where the number of values per > document is low. which is the issue tracking that?
> > Until then, you might be able to get some smaller speedups by trying > the new FastLRUCache in Solr. Download the latest nightly build and > change solr.LRUCache to solr.FastLRUCache in your solrconfig.xml. > > unrelated: you may not want a huge document cache like you have now, > unless you still have enough RAM free for the OS to cache most of the > index files. > > -Yonik > > On Thu, Nov 6, 2008 at 1:18 PM, Cedric Houis <[EMAIL PROTECTED]> wrote: >> >> Hi Solr's users, and first of all, sorry for my bad english =^D !!! >> >> We are experimenting Solr with the intention of using it in a web site. >> >> To test solr's performance, we have made a little program that simulates >> users on the web site. >> This program starts N threads (users), with 0.5 second pause between each >> start. >> Each thread makes a query, and wait between 2 and 10 seconds before make >> another query. >> Each query is randomly created with a list of 1500 words. >> >> When we simulate 1 user, a query takes about 0.3 second to execute. >> When we simulate 10 users, a query takes about 2 second to execute. >> When we simulate 50 users, a query takes about 60 seconds to execute. >> >> When we simulate 6*(10 users) with 6 instances of Solr on the same machine, >> a query takes 8 seconds to execute. >> >> You probably see where my problem is… >> >> Here is our configuration : >> >> Hardware : >> >> Debian etch, quad core, 6Go ram, 10000 rpm disk, jetty. >> Windows 2003, quad core, 4Go ram, 10000 rpm disk, jetty and tomcat. >> >> Schema : >> >> <field name="searchable" type="boolean" indexed="true" >> stored="true"/> >> <field name="publishdate" type="date" indexed="true" stored="true"/> >> <field name="expirationdate" type="date" indexed="true" >> stored="true"/> >> <field name="site" type="string" indexed="true" stored="true"/> >> <field name="section" type="text" indexed="true" stored="false" >> multiValued="true"/> >> <field name="byline" type="text" indexed="true" stored="false" >> multiValued="true"/> >> <field name="workflowstate" type="string" indexed="true" >> stored="false" multiValued="true"/> >> >> <field name="nDocumentId" type="string" indexed="true" >> stored="true"/> >> <field name="nLanguage" type="string" indexed="true" >> stored="false"/> >> >> <field name="nFullText" type="text" indexed="true" stored="false" >> termVectors="true"/> >> <field name="nObjectXml" type="string" indexed="false" stored="true" >> multiValued="false"/> >> >> <field name="nGeographicLocations" type="string" indexed="true" >> stored="false" multiValued="true"/> >> <field name="nOrganizationNames" type="string" indexed="true" >> stored="false" multiValued="true"/> >> <field name="nPersonNames" type="string" indexed="true" >> stored="false" multiValued="true"/> >> <field name="nCategories" type="string" indexed="true" >> stored="false" multiValued="true"/> >> <field name="nSimpleConcepts" type="string" indexed="true" >> stored="false" multiValued="true"/> >> <field name="nComplexConcepts" type="string" indexed="true" >> stored="false" multiValued="true"/> >> <field name="nSummary" type="text_ws" indexed="true" >> stored="false"/> >> >> Solr config : >> >> <filterCache class="solr.LRUCache" size="1048576" >> initialSize="4096" >> autowarmCount="1024"/> >> <queryResultCache class="solr.LRUCache" size="16384" >> initialSize="4096" >> autowarmCount="128"/> >> <documentCache class="solr.LRUCache" size="1048576" >> initialSize="512" >> autowarmCount="0"/> >> >> <listener event="newSearcher" class="solr.QuerySenderListener"> >> <arr name="queries"> >> <lst> >> <str name="q">*:*</str> >> <str name="fq">searchable:true</str> >> <str name="sort">publishdate asc</str> >> </lst> >> <lst> >> <str name="q">*:*</str> >> <str name="facet.field">nPersonNames</str> >> <str name="facet.field">nOrganizationNames</str> >> <str name="facet.field">nGeographicLocations</str> >> <str name="facet.field">nCategories</str> >> <str name="fq">searchable:true</str> >> </lst> >> </arr> >> </listener> >> <listener event="firstSearcher" class="solr.QuerySenderListener"> >> <arr name="queries"> >> <lst> >> <str name="q">*:*</str> >> <str name="fq">searchable:true</str> >> <str name="sort">publishdate asc</str> >> </lst> >> <lst> >> <str name="q">*:*</str> >> <str name="facet.field">nPersonNames</str> >> <str name="facet.field">nOrganizationNames</str> >> <str name="facet.field">nGeographicLocations</str> >> <str name="facet.field">nCategories</str> >> <str name="fq">searchable:true</str> >> </lst> >> </arr> >> </listener> >> >> Data : >> >> 367380 documents >> >> nGeographicLocations : 39298 distincts values >> nPersonNames : 325142 distincts values >> nOrganizationNames : 130681 distincts values >> nCategories : 929 distincts values >> nSimpleConcepts : 110198 distincts values >> nComplexConcepts : 1508141 distincts values >> Each of those fields has a maximum of 7 values per document. >> >> Query : >> >> http://olympe:1006/solr/select >> ?sort=publishdate+asc >> &facet=true >> &facet.field=nPersonNames >> &facet.field=nOrganizationNames >> &facet.field=nGeographicLocations >> &facet.field=nCategories >> &f.nPersonNames.facet.limit=10 >> &f.nOrganizationNames.facet.limit=10 >> &f.nGeographicLocations.facet.limit=10 >> &f.nCategories.facet.limit=10 >> &version=2.2 >> &wt=json >> &json.nl=map >> &q=nFullText%3A"Random words"+AND+searchable%3Atrue >> &start=0 >> &rows=10 >> >> Every help will be appreciated because I don't know how to optimize the >> query time response…:-( >> >> Regards, >> >> Cédric Houis >> -- >> View this message in context: >> http://www.nabble.com/Very-bad-performance-tp20366783p20366783.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> >> > -- --Noble Paul