Re: Distribution and Tomcat
: Based on this information, it doesn't sound as if any failover or LB
: capacity in Tomcat is being used. Is that correct?

Correct, Solr doesn't try to take advantage of any special features in any one servlet container (Tomcat or otherwise).

-Hoss
Re: Faceted Browsing questions
Thank you Chris and Erik. That makes it a bit clearer, but I might need to sit down and look at the code (nines + DisMax...) a bit closer to see how it all works in Solr.

Erik, when do you plan on having your implementation refactored with "good" use of code? Or, in general, when is Solr planning on having this feature out (as I see it on the wiki for near term features)? It might be better for me to wait and see how the group decides to implement it, rather than having something done myself and have to drop it at the end. Plus, you guys probably have the upper hand when it comes to knowing the details of Solr/Lucene, and its reusable features.

Thanks all, and just wanted to say -- I am quite impressed by how Solr is being taken on by the community. It's a solid search API, if it fits your needs.

On 6/23/06, Chris Hostetter <[EMAIL PROTECTED]> wrote:
> : nature of the data I am working with. I have searched through the mailing
> : list and seen some posts mentioning BitSets DocSets, etc.., but wasn't clear
> : on if those are already built into the solr's nightly builds (I don't see
> : any documentation either on the wiki, or online). Can someone please steer me
> : towards the right direction to have the above up in the short time?
>
> You'll want to start with the Solr javadocs, which are linked to from the left nav of every page on the Solr website ("Documentation > API Docs")...
>
> http://incubator.apache.org/solr/docs/api/
>
> The DocSet classes are in fact a core part of Solr. There are some examples in email threads where Erik sent out some code demonstrating how he was doing faceting using BitSets, and I suggested ways he could do things using DocSets ... another good example you can look at is the code for the DisMaxRequestHandler. It doesn't do faceting, but it does use DocSets when dealing with the "fq" (filter query) param. That should be a good place to start.
>
> -Hoss
Re: Faceted Browsing questions
On Jun 24, 2006, at 12:38 PM, Vish D. wrote:
> Erik, when do you plan on having your implementation refactored with "good" use of code?

This weekend :) I have imported more data than my hacked implementation can handle without bumping up Jetty's JVM heap size, so I'm now at the point where it is necessary for me to start using the LRUCache. Though I have already refactored to use OpenBitSet instead of BitSet.

> Or, in general, when is Solr planning on having this feature out (as I see it on the wiki for near term features)? It might be better for me to wait and see how the group decides to implement it, rather than having something done myself and have to drop it at the end. Plus, you guys probably have the higher hand when it comes to knowing the details of Solr/Lucene, and its re-useable features.

The best way for Solr to get this functionality is for those that have implemented it in a custom fashion to get together and generalize it, so that we have a proven architecture that is configurable enough to handle real world situations. My implementation is still being ironed out. And it does rely on custom request handlers to utilize the facets and return back the counts per facet.

Erik
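
For readers following along, here is a minimal sketch of the general idea behind bounding memory with an LRU cache of per-facet bit sets. It is not Solr's LRUCache and not Erik's code: the class name FacetBitSetCache and the "field:value" key format are assumptions made up for illustration, and it uses only java.util classes (plain BitSet rather than OpenBitSet).

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Hypothetical bounded cache mapping a facet value (e.g. "genre:poetry")
 * to the bit set of matching document ids. Illustration only; this is
 * not Solr's LRUCache.
 */
final class FacetBitSetCache extends LinkedHashMap<String, BitSet> {
    private static final long serialVersionUID = 1L;

    private final int maxEntries;

    FacetBitSetCache(int maxEntries) {
        // accessOrder = true: iteration order runs from least recently
        // used to most recently used, which is what LRU eviction needs.
        super(16, 0.75f, true);
        this.maxEntries = maxEntries;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<String, BitSet> eldest) {
        // Called after every put; returning true evicts the least
        // recently used entry once the cache grows past its limit.
        return size() > maxEntries;
    }
}
```

With a limit of 2, putting two entries, reading the first, and then putting a third would evict the second entry, since it is the least recently used one. The trade-off is that an evicted bit set has to be rebuilt from the index the next time that facet value is requested.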
Re: Faceted Browsing questions
On 6/24/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
> This weekend :) I have imported more data than my hacked implementation can handle without bumping up Jetty's JVM heap size, so I'm now at the point where it is necessary for me to start using the LRUCache. Though I have already refactored to use OpenBitSet instead of BitSet.

You can also fit more in mem if you can use DocSet (HashDocSet) for smaller sets. This will also speed up intersection counts. This is done automatically when you get the DocSet from Solr, or if numDocs() is used.

-Yonik
http://incubator.apache.org/solr Solr, the open-source Lucene search server
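
To illustrate why a compact representation for small sets helps with intersection counts, here is a rough sketch in plain Java. It does not use Solr's DocSet or HashDocSet classes; the FacetCounts class and both method names are hypothetical, and the small set is modeled as an int[] of distinct document ids purely as an assumption for the example.

```java
import java.util.BitSet;

/** Hypothetical helper for facet counting; not part of Solr. */
final class FacetCounts {
    private FacetCounts() {}

    /**
     * Counts documents present in both bit sets. Works on a copy so
     * neither argument is modified; cost grows with the size of the
     * bit sets (roughly the index size), not with the number of hits.
     */
    static int intersectionCount(BitSet a, BitSet b) {
        BitSet copy = (BitSet) a.clone();
        copy.and(b);
        return copy.cardinality();
    }

    /**
     * Counts how many of the given document ids are set in the bit set.
     * Assumes the ids are distinct. Cost grows with the number of ids,
     * so a small set is cheap to intersect and cheap to store.
     */
    static int intersectionCount(int[] smallSetDocIds, BitSet big) {
        int count = 0;
        for (int docId : smallSetDocIds) {
            if (big.get(docId)) {
                count++;
            }
        }
        return count;
    }
}
```

For a facet value that matches only a handful of documents, the second form stores a few ints instead of one bit per document in the index, and counting the overlap with the search results takes one lookup per id.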
Re: Faceted Browsing questions
Erik,

Oh good! Keep me (us) updated!!

As for committing some code into Solr, and the real world uses, I am sure we can find some generic/abstract rules for faceted browsing -- simplest being, a set of fields/categories defined in schema.xml, which could be used for an optional extended query response, or a custom/new response by itself. I am also sure that we have at least a couple of other implementations of this feature, which might bring in some good insights in "better" use of code. In any case, I am eager to see this feature "ironed" out on the community level.

Thanks!

On 6/24/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
> On Jun 24, 2006, at 12:38 PM, Vish D. wrote:
> > Erik, when do you plan on having your implementation refactored with "good" use of code?
>
> This weekend :) I have imported more data than my hacked implementation can handle without bumping up Jetty's JVM heap size, so I'm now at the point where it is necessary for me to start using the LRUCache. Though I have already refactored to use OpenBitSet instead of BitSet.
>
> > Or, in general, when is Solr planning on having this feature out (as I see it on the wiki for near term features)? It might be better for me to wait and see how the group decides to implement it, rather than having something done myself and have to drop it at the end. Plus, you guys probably have the higher hand when it comes to knowing the details of Solr/Lucene, and its re-useable features.
>
> The best way for Solr to get this functionality is for those that have implemented it in a custom fashion to get together and generalize it, so that we have a proven architecture that is configurable enough to handle real world situations. My implementation is still being ironed out. And it does rely on custom request handlers to utilize the facets and return back the counts per facet.
>
> Erik