Faceted Browsing questions
Hi all, I am trying to figure out how I can have some type of faceted browsing working. I am also in need of a way to get a list of unique field values within a query's results set (for filtering, etc...). When I say trying, I mean having it up and running without much coding, b/c of time reasons. I would most definitely be involved in some customizing just because of the nature of the data I am working with. I have searched through the mailing list and seen some posts mentioning BitSets DocSets, etc.., but wasn't clear on if those are already built into the solr's nightly builds (I don't see any documentation either on the wiki, or online). Can some please steer me towards the right direction to have the above up in the short time? Thanks a lot! Vish
Re: Faceted Browsing questions
I'm extremely time constrained at the moment, but I'll reply briefly. Solr provides the ground work for making faceted features possible, but out of the box it does not provide it without coding a custom request handler and knowing a little about Lucene and Solr's APIs. As you've seen, bits and pieces have been posted to the list. My project is open-source at the "patacriticism" project at SourceForge, under the "nines" folder in Subversion. Feel free to have a peek there, but its certainly going to change dramatically soon to take better advantage of Solr's caching infrastructure - so take it as a (bad) example for now. Erik On Jun 23, 2006, at 9:33 AM, Vish D. wrote: Hi all, I am trying to figure out how I can have some type of faceted browsing working. I am also in need of a way to get a list of unique field values within a query's results set (for filtering, etc...). When I say trying, I mean having it up and running without much coding, b/c of time reasons. I would most definitely be involved in some customizing just because of the nature of the data I am working with. I have searched through the mailing list and seen some posts mentioning BitSets DocSets, etc.., but wasn't clear on if those are already built into the solr's nightly builds (I don't see any documentation either on the wiki, or online). Can some please steer me towards the right direction to have the above up in the short time? Thanks a lot! Vish
Re: Distribution and Tomcat
Solr itself does not handle failover and load balancing of the slave servers. You can put a load balancer in front of the pool of slave servers for that. You can add and remove slave servers from the pool dynamicly without making any change of the master and other slaves. Bill On 6/23/06, Jeff Rodenburg <[EMAIL PROTECTED]> wrote: I'm reading in earnest the wiki entries regarding collection distribution and the master/slave operational basis. This is very similar to what we've used previously in our operation based on the lucene core. We're deploying initially on Tomcat 5.5.17, Linux RHEL 3 and the latest bits from Solr. My question concerns how Solr handles slave querying & configuration. - How is failover achieved? - How is performance load balancing achieved? thanks, jeff r.
Re: Distribution and Tomcat
: You can put a load balancer in front of the pool of slave servers for that. Solr does have some features designed to make Load Balancing easy * "healthcheck" URLs that your LoadBalancer can query to determine when it should add/remove a server from rotation * a pingQuery which allowing you to control in the solrconfig.xml what query should be executed when a LoadBalancer (or anyone) hits the /admin/ping URL for checkign the response time of various slaves if you want "response time" load balancing. Neither of which seeem to be documented very well in the Wiki... Bill, do you think maybe you could add a little bit on each of these to the SolrConfigXml wiki page? -Hoss
Re: Faceted Browsing questions
: nature of the data I am working with. I have searched through the mailing : list and seen some posts mentioning BitSets DocSets, etc.., but wasn't clear : on if those are already built into the solr's nightly builds (I don't see : any documentation either on the wiki, or online). Can some please steer me : towards the right direction to have the above up in the short time? You'll want to start with the Solr javadocs, which are linked to from the left nav of every page on the Solr website ("Documentation > API Docs")... http://incubator.apache.org/solr/docs/api/ The DocSet classes are in fact a core part of Solr. There are some examples in email threads where Erik sent out some code demonstrating how he was doing faceting using BitSets, and I suggested ways he could do things using DocSets ... another good example you can look at is the code for the DisMaxRequestHandler. It doesn't do faceting, but it does use DocSets when dealing with the "fq" (filter query) param. That should be a good place to start. -Hoss
Re: Distribution and Tomcat
Should the /admin/ping URL work on any Solr instance? i.e., http://solr-server:8080/solr/admin/ping? I'm seeing a 500 error. Based on this information, it doesn't sound as if any failover or LB capacity in Tomcat is being used. Is that correct? thanks, j On 6/23/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : You can put a load balancer in front of the pool of slave servers for that. Solr does have some features designed to make Load Balancing easy * "healthcheck" URLs that your LoadBalancer can query to determine when it should add/remove a server from rotation * a pingQuery which allowing you to control in the solrconfig.xml what query should be executed when a LoadBalancer (or anyone) hits the /admin/ping URL for checkign the response time of various slaves if you want "response time" load balancing. Neither of which seeem to be documented very well in the Wiki... Bill, do you think maybe you could add a little bit on each of these to the SolrConfigXml wiki page? -Hoss
Re: Distribution and Tomcat
O/E on the ping. Just found the adminPing config setting. On 6/23/06, Jeff Rodenburg <[EMAIL PROTECTED]> wrote: Should the /admin/ping URL work on any Solr instance? i.e., http://solr-server:8080/solr/admin/ping? I'm seeing a 500 error. Based on this information, it doesn't sound as if any failover or LB capacity in Tomcat is being used. Is that correct? thanks, j On 6/23/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > : You can put a load balancer in front of the pool of slave servers for > that. > > Solr does have some features designed to make Load Balancing easy > > * "healthcheck" URLs that your LoadBalancer can query to determine > when > it should add/remove a server from rotation > > * a pingQuery which allowing you to control in the solrconfig.xml what > query should be executed when a LoadBalancer (or anyone) hits the > /admin/ping URL for checkign the response time of various slaves if you > want "response time" load balancing. > > Neither of which seeem to be documented very well in the Wiki... > > Bill, do you think maybe you could add a little bit on each of these to > the SolrConfigXml wiki page? > > > -Hoss > >