Faceted Browsing questions

2006-06-23 Thread Vish D.

Hi all,

I am trying to figure out how I can get some type of faceted browsing
working. I also need a way to get a list of unique field values within a
query's result set (for filtering, etc.). When I say trying, I mean having
it up and running without much coding, because of time constraints. I will
most definitely be doing some customizing, just because of the nature of
the data I am working with. I have searched through the mailing list and
seen some posts mentioning BitSets, DocSets, etc., but it wasn't clear
whether those are already built into Solr's nightly builds (I don't see
any documentation on the wiki or elsewhere online). Can someone please
steer me in the right direction to get the above up and running in a short
time?

Thanks a lot!

Vish


Re: Faceted Browsing questions

2006-06-23 Thread Erik Hatcher
I'm extremely time constrained at the moment, but I'll reply briefly.
Solr provides the groundwork for making faceted features possible, but
out of the box it does not provide them without coding a custom request
handler and knowing a little about Lucene's and Solr's APIs.  As you've
seen, bits and pieces have been posted to the list.  My project is
open source, in the "patacriticism" project at SourceForge, under the
"nines" folder in Subversion.  Feel free to have a peek there, but it's
certainly going to change dramatically soon to take better advantage of
Solr's caching infrastructure, so take it as a (bad) example for now.


Erik


Re: Distribution and Tomcat

2006-06-23 Thread Bill Au

Solr itself does not handle failover and load balancing of the slave
servers.  You can put a load balancer in front of the pool of slave
servers for that.  You can add and remove slave servers from the pool
dynamically without making any changes to the master or the other slaves.
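
To illustrate the idea of a pool that slaves can join and leave at
runtime, here is a minimal round-robin sketch in Java.  It is not part of
Solr and does not describe any particular load balancer product; the class
name SlavePool and its methods are hypothetical, and a real deployment
would normally use a dedicated load balancer rather than client-side code.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical round-robin pool of slave base URLs; not a Solr class. */
class SlavePool {
    // Copy-on-write list so slaves can be added or removed while requests are routed.
    private final List<String> slaves = new CopyOnWriteArrayList<>();
    private final AtomicInteger counter = new AtomicInteger();

    /** Puts a slave into rotation. The master and other slaves are not touched. */
    void add(String baseUrl) {
        slaves.add(baseUrl);
    }

    /** Takes a slave out of rotation. */
    void remove(String baseUrl) {
        slaves.remove(baseUrl);
    }

    /** Returns the next slave in rotation, or null if the pool is empty. */
    String next() {
        Object[] snapshot = slaves.toArray();
        if (snapshot.length == 0) {
            return null;
        }
        int i = Math.floorMod(counter.getAndIncrement(), snapshot.length);
        return (String) snapshot[i];
    }
}
```

Each query would call next() to choose a slave; adding or removing an
entry only changes the pool, which mirrors the point that neither the
master nor the other slaves need any configuration change.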

Bill

On 6/23/06, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:


I'm reading in earnest the wiki entries regarding collection distribution
and the master/slave operational basis.  This is very similar to what
we've used previously in our operation based on the lucene core.  We're
deploying initially on Tomcat 5.5.17, Linux RHEL 3 and the latest bits
from Solr.

My question concerns how Solr handles slave querying & configuration.

- How is failover achieved?
- How is performance load balancing achieved?


thanks,
jeff r.




Re: Distribution and Tomcat

2006-06-23 Thread Chris Hostetter

: You can put a load balancer in front of the pool of slave servers for that.

Solr does have some features designed to make load balancing easy:

  * "healthcheck" URLs that your load balancer can query to determine when
    it should add/remove a server from rotation

  * a pingQuery setting that lets you control in solrconfig.xml what query
    is executed when a load balancer (or anyone) hits the /admin/ping URL,
    for checking the response time of the various slaves if you want
    "response time" load balancing.

Neither of these seems to be documented very well in the wiki...

Bill, do you think maybe you could add a little bit on each of these to
the SolrConfigXml wiki page?
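
As a rough sketch of how a "response time" check against /admin/ping
might look from the load balancer's side, the Java below times a GET
request and then picks the fastest slave that answered.  The class
PingProbe, its methods, and the rule that only HTTP 200 counts as healthy
are assumptions made for illustration; only the /admin/ping path comes
from the discussion above.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.List;

/** Hypothetical health probe for Solr slaves; not a Solr class. */
class PingProbe {

    /** Outcome of one probe: HTTP status (-1 on I/O failure) and elapsed milliseconds. */
    static final class Result {
        final String baseUrl;
        final int status;
        final long millis;

        Result(String baseUrl, int status, long millis) {
            this.baseUrl = baseUrl;
            this.status = status;
            this.millis = millis;
        }

        /** Assumption: anything other than HTTP 200 is treated as unhealthy. */
        boolean healthy() {
            return status == 200;
        }
    }

    /** Requests baseUrl + "/admin/ping" and records the status and response time. */
    static Result probe(String baseUrl, int timeoutMillis) {
        long start = System.nanoTime();
        int status = -1;
        try {
            HttpURLConnection conn =
                (HttpURLConnection) new URL(baseUrl + "/admin/ping").openConnection();
            conn.setConnectTimeout(timeoutMillis);
            conn.setReadTimeout(timeoutMillis);
            status = conn.getResponseCode();
            conn.disconnect();
        } catch (IOException e) {
            // Unreachable or timed out: leave status at -1 so the slave counts as unhealthy.
        }
        long elapsed = (System.nanoTime() - start) / 1_000_000;
        return new Result(baseUrl, status, elapsed);
    }

    /** Returns the healthy result with the lowest response time, or null if none is healthy. */
    static Result fastestHealthy(List<Result> results) {
        Result best = null;
        for (Result r : results) {
            if (r.healthy() && (best == null || r.millis < best.millis)) {
                best = r;
            }
        }
        return best;
    }
}
```

A caller might run PingProbe.probe("http://solr-server:8080/solr", 2000)
for each slave and route the next query to whatever fastestHealthy
returns.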


-Hoss



Re: Faceted Browsing questions

2006-06-23 Thread Chris Hostetter

: nature of the data I am working with. I have searched through the mailing
: list and seen some posts mentioning BitSets DocSets, etc.., but wasn't clear
: on if those are already built into the solr's nightly builds (I don't see
: any documentation either on the wiki, or online). Can some please steer me
: towards the right direction to have the above up in the short time?

You'll want to start with the Solr javadocs, which are linked to from the
left nav of every page on the Solr website ("Documentation > API Docs")...

http://incubator.apache.org/solr/docs/api/

The DocSet classes are in fact a core part of Solr.

There are some examples in email threads where Erik sent out some code
demonstrating how he was doing faceting using BitSets, and I suggested
ways he could do things using DocSets ... another good example you can
look at is the code for the DisMaxRequestHandler.  It doesn't do faceting,
but it does use DocSets when dealing with the "fq" (filter query) param.

That should be a good place to start.
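
The general idea behind set-based faceting can be sketched with plain
java.util.BitSet: intersect the set of documents matching the query with
the set of documents carrying each field value, and count the overlap.
This is an illustration only.  It is neither Erik's code nor Solr's
DocSet API, and the class BitSetFacets and its count method are
hypothetical names.

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;

/** Illustration only: uses java.util.BitSet, not Solr's DocSet classes. */
class BitSetFacets {

    /**
     * Counts, for each field value, how many documents in the query result carry it.
     * Values with a zero count are omitted, so the keys of the returned map are the
     * unique field values present in the result set.
     *
     * @param queryHits   bit i is set if document i matches the query
     * @param docsByValue for each field value, the documents that have that value
     */
    static Map<String, Integer> count(BitSet queryHits, Map<String, BitSet> docsByValue) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Map.Entry<String, BitSet> e : docsByValue.entrySet()) {
            // Clone first: BitSet.and() modifies the receiver in place.
            BitSet overlap = (BitSet) queryHits.clone();
            overlap.and(e.getValue());
            int n = overlap.cardinality();
            if (n > 0) {
                counts.put(e.getKey(), n);
            }
        }
        return counts;
    }
}
```

The per-value sets do not depend on the query, so they can be built once
and reused across requests.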


-Hoss



Re: Distribution and Tomcat

2006-06-23 Thread Jeff Rodenburg

Should the /admin/ping URL work on any Solr instance, e.g.
http://solr-server:8080/solr/admin/ping?  I'm seeing a 500 error.
Based on this information, it doesn't sound as if any failover or LB
capability in Tomcat is being used.  Is that correct?

thanks,
j


Re: Distribution and Tomcat

2006-06-23 Thread Jeff Rodenburg

O/E on the ping.  Just found the adminPing config setting.
