Re: Distribution and Tomcat

2006-06-24 Thread Chris Hostetter

: Based on this information, it doesn't sound as if any failover or LB
: capacity in Tomcat is being used.  Is that correct?

Correct, Solr doesn't try to take advantage of any special features in any
one servlet container (Tomcat or otherwise).



-Hoss



Re: Faceted Browsing questions

2006-06-24 Thread Vish D.

Thank you Chris and Erik. That makes it a bit clearer, but I might need to
sit down and look at the code (nines + DisMax...) a bit closer to see how it
all works in Solr.

Erik, when do you plan on having your implementation refactored with "good"
use of code? Or, in general, when is Solr planning on having this feature
out (as I see it on the wiki for near term features)? It might be better for
me to wait and see how the group decides to implement it, rather than having
something done myself and have to drop it at the end. Plus, you guys
probably have the higher hand when it comes to knowing the details of
Solr/Lucene, and its re-useable features.

Thanks all, and just wanted to say -- I am quite impressed by how Solr is
being taken on by the community. It's a solid search API, if it fits your
needs.

On 6/23/06, Chris Hostetter <[EMAIL PROTECTED]> wrote:



: nature of the data I am working with. I have searched through the mailing
: list and seen some posts mentioning BitSets, DocSets, etc., but wasn't clear
: on whether those are already built into Solr's nightly builds (I don't see
: any documentation either on the wiki or online). Can someone please steer me
: in the right direction to get the above up in a short time?

You'll want to start with the Solr javadocs, which are linked to from the
left nav of every page on the Solr website ("Documentation > API Docs")...

http://incubator.apache.org/solr/docs/api/

The DocSet classes are in fact a core part of Solr.

There are some examples in email threads where Erik sent out some code
demonstrating how he was doing faceting using BitSets, and I suggested
ways he could do things using DocSets ... another good example you can
look at is the code for the DisMaxRequestHandler.  It doesn't do faceting,
but it does use DocSets when dealing with the "fq" (filter query) param.

That should be a good place to start.
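
To make the idea of faceting with document sets concrete, here is a minimal sketch that counts facet values by intersecting the current result set with one precomputed set per facet value. It uses plain java.util.BitSet rather than Solr's DocSet classes, and the class name, method names, and sample genre values are all hypothetical, chosen only for illustration.

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: plain JDK types standing in for Solr's DocSet classes.
public class FacetCountSketch {

    // For each facet value, count the documents that are both in the
    // current result set and tagged with that value.
    static Map<String, Integer> facetCounts(BitSet resultDocs,
                                            Map<String, BitSet> docsByFacetValue) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Map.Entry<String, BitSet> e : docsByFacetValue.entrySet()) {
            BitSet overlap = (BitSet) resultDocs.clone(); // copy so resultDocs is not modified
            overlap.and(e.getValue());                    // set intersection
            counts.put(e.getKey(), overlap.cardinality());
        }
        return counts;
    }

    // Helper: build a BitSet from a list of document ids.
    static BitSet bits(int... docIds) {
        BitSet b = new BitSet();
        for (int id : docIds) {
            b.set(id);
        }
        return b;
    }

    public static void main(String[] args) {
        Map<String, BitSet> byGenre = new LinkedHashMap<>();
        byGenre.put("poetry", bits(0, 2, 4, 6));
        byGenre.put("fiction", bits(1, 2, 3));

        BitSet results = bits(2, 3, 4, 5); // documents matching the user's query
        System.out.println(facetCounts(results, byGenre));
    }
}
```

Cloning the result set for every facet value is simple but allocates a lot; an intersection count that avoids building the intermediate set would be the natural next step.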


-Hoss




Re: Faceted Browsing questions

2006-06-24 Thread Erik Hatcher


On Jun 24, 2006, at 12:38 PM, Vish D. wrote:
> Erik, when do you plan on having your implementation refactored with "good"
> use of code?


This weekend :)   I have imported more data than my hacked  
implementation can handle without bumping up Jetty's JVM heap size,  
so I'm now at the point where it is necessary for me to start using  
the LRUCache.  Though I have already refactored to use OpenBitSet  
instead of BitSet.
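
As a rough illustration of why an LRU cache helps with heap pressure here, the sketch below bounds the number of cached per-facet bit sets and evicts the least recently used one. It is built on java.util.LinkedHashMap and is not Solr's LRUCache class; the class name, cache keys, and size limit are assumptions made up for the example.

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: a JDK-based stand-in, not Solr's LRUCache.
public class LruSketch {

    // A bounded map that evicts the least recently accessed entry
    // once it holds more than maxEntries entries.
    static class BoundedLru<K, V> extends LinkedHashMap<K, V> {
        private final int maxEntries;

        BoundedLru(int maxEntries) {
            super(16, 0.75f, true); // accessOrder=true: get() marks an entry as most recent
            this.maxEntries = maxEntries;
        }

        @Override
        protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
            return size() > maxEntries;
        }
    }

    public static void main(String[] args) {
        BoundedLru<String, BitSet> cache = new BoundedLru<>(2);
        cache.put("genre:poetry", new BitSet());
        cache.put("genre:fiction", new BitSet());
        cache.get("genre:poetry");               // touch poetry, so fiction is now the eldest
        cache.put("genre:drama", new BitSet());  // exceeds the limit and evicts genre:fiction
        System.out.println(cache.keySet());
    }
}
```

An evicted set has to be recomputed on its next use, so the limit trades heap size against repeated work.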



> Or, in general, when is Solr planning on having this feature
> out (as I see it on the wiki for near term features)? It might be better for
> me to wait and see how the group decides to implement it, rather than having
> something done myself and have to drop it at the end. Plus, you guys
> probably have the higher hand when it comes to knowing the details of
> Solr/Lucene, and its re-useable features.


The best way for Solr to get this functionality is for those that  
have implemented it in a custom fashion to get together and  
generalize it, so that we have a proven architecture that is  
configurable enough to handle real world situations.  My  
implementation is still being ironed out.  And it does rely on custom  
request handlers to utilize the facets and return back the counts per  
facet.


Erik




Re: Faceted Browsing questions

2006-06-24 Thread Yonik Seeley

On 6/24/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:

> This weekend :)   I have imported more data than my hacked
> implementation can handle without bumping up Jetty's JVM heap size,
> so I'm now at the point where it is necessary for me to start using
> the LRUCache.  Though I have already refactored to use OpenBitSet
> instead of BitSet.


You can also fit more in memory if you can use DocSet (HashDocSet) for
smaller sets.  This will also speed up intersection counts.  This is
done automatically when you get the DocSet from Solr, or if numDocs()
is used.
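
The trade-off can be sketched without any Solr classes. In the hypothetical example below, a small set is kept as a plain int array of document ids and a large set as a java.util.BitSet; this is not how HashDocSet is implemented, and the names and sizes are assumptions for illustration only. The intersection count walks only the small set, and the size estimates show why a handful of ids is far cheaper than a bit per document in the index.

```java
import java.util.BitSet;

// Illustrative only: an int array stands in for a small document set.
public class SmallSetSketch {

    // Intersection size of a small id list with a large bit set.
    // The cost is proportional to the small set, not to the index size.
    static int intersectionCount(int[] smallSet, BitSet largeSet) {
        int count = 0;
        for (int docId : smallSet) {
            if (largeSet.get(docId)) {
                count++;
            }
        }
        return count;
    }

    // Rough payload sizes in bytes, ignoring object headers.
    static long bitSetBytes(int maxDoc) {
        return (maxDoc + 7L) / 8; // one bit per document in the index
    }

    static long intListBytes(int size) {
        return 4L * size;         // one 32-bit int per member
    }

    public static void main(String[] args) {
        BitSet large = new BitSet(1_000_000);
        large.set(0, 500_000); // documents 0..499999 are members

        int[] small = {10, 499_999, 500_000, 750_000};

        System.out.println(intersectionCount(small, large));
        System.out.println(bitSetBytes(1_000_000) + " vs " + intListBytes(small.length));
    }
}
```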


-Yonik
http://incubator.apache.org/solr Solr, the open-source Lucene search server


Re: Faceted Browsing questions

2006-06-24 Thread Vish D.

Erik,

Oh good! Keep me (us) updated!!

As for committing some code into Solr, and the real world uses, I am sure we
can find some generic/abstract rules for faceted browsing -- the simplest being
a set of fields/categories defined in schema.xml, which could be used for an
optional extended query response, or a custom/new response by itself.

I am also sure that there are at least a couple of other implementations of this
feature, which might bring some good insights into "better" use of code. In
any case, I am eager to see this feature "ironed" out on the community
level.

Thanks!

