Yes, it seems like I don't need to split. I could use different commit
times. In my use case it is too often and I could have a different commit
time on a country basis.Your questions made me rethink the need of splitting
into cores.

Thanks

On Fri, Sep 4, 2009 at 5:38 AM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:

> On Fri, Sep 4, 2009 at 4:35 AM, Jonathan Ariel <ionat...@gmail.com> wrote:
>
> > It seems like it is really hard to decide when the Multiple Core solution
> > is
> > more appropriate.As I could understand from this list and wiki the
> Multiple
> > Core feature was designed to address the need of handling different sets
> of
> > data within the same solr instance, where the sets of data don't need to
> be
> > joined.
> >
>
> Correct. It is also useful when you don't want to setup multiple boxes or
> tomcats for each Solr.
>
>
> > In my case the documents are of a specific site and country. So document
> A
> > can be of Site 1 / Country 1, B of Site 2 / Country 1, C of Site 1 /
> > Country
> > 2, and so on.
> > For the use cases of my application I will never query across countries
> or
> > sites. I will always have to provide to the query the country id and the
> > site id.
> > Would you suggest to split my data into cores? I have few sites (around
> 20)
> > and more countries (around 90).
> > Should I split my data into sites (around 20 cores) and within a core
> > filter
> > by site? Should I split by Site and Country (around 1800 cores)?
> > What should I consider when splitting my data into multiple cores?
> >
> >
> The first question is why do you want to split at all? Is the schema or
> solrconfig different? Are the different sites or countries updated at
> different times? Is the combined index very big that the response times
> jump
> wildly when all the caches are thrown out if documents related to one site
> or country are updated? Does warmup or optimize or replication take too
> much
> time with one big index?
>
> Each core will have its own configuration files (maintenance) and you need
> to setup replication separately for each core (which is a pain with the
> script based replication). Also note that by keeping all cores in one
> tomcat
> (one JVM), a stop-the-world GC will stop all cores which is not the case
> when using separate JVMs for each index/core.
>
> --
> Regards,
> Shalin Shekhar Mangar.
>

Reply via email to