Shahzad - I am curious what features of distributed search stops you to run
SolrCloud. Using DS, you would be able to search across cores or
collections.
https://cwiki.apache.org/confluence/display/solr/Advanced+Distributed+Request+Options

Thanks,
Susheel

On Tue, Feb 9, 2016 at 12:10 AM, Shahzad Masud <
shahzad.ma...@northbaysolutions.net> wrote:

> Thank you Shawn for your response. I would be running some performance
> tests lately on this structure (one JVM with multiple cores), and would
> share feedback on this thread.
>
> >There IS a way to specify the solr home for a specific context, but keep
> >in mind that I definitely DO NOT recommend doing this.  There is
> >resource and administrative overhead to running multiple copies of Solr
> >in one JVM.  Simply run one context and let it handle multiple shards,
> >whether you choose SolrCloud or not.
> Due to distributed search feature, I might not be able to run SolrCloud. I
> would appreciate, if you please share that way of setting solr home for a
> specific context in Jetty-Solr. Its good to seek more information for
> comparison purposes. Do you think having multiple JVMs would increase or
> decrease performance. My document base is around 20 million rows (in 24
> shards), with document size ranging from 100KB - 400 MB.
>
> SM
>
> On Mon, Feb 8, 2016 at 8:09 PM, Shawn Heisey <apa...@elyograg.org> wrote:
>
> > On 2/8/2016 1:14 AM, Shahzad Masud wrote:
> > > Thank you Shawn for your reply. Here is my structure of cores and
> shards
> > >
> > > Shard 1 = localhost:8983/solr_2014 [3 Core  - Employee, Service
> Tickets,
> > > Departments]
> > > Shard 2 = localhost:8983/solr_2015 [3 Core  - Employee, Service
> Tickets,
> > > Departments]
> > > Shard 3 = localhost:8983/solr_2016 [3 Core  - Employee, Service
> Tickets,
> > > Departments]
> > >
> > > While searching, I use distributed search feature to search data from
> all
> > > three shards in respective cores e.g. If I want to search from Employee
> > > data for all three years, I search from Employee core of three
> contexts.
> > > This is legacy design, do you think this is okay, or this require
> > immediate
> > > restructure / design? I am going to try this,
> > >
> > > Context = localhost:8982/solr (9 cores - Employee-2014, Employee-2015,
> > > Employee-2016, ServiceTickets-2014, ServiceTickets-2015,
> > > ServiceTickets-2016, Department-2014, Department-2015, Department-2016]
> > > distributed search would be from all three cores of same data category
> > > (i.e. For Employee search, it would be from Employee-2014,
> Employee-2015,
> > > Employee-2016).
> >
> > With SolrCloud, you can have multiple collections for each of these
> > types and alias them together.  Or you can simply have one collection
> > for employee, one for servicetickets, and one for department, with
> > SolrCloud automatically handling splitting those documents into the
> > number of shardsthat you specify when you create the collection.  You
> > can also do manual sharding and split each collection on a time basis
> > like you have been doing, but then you lose some of the automation that
> > SolrCloud provides, so I do not recommend handling it that way.
> >
> > > Regarding one Solr context per jetty; I cannot run two solr contexts
> > > pointing to different data in Jetty, as while starting jetty I have to
> > > provide -Dsolr.solr.home variable - which ends up pointing to one data
> > > folder (2014 data) only.
> >
> > You do not need multiple contexts to have multiple indexes.
> >
> > My dev Solr server has exactly one Solr JVM, with exactly one context --
> > /solr.  That instance of Solr has 45 indexes (cores) on it.  These 45
> > cores are various shards for three larger indexes.  I am not running
> > SolrCloud, but I certainly could.
> >
> > You can see 25 of the 45 cores in my Solr instance in this screenshot of
> > the admin UI for this server:
> >
> > https://www.dropbox.com/s/v87mxvkdejvd92h/solr-with-45-cores.png?dl=0
> >
> > There IS a way to specify the solr home for a specific context, but keep
> > in mind that I definitely DO NOT recommend doing this.  There is
> > resource and administrative overhead to running multiple copies of Solr
> > in one JVM.  Simply run one context and let it handle multiple shards,
> > whether you choose SolrCloud or not.
> >
> > Thanks,
> > Shawn
> >
> >
>

Reply via email to