On 01/02/2014 12:44 PM, Chris Hostetter wrote:

: Not really ... uptime is irrelevant because they aren't in production. I just
: don't want to spend the time reloading 1TB of documents.

terminologiy confusion: you mean you don't wnat to *reindex* all of the
documents ... in solr "reloading" a core means something specific &
different from what you are talking about, and is what michael.boom was
refering to.

quite correct, sorry. reindex the core(s), not reload the core(s).

: I want to bring them all into a cloud collection. Assume I have 3 cores/shards
:
:   core1
:   core2
:   core3

You can't convert arbitrary cores into shards of a new collection, because
the document routing logic (which dictates what shard a doc lives in based
on it's uniqueKey) won't make sense.

I guess this is the heart if the issue.

I managed to assign the individual cores to a collection using the collection API to create the collection and then the solr.xml to define the core(s) and it's collection. This *seemed* to work. I even test indexed a set of documents checking totals before and after as well as content. Again, this *seemed* to work.

Did I get lucky that all 5k documents were coincidentally found in the appropriate core(s)? Have I possibly corrupted one or more cores? They are a working copy so nothing would be lost.

: I want to be able to address all three as if they were shards of a collection,
: something like.

w/o reindexing, one thing you could do is create a single collection for
each of your cores, and then create a collection alias over all three of
these collections...

https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-CreateormodifyanAliasforaCollection

Yes this works but isn't this really just a convenient way to avoid the shard parameter on /select?

if you want to just be able to shove docs to a single "collection" in solr
cloud and have them replace docs with the same uniqueKey then you're

yes, this is what I was hoping I could do.

going to need to either: re-index using SolrCloud so the default document
routing is done properly up front; implement a custom doc router that
knows about whatever rules you used to decide what would be in core1,
core2, core3.

I was afraid of that, but see question above about what I've done and index consistency.

Thanks for the insight.

David

Reply via email to