First of all, sorry about the subject of this discussion. It should have been something like "Adding config to SolrCloud without starting a Solr server"

Mark Miller skrev:
k
On May 16, 2012, at 5:35 AM, Per Steffensen wrote:

Hi

We want to create a Solr config in ZK during installation of our product, but 
we dont want to create any shards in that phase. We will create shards from our 
application when it starts up and also automatically maintain the set of shards 
from our application (which uses SolrCloud). The only way we know to create a 
Solr config in ZK is to spin up a  Solr with  system properties zkHost, 
bootstrap_confdir and collection.configName. Is there another, more API-ish, 
way of creating a Solr config in ZK?

Regards, Per Steffensen

I've started some work on this, but I have not finished.

There is a main method in ZkController that has some initial code. Currently it 
just lets you upload a specifically named config set directory - I would also 
like to add the same multi core config set upload option we have on startup - 
where it reads solr.xml, finds all the config dirs and uploads them, and links 
each collection to a config set named after it.
Yeah ok, I just want the config created - no shards/slices/collections.
Technically, you could use any tool to set this up - there are a variety of 
options in the zk world - you just have to place the config files under the 
right node.
I would really want to do it through Solr. This is the correct way, I think. So that, when you change your "strategy" e.g. location or format of configs in ZK, I will automatically inherit that.
 There is one other tricky part though - the collection has to be set to the 
right config set name. This is specified on the collection node in ZooKeeper. 
When creating a new collection, you can specify this as a param. If none is set 
and there is only one config set, that one config set is used. However, some 
link must be made, and it is not done automatically with your initial 
collections in solr.xml unless there is only one config set.
I know about that, and will use Solr to create collections. I just want the config established in ZK before that, and not create the config "during the process of creating a collection".
So now I'm thinking perhaps we should default the config set name to the 
collection name. Then if you simply use the collection name initially when you 
upload the set, no new linking is needed. If you don't like that, you can 
explicitly override what config set to use. Convention would be to name your 
config sets after your collection name, but extra work would allow you to do 
whatever you want.
I want several collections to use the same config, so I would have to do that extra work.
You can find an example of the ZkContoller main method being used in 
solr/cloud-dev scripts. The one caveat is that we pass an extra param to 
solrhome and briefly run a ZkServer within the ZkController#main method since 
we don't have an external ensemble. Normally this would not make sense and you 
would want to leave that out. I need to clean this all up (the ZkController 
params) and document it on the wiki as soon as I make these couple tweaks 
though.
Ok

Thanks, Mark
- Mark Miller
lucidimagination.com

Regards, Per Steffensen












Reply via email to