Re: Micro-Sharding

Shawn Heisey Mon, 05 Dec 2011 19:47:11 -0800

On 12/5/2011 6:57 PM, Jamie Johnson wrote:

Question which is a bit off topic.  You mention your algorithm for
sharding, how do you handle updates or do you not have to deal with
that in your scenario?

I have a long-running program based on SolrJ that handles updates. Once a minute, I run through an update cycle, which consists of deletes, document reinserts, and inserting new content. The data is pulled from a MySQL database with the sharding algorithm specified as part of the MySQL query. I keep track of which shards actually received changes, so that I do not do unnecessary commits.
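To illustrate the "only commit shards that changed" idea, here is a minimal sketch in plain Java. It is not the actual program: `ShardClient`, `Change`, and `UpdateCycle` are hypothetical names, the shard number is assumed to arrive with each row from the database query, and in a real SolrJ program each `ShardClient` method would delegate to the SolrJ client for that shard's core.

```java
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in for a per-shard Solr client. With SolrJ, each method
// would delegate to the client object for that shard's core.
interface ShardClient {
    void deleteById(String id);
    void add(String id, String body);
    void commit();
}

// One pending change, already assigned to a shard by the database query.
final class Change {
    final int shard;
    final String id;
    final String body; // null means "delete this document"

    Change(int shard, String id, String body) {
        this.shard = shard;
        this.id = id;
        this.body = body;
    }
}

final class UpdateCycle {
    // Applies every change, remembers which shards were touched, and then
    // commits only those shards. Returns the indexes of the committed shards.
    static Set<Integer> run(List<ShardClient> shards, List<Change> changes) {
        Set<Integer> dirty = new TreeSet<>();
        for (Change c : changes) {
            ShardClient client = shards.get(c.shard);
            if (c.body == null) {
                client.deleteById(c.id);
            } else {
                client.add(c.id, c.body);
            }
            dirty.add(c.shard);
        }
        for (int shard : dirty) {
            shards.get(shard).commit();
        }
        return dirty;
    }
}
```

A scheduler would call `UpdateCycle.run` once per cycle; a shard with no rows in that cycle never sees a commit.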

For a full reindex, the build program sets up a separate thread, which uses the dataimporter on a set of build cores, then swaps them with the live cores. The algorithm is in the SQL entity in dih-config.conf, passing parameters in via the URL.
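As a rough sketch of the two HTTP requests involved, the helper below builds a DataImportHandler full-import URL for a build core and a CoreAdmin SWAP URL. The assumptions are mine, not from the setup described above: the handler is registered at `/dataimport`, the core names and the `numShards`/`modVal` parameters are made up for illustration, and `ReindexUrls` is a hypothetical class. Extra URL parameters like these can be referenced inside the DIH SQL entity as `${dataimporter.request.numShards}` and so on.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.Map;

final class ReindexUrls {
    // URL that starts a full import on a build core, passing extra request
    // parameters that the DIH config can read via ${dataimporter.request.*}.
    // Assumes the handler is registered at /dataimport in solrconfig.xml.
    static String fullImport(String baseUrl, String buildCore, Map<String, String> params) {
        StringBuilder url = new StringBuilder(baseUrl)
            .append('/').append(buildCore)
            .append("/dataimport?command=full-import");
        for (Map.Entry<String, String> e : params.entrySet()) {
            url.append('&').append(encode(e.getKey()))
               .append('=').append(encode(e.getValue()));
        }
        return url.toString();
    }

    // CoreAdmin URL that swaps the freshly built core with the live one.
    static String swap(String baseUrl, String liveCore, String buildCore) {
        return baseUrl + "/admin/cores?action=SWAP&core=" + encode(liveCore)
            + "&other=" + encode(buildCore);
    }

    private static String encode(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a conforming JVM.
            throw new IllegalStateException(e);
        }
    }
}
```

Because a full-import request returns before the import finishes, a build thread along these lines would presumably poll the handler with `command=status` and issue the swap only once the import reports it is done.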


Thanks,
Shawn
