Thank you, Erick & Shawn, for taking significant time out of your weekends to debug and explain in such detail. I will try to address the main points from your emails to give more context on my situation.

1. Erick, as part of our upgrade from 4.7.2 to 5.3.0, I re-indexed all docs from my old Master-Slave setup to my SolrCloud using the DIH SolrEntityProcessor, which used a Script Transformer. I unwittingly messed up the script, and hence this 'uuid' field (String type) got messed up. All records prior to Sep 20 2015 have this issue, which I am currently trying to rectify.

2. Regarding openSearcher=true/false, I had it as false all along in my 4.7.2 config. I read somewhere that SolrCloud or 5.x doesn't honor it, or that it should be left at the default (I don't remember exactly where I read it). Hence, I removed it from my solrconfig.xml, going against my intuition :-)

3. Erick, I wasn't getting all 1.4 million in one shot. I was initially using batches of 100 docs, which I later increased to 500 docs per batch. Also, it would not be an infinite loop if I commit after each batch, right?

4. Shawn, you are correct: the uuid is of String type and is not the unique key in my schema. My uniqueKey is uniqueId, and systemid is of no consequence here; it's another field for differentiating apps within my Solr.

Thank you very much again, guys. I will incorporate your suggestions and report back.

Thanks,

Ravi Kiran Bhaskar

On Sat, Sep 26, 2015 at 12:58 PM, Erick Erickson <erickerick...@gmail.com> wrote:

> Oh, one more thing. _assuming_ you can't change the indexing process
> that gets the docs from the system of record, why not just add an
> update processor that does this at index time? See:
> https://cwiki.apache.org/confluence/display/solr/Update+Request+Processors
> ,
> in particular the StatelessScriptUpdateProcessorFactory might be a
> good candidate. It just takes a bit of javascript (or other scripting
> language) and changes the record before it gets indexed.
>
> FWIW,
> Erick
>
> On Sat, Sep 26, 2015 at 9:52 AM, Shawn Heisey <apa...@elyograg.org> wrote:
> > On 9/26/2015 10:41 AM, Shawn Heisey wrote:
> >> <autoCommit> <maxTime>300000</maxTime> </autoCommit>
> >
> > This needs to include openSearcher=false, as Erick mentioned. I'm sorry
> > I screwed that up:
> >
> > <autoCommit>
> >   <maxTime>300000</maxTime>
> >   <openSearcher>false</openSearcher>
> > </autoCommit>
> >
> > Thanks,
> > Shawn
>