Earlier I recommended clients to set stored=false wherever they could in order 
to save index space,
but now I do the opposite (well, either stored or docValues) to prepare for a 
smooth re-index process
from the existing Solr install into a new cluster. 

That is, of course, unless you have the source data readily available and 
re-indexing from it is fairly quick.
Sometimes you have the source repo but indexing takes two weeks, then a 
Solr-Solr reindex may be much faster!

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com

> 19. des. 2018 kl. 09:56 skrev Charlie Hull <char...@flax.co.uk>:
> 
> On 18/12/2018 17:40, Erick Erickson wrote:
>> You are far better off re-indexing totally.
> 
> I would add '...if you have the original data'. Not everyone *can* re-index, 
> and there are various hairy ways of updating an index in place, but they 
> require deep-level magic.
> 
> But if you have the original source data, you should re-index.
> 
> Cheers
> 
> Charlie
>> Using IndexUpgraderTool has never guaranteed compatibility
>> across multiple major releases. I.e. if you have an index built
>> with 4x, using that tool will work for 5x, but then going from 5x
>> to 6x _even after the entire index is rewritten from 4 x format_
>> has  never been guaranteed to work. By "guaranteed to work"
>> here, I mean that there can be subtle problems, regardless
>> of appearances
>> The two most succinct statements as to why this is true follow.
>> I will not second guess _anything_ these two people have to
>> say about how Lucene works ;)
>>  From Mike McCandless:
>> “This really is the difference between an index and a database:
>> we do not store, precisely, the original documents.  We store an
>> efficient derived/computed index from them.”
>>  From Robert Muir:
>> “I think the key issue here is Lucene is an index not a database.
>> Because it is a lossy index and does not retain all of the user's
>> data, its not possible to safely migrate some things automagically...
>> The function is y = f(x) and if x is not available its not possible, so
>> lucene can't do it.”
>> As of 6x, a marker is written into each segments and the lowest
>> version is retained when segments are merged. 8x will refuse
>> to start if it detects a 6x marker so this will be enforced soon.
>> Best,
>> Erick
>> On Mon, Dec 17, 2018 at 12:27 PM Pushkar Raste <pushkar.ra...@gmail.com> 
>> wrote:
>>> 
>>> Hi,
>>> I have questions about the IndexUpgrader tool.
>>> 
>>> - I want to upgrade from Solr 4 to Solr 7. Can I run upgrade the index from
>>> 4 to 5 then 5 to 6 and finally 6 to 7 using appropriate version of the
>>> IndexUpgrader but without loading the Index in the Solr at all during the
>>> successive upgrades.
>>> 
>>> - The note in the tool says "This tool only keeps last commit in an index".
>>> Does this mean I have optimize the index before running the tool?
>>> 
>>> - There is another note about partially upgraded index. How can the index
>>> be partially upgraded. One scenario I can think of is 'If I upgraded let's
>>> say from Solr 5 to Solr 6 and then added some documents. The new documents
>>> will be in Lucerne 6 format already, while old documents will still be Solr
>>> 5 format’ Is my understanding correct?
> 
> 
> -- 
> Charlie Hull
> Flax - Open Source Enterprise Search
> 
> tel/fax: +44 (0)8700 118334 <tel:+44%208700%20118334>
> mobile:  +44 (0)7767 825828 <tel:+44%207767%20825828>
> web: www.flax.co.uk <http://www.flax.co.uk/>

Reply via email to