As you guessed, I'm trying to build a non-HDFS collection from the index
files in HDFS (constructed by MRIT). To give you the overall picture, below
is my workflow (Sorry if it is too long)-
I have collection-A that is serving an index and I'm replacing the
collection with another one - collectio
Why not just use the --go-live option for MRIT and _put_ the data in
collectionB? Note this only works for Solr indexes that are hosted on
HDFS...
If your Solr collections are not on HDFS, then it's the same process
as I outlined. You create collectionB and copy things "to the right
place" after M
Thank you Erick.
I actually only have an index. I do not have collection B that hosts this
index. The reducers of the MR job build an index (reducer per shard). I'm
looking to load these generated lucene index files to the cores of a
collection (new or existing and then I can work with aliasing a
bq: is there a better way to load a pre-built index quickly like
before
In a word (well, two) "Collection Aliasing". You have two collections
and an alias.
So your search URL stays constant, say 'aliasedcollection'. Then you index to
collectionA and point aliasedcollection to it. That's your live