Re: MapReduceIndexerTool Indexing

2016-01-05 Thread Erick Erickson
MRIT is not designed for that scenario, so you simply can't. What people usually do is have a process whereby, after the initial bulk load, there is some way their system-of-record "knows" what new docs have been added since and indexes only those. Flume is sometimes used if you have access. Best
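A minimal sketch of the "index only what is new" process described above, assuming the system-of-record exposes a last-modified timestamp per document. The record layout, the `modified` field, and the `select_new_docs` helper are all hypothetical; nothing here is part of MRIT or Solr.

```python
from datetime import datetime, timezone


def select_new_docs(records, last_indexed_at):
    """Return the records modified after the watermark, plus the new watermark.

    `records` is a list of dicts with a hypothetical "modified" datetime field.
    """
    new_docs = [r for r in records if r["modified"] > last_indexed_at]
    # Advance the watermark to the newest document seen; keep it unchanged if nothing is new.
    new_watermark = max((r["modified"] for r in new_docs), default=last_indexed_at)
    return new_docs, new_watermark


# Hypothetical system-of-record contents.
records = [
    {"id": "doc-1", "modified": datetime(2016, 1, 1, tzinfo=timezone.utc)},
    {"id": "doc-2", "modified": datetime(2016, 1, 3, tzinfo=timezone.utc)},
    {"id": "doc-3", "modified": datetime(2016, 1, 5, tzinfo=timezone.utc)},
]

# Time of the initial bulk load (assumed).
bulk_load_time = datetime(2016, 1, 2, tzinfo=timezone.utc)

to_index, watermark = select_new_docs(records, bulk_load_time)
print([d["id"] for d in to_index])
```

Only the documents returned in `to_index` would then be sent to the regular indexing path, and `watermark` would be stored for the next run.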

Re: MapReduceIndexerTool Indexing

2016-01-04 Thread vidya
Hi, I would like to index only new data, not data that has already been indexed (delta indexing). How can I achieve this using MRIT? Thanks in advance. -- View this message in context: http://lucene.472066.n3.nabble.com/MapReduceIndexerTool-Indexing-tp4248387p4248573.html Sent from the Solr - User mailing li

Re: MapReduceIndexerTool Indexing

2016-01-04 Thread Erick Erickson
Yes it does. MRIT is intended for initial bulk loads. It takes whatever it's pointed at and indexes it. Additionally, it does not update documents. If the same document (by ID) is indexed twice, you'll wind up with two copies in your results. Best, Erick On Mon, Jan 4, 2016 at 5:00 AM, vidya wr
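A toy model of the duplicate behavior described above, assuming an index that only ever appends what it is given. This is not MRIT code; `append_only_index` and `dedupe_by_id` are hypothetical helpers that show why the same ID indexed twice yields two copies, and one way the input could be reduced to one document per ID before a bulk load.

```python
def append_only_index(batches):
    """Model an index that appends every document and never updates by ID."""
    index = []
    for batch in batches:
        index.extend(batch)
    return index


def dedupe_by_id(docs):
    """Keep only the last document seen for each ID (hypothetical pre-processing step)."""
    latest = {}
    for doc in docs:
        latest[doc["id"]] = doc
    return list(latest.values())


first_run = [{"id": "a", "title": "old"}, {"id": "b", "title": "other"}]
second_run = [{"id": "a", "title": "new"}]

# Indexing the same ID in two runs leaves both copies in the append-only model.
index = append_only_index([first_run, second_run])
copies_of_a = [d for d in index if d["id"] == "a"]

# Collapsing the input by ID first leaves one document per ID.
unique_docs = dedupe_by_id(first_run + second_run)
```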