Thanks for the reply,
I'm on solr 8.2 so cursorMark is there.

Doing this from one collection to another collection, and then use a
collection alias is probably the way to go, but  actually, my suggestion
was a little more bold:

I'm indexing on top of the same core, i.e from
http://localhost:8983/solr/mycollection to
http://localhost:8983/solr/mycollection

(This is why I suggested adding a version:[* TO <current_highest_version>]
to ensure it terminates for large imports.)

With this in mind, are you still thinking this is a safe approach?

Thanks,
Bjarke


Den man. 27. apr. 2020 kl. 13.46 skrev Emir Arnautović <
emir.arnauto...@sematext.com>:

> Hi Bjarke,
> I don’t see a problem with that approach if you have enough resources to
> handle both cores at the same time, especially if you are doing that while
> serving production queries. The only issue is that if you plan to do that
> then you have to have all fields stored. Also note that cursorMark support
> was added a bit later to entity processor, so if you are running a bit
> older version of Solr, you might not have cursors - I’ve found it the hard
> way.
>
> Emir
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>
>
>
> > On 27 Apr 2020, at 13:11, Bjarke Buur Mortensen <morten...@eluence.com>
> wrote:
> >
> > Hi list,
> >
> > Let's say I add a copyField to my solr schema, or change the analysis
> chain
> > of a field or some other change.
> > It seems to me to be an alluring choice to use a very simple
> > dataimporthandler to reindex all documents, by using a
> SolrEntityProcessor
> > that points to itself. I have just done this for a very small collection,
> > but I was wondering what the caveats are, since this is not the
> recommended
> > practice. What can go wrong using this approach?
> >
> > <document> <entity name="all_from_self" processor="SolrEntityProcessor"
> url=
> > "http://localhost:8983/solr/mycollection"; qt="lucene" query="*:*" wt=
> > "javabin" rows="1000" cursorMark="true" sort="id asc" fl=
> > "*,orig_version_l:_version_"/> </document>
> >
> > PS: (It is probably necessary to add a version:[* TO
> > <current_highest_version>] to ensure it terminates for large imports)
> > PPS: (Obviously you shouldn't add the clean parameter)
> >
> > /Bjarke
>
>

Reply via email to