I would add a new field with the new behavior. Then any document with
content in the new field would not need to be deleted. Find the deletable
content with:
*:* -new_field:*
I generally add a field that records when the document was indexed or
updated. That can be really handy.
<field name="indexed_datetime" type="pdate" stored="true" indexed="true"
default="NOW" docValues="true" />
wunder
Walter Underwood
[email protected]
http://observer.wunderwood.org/ (my blog)
> On Jun 21, 2020, at 12:32 PM, Scott Q. <[email protected]> wrote:
>
> Also note that I didn't apply the new schema yet because I don't
> think it will let me change it mid-way like this without deleting all
> data and starting anew...
>
> On Sunday, 21/06/2020 at 15:12 Scott Q. wrote:
>
>
> My apologies, it appears the configuration tags were escaped and
> completely removed from my original e-mail.
>
> I am including them via pastebin.com
>
>
> https://pastebin.com/BSUqgEke
>
>
>
>
> On Sunday, 21/06/2020 at 15:04 Scott Q. wrote:
>
>
> Hello,
>
> I use Solr with Dovecot and I made a mistake when I initially created
> my schema for my instance. I created the username field with partial
> matches enabled.
> Aka, like this:
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
> I already indexed millions of documents using this schema before I
> fixed it and changed it to
>
>
>
>
>
>
>
>
> The task at hand is to remove all documents indexed the old way, but
> how can I do that ? user is of the form [email protected] and if I
> search for [email protected] it matches all of 'user' or 'domain.com'
> which has obvious unwanted consequences.
>
> Therefore, how can I remove older documents which were indexed with
> partial match ?