It sounds like it might be more efficient to implement this at the
crawler level to short-circuit crawling whole sites.  Baring that, a
separate database sounds more flexible.
Non-deletable docs doesn't sound like something that should be a
general feature.
However, one would probably be able to implement custom logic to do
this using an update-processor plugin (should be in the next version
of Solr)

-Yonik

On 8/16/07, Lance Norskog <[EMAIL PROTECTED]> wrote:
> Hi-
>
> We recrawl the same places and update blindly without checking if a document
> is already in the index.   We have a use case where we would like to delete
> documents (porn) and have them stay deleted. To implement this use case now,
> we would need to check the existence of the document and check for a
> 'deleted' flag. Or, we would maintain a separate database of deleted
> documents that we check against.
>
> A more efficient way to do this would be to have a 'do not delete' flag in
> the document. Delete failures are currently ignored and they would continue
> to be ignored.
>
> Is this a worthwhile addition to 1.3 or 1.4?
>
> Thanks for your time,
>
> Lance
>

Reply via email to