> duplicates based on that signature and gather that information yourself
> as long as such a feature isn't there."
>
> Can you explain more what you have in mind ?
>
> Thank you for your help!
>
> matt
>
> --- On Mon, 5/10/10, Markus Jelsma wrote:
>
From: Matthieu Labour
Sent: Mon 10-05-2010 23:30
To: solr-user@lucene.apache.org;
Subject: RE: How to query for similar documents before indexing
Markus,
Thank you for your response.
It would be great if the index had an option to prevent duplicates from
entering the index. But is it going to be
?
Cheers
matt
--- On Mon, 5/10/10, Markus Jelsma wrote:
From: Markus Jelsma
Subject: RE: How to query for similar documents before indexing
To: solr-user@lucene.apache.org
Date: Monday, May 10, 2010, 4:11 PM
Hi,
Deduplication [1] is what you're looking for. It can utilize different anal
Sent: Mon 10-05-2010 22:41
To: solr-user@lucene.apache.org;
Subject: How to query for similar documents before indexing
Hi
I want to implement the following logic:
Before I index a new document into the index, I want to check if there are
already documents in the index with similar content to the content of the
document about to be inserted. If the request returns 1 or more documents, then
I don't want to inser
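
The check-before-insert logic described above can be sketched as follows. This is only a minimal illustration, not Solr's implementation: it assumes that "similar" means identical after lowercasing and collapsing whitespace, and it uses a plain in-memory dictionary as a stand-in for the index. The names content_signature, InMemoryIndex, find_duplicates and add_if_new are hypothetical and do not come from the thread or from Solr.

```python
import hashlib
import re


def content_signature(text: str) -> str:
    """Hash the text after lowercasing and collapsing runs of whitespace."""
    normalized = re.sub(r"\s+", " ", text.strip().lower())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class InMemoryIndex:
    """Hypothetical stand-in for a search index, keyed by content signature."""

    def __init__(self):
        # Maps a signature to the ids of the documents stored under it.
        self._ids_by_signature = {}

    def find_duplicates(self, text: str) -> list:
        """Return ids of already-indexed documents with the same signature."""
        return list(self._ids_by_signature.get(content_signature(text), []))

    def add_if_new(self, doc_id: str, text: str) -> bool:
        """Index the document only if no duplicate exists.

        Returns True if the document was added, False if it was skipped.
        """
        signature = content_signature(text)
        if signature in self._ids_by_signature:
            return False
        self._ids_by_signature[signature] = [doc_id]
        return True
```

A caller would run add_if_new for each incoming document and skip those for which it returns False. Catching near-duplicates rather than normalized exact matches would need a fuzzy signature in place of the plain hash used here.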