OK.

I need to find and prevent duplicates in a database using a Solr index.

I use Django with Haystack integration.

I use TextProfileSignature for fuzzy duplicate detection in text fields:


solrconfig.xml wrote:
> 
> <updateRequestProcessorChain name="dedupe">
>   <processor class="org.apache.solr.update.processor.SignatureUpdateProcessorFactory">
>     <bool name="enabled">true</bool>
>     <str name="signatureField">sig</str>
>     <bool name="overwriteDupes">false</bool>
>     <str name="fields">title,description</str>
>     <str name="signatureClass">org.apache.solr.update.processor.TextProfileSignature</str>
>   </processor>
>   <processor class="solr.LogUpdateProcessorFactory" />
>   <processor class="solr.RunUpdateProcessorFactory" />
> </updateRequestProcessorChain>
> 

But there are also some other fields.

How can I calculate the TextProfileSignature value for arbitrary title and
description values on the Django side WITHOUT adding them to the Solr index?

I only need to detect "possible duplicates" of the title and description
entered by a user, i.e. select all records from Solr with
user_sig=TextProfileSignature(user_title,user_description)
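
To make the lookup half concrete, here is a minimal sketch of the query I would send once I have a signature value. It does not compute the signature (that is the open question); it only builds a standard /select URL that matches on the sig field from the config above. The function name build_sig_lookup_url, the base URL http://localhost:8983/solr and the rows default are my own assumptions, and it assumes the sig field is indexed and that the signature is a hex string.

```python
import re
from urllib.parse import urlencode


def build_sig_lookup_url(signature, solr_base="http://localhost:8983/solr", rows=10):
    """Build a Solr /select URL matching documents whose `sig` field equals `signature`.

    Hypothetical helper: `solr_base` and `rows` are illustrative defaults.
    The signature itself must be obtained elsewhere.
    """
    # Assumption: the signature is a hex digest, so anything else is rejected
    # rather than escaped for the Solr query syntax.
    if not re.fullmatch(r"[0-9a-fA-F]+", signature):
        raise ValueError("signature is expected to be a hex string")
    params = {
        "q": 'sig:"%s"' % signature,  # exact match on the signature field
        "rows": rows,                 # cap the number of candidate duplicates
        "wt": "json",                 # ask Solr for a JSON response
    }
    return "%s/select?%s" % (solr_base.rstrip("/"), urlencode(params))
```

The resulting URL could be fetched from a Django view with any HTTP client, and the returned documents shown to the user as possible duplicates.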

Is there a web service interface in Solr to do this?
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-install-DuplicatesDetectorService-tp1472561p1478111.html
Sent from the Solr - User mailing list archive at Nabble.com.
