Re: mark solr documents as duplicates on hashing the combination of some fields

2014-10-22 Thread Alexandre Rafalovitch
This is the "dark art" knowledge. I've updated the Reference Guide comment with the request to have this text included, but it would also be nice to have it as part of the Javadoc for the Factory or the URP itself. Maybe WIKI as well. I can see not getting this part causing somebody a lot of headac

Re: mark solr documents as duplicates on hashing the combination of some fields

2014-10-22 Thread Chris Hostetter
: I meant signature will be broken. For example suppose the destination of : hash function for signature fields are "sig". After each partial update it : becomes: "00"! details please. how are you configuring your update processor chain? what does your schema look like? what types of at

Re: mark solr documents as duplicates on hashing the combination of some fields

2014-10-22 Thread Ali Nazemian
; > > just don't configure teh signatureField to be the same as your > uniqueKey > > > field. > > > > > > configure some othe fieldname (ie "signature") instead. > > > > > > > > > : Date: Tue, 14 Oct 2014 12:08:26 +

Re: mark solr documents as duplicates on hashing the combination of some fields

2014-10-22 Thread Alexandre Rafalovitch
tureField to be the same as your uniqueKey > > field. > > > > configure some othe fieldname (ie "signature") instead. > > > > > > : Date: Tue, 14 Oct 2014 12:08:26 +0330 > > : From: Ali Nazemian > > : Reply-To: solr-user@lucene.apache.

Re: mark solr documents as duplicates on hashing the combination of some fields

2014-10-22 Thread Ali Nazemian
: solr-user@lucene.apache.org > : To: "solr-user@lucene.apache.org" > : Subject: mark solr documents as duplicates on hashing the combination of > some > : fields > : > : Dear all, > : Hi, > : I was wondering how can I mark some documents as duplicate (just marking > : for f

Re: mark solr documents as duplicates on hashing the combination of some fields

2014-10-21 Thread Chris Hostetter
y-To: solr-user@lucene.apache.org : To: "solr-user@lucene.apache.org" : Subject: mark solr documents as duplicates on hashing the combination of some : fields : : Dear all, : Hi, : I was wondering how can I mark some documents as duplicate (just marking : for future usage not de

mark solr documents as duplicates on hashing the combination of some fields

2014-10-14 Thread Ali Nazemian
Dear all, Hi, I was wondering how can I mark some documents as duplicate (just marking for future usage not deleting) based on the hash combination of some fields? Suppose I have 2 fields name "url" and "title" I want to create hash based on url+title and send it to another field name "signature".