Re: Indexing checksum of field value

2010-09-09 Thread Lance Norskog
Look at Deduplication: http://wiki.apache.org/solr/Deduplication It implements a unique hashcode (Lookup3Signature ) as a tool that avoids rewriting the same document over and over. It declares this in solrconfig.xml instead of schema.xml. Lanc

Re: Indexing checksum of field value

2010-09-09 Thread Markus Jelsma
Hi, You can use an UpdateProcessor to do so. This can be used to deduplicate documents based on exact or near matches with fields in other documents. Check the wiki page on deduplication [1] for an example. [1]: http://wiki.apache.org/solr/Deduplication Cheers, On Thursday 09 September 2010 1

Indexing checksum of field value

2010-09-09 Thread Staffan
Hi, I am looking for a way to store the checksum of a field's value, something like: ... I haven't found anything like that in the docs or on google. Did I miss something? If not, would a custom tokenizer be a good way to implement it? /Staffan