Hi,

You can use an UpdateProcessor to do so. This can be used to deduplicate 
documents based on exact or near matches with fields in other documents. Check 
the wiki page on deduplication [1] for an example.

[1]: http://wiki.apache.org/solr/Deduplication

Cheers,

On Thursday 09 September 2010 13:44:55 Staffan wrote:
> Hi,
> 
> I am looking for a way to store the checksum of a field's value, something
>  like:
> 
> <field name="text"...>
> <!-- the SHA1 checksum of text (before applying analyzer) -->
> <field name="text_sha1" type="checksum" indexed="true" stored="true">
> ...
> <copyField source="text" dest="text_sha1">
> 
> I haven't found anything like that in the docs or on google. Did I
> miss something? If not, would a custom tokenizer be a good way to
> implement it?
> 
> /Staffan
> 

Markus Jelsma - Technisch Architect - Buyways BV
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350

Reply via email to