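Hi,

You can use an UpdateRequestProcessor for this. Solr's SignatureUpdateProcessorFactory computes a signature (hash) over one or more fields at index time, before analysis, and stores it in a separate field. It is normally used to deduplicate documents based on exact or near matches with fields in other documents, but it works just as well if all you want is a stored checksum. Check the wiki page on deduplication [1] for an example.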
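Something along these lines in solrconfig.xml might be a starting point. It is only a sketch adapted from the wiki example: the chain name "checksum" and the field name "text_sig" are made up for illustration, and I have used MD5Signature because, as far as I know, the bundled signature classes are MD5, Lookup3 and TextProfile. A SHA1 checksum would probably need a custom Signature subclass.

```xml
<!-- Sketch only: "checksum" and "text_sig" are hypothetical names -->
<updateRequestProcessorChain name="checksum">
  <processor class="solr.processor.SignatureUpdateProcessorFactory">
    <bool name="enabled">true</bool>
    <!-- field that receives the hash; must be declared in schema.xml -->
    <str name="signatureField">text_sig</str>
    <!-- only store the checksum, do not delete matching documents -->
    <bool name="overwriteDupes">false</bool>
    <!-- source field(s) the hash is computed from -->
    <str name="fields">text</str>
    <!-- MD5 stands in for SHA1 here (assumption, see above) -->
    <str name="signatureClass">solr.processor.MD5Signature</str>
  </processor>
  <processor class="solr.LogUpdateProcessorFactory" />
  <processor class="solr.RunUpdateProcessorFactory" />
</updateRequestProcessorChain>
```

The signature field would be declared in schema.xml as an ordinary indexed and stored string field, and the chain still has to be referenced from your update handler as described on the wiki page. No copyField is needed, since the processor fills the field itself.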
[1]: http://wiki.apache.org/solr/Deduplication

Cheers,

On Thursday 09 September 2010 13:44:55 Staffan wrote:
> Hi,
>
> I am looking for a way to store the checksum of a field's value, something
> like:
>
> <field name="text"...>
> <!-- the SHA1 checksum of text (before applying analyzer) -->
> <field name="text_sha1" type="checksum" indexed="true" stored="true">
> ...
> <copyField source="text" dest="text_sha1">
>
> I haven't found anything like that in the docs or on google. Did I
> miss something? If not, would a custom tokenizer be a good way to
> implement it?
>
> /Staffan
>

Markus Jelsma - Technisch Architect - Buyways BV
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350