: I just finished reading on the wiki about deduplication and the solr.UUIDField
: type. What I'd like to do is generate an ID for a document by hashing a subset
: of its fields. One route I thought would be to do this ahead of time to CSV
: data, but I would think sticking something into the UpdateRequest chain would
: be more elegant.
:
: Has anyone had any success in this area?

What you described is *exactly* how the SignatureUpdateProcessorFactory works: you specify a list of field names and it uses them to build a hash.

-Hoss
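
For anyone who wants to see the idea in isolation, here is a minimal Python sketch of building an ID by hashing a subset of a document's fields. It is not Solr's code and does not reproduce the hash that SignatureUpdateProcessorFactory would produce; the function name `signature`, the sample field names, and the choice of MD5 are all assumptions made for illustration.

```python
import hashlib


def signature(doc, fields):
    """Return a hex digest built from the named fields of doc.

    Hypothetical helper for illustration only; not Solr's implementation.
    """
    h = hashlib.md5()
    for name in fields:
        value = doc.get(name, "")
        # Feed the field name and a separator along with the value so that
        # ("ab", "c") and ("a", "bc") do not collapse into the same input.
        h.update(name.encode("utf-8"))
        h.update(b"\x00")
        h.update(str(value).encode("utf-8"))
        h.update(b"\x00")
    return h.hexdigest()


# Made-up sample document: only "name" and "cat" contribute to the ID.
doc = {"name": "Widget", "cat": "tools", "price": 9.99}
doc["id"] = signature(doc, ["name", "cat"])
```

Two documents that agree on the chosen fields get the same ID, whatever their other fields contain, which is the behaviour you want for deduplication.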