Firstly, I see that you have overwriteDupes=false in your
configuration. This means that a signature will be generated but the
similar documents will still be added to the index. Now to your main
question about counting duplicate attempts, one simple way is to have
another UpdateRequestProcessor af
Currently I've the following Update Request Processor chain to prevent indexing
very similar text items into a core dedicated to store queries that our users
put into the web interface of our system.
true
false
signature
textsuggest,textng
org.apache.solr.upd