Venkatraman S wrote:
We are using Lucene and are migrating to Solr 1.2 (we are using Embedded
Solr). During this process we are stumbling on certain problems :
1) IF the same document is added again, then it it getting added in the
index again(duplicated); inspite of the fact that the IDs are unique across
documents. This document should be updated in the Index.
The corresponding entry for this field in schema.xml is :
<field name="id" type="text" indexed="true"
stored="true" multiValued="false" required="true"/>
Do you have:
<uniqueKey>id</uniqueKey>
2) Also, at the time of deleting a document, by providing its ID(exactly
similar to the deleteById proc in the Embedded Solr example) , we find that
the document is not getting deleted(and we also do not get any errors).
are you calling <commit/>?
3) While using facets, we are getting the stemmed versions of the
corresponding words in the faceted fields - how do we get the 'original'
word?
As in, 'intenti' for 'intentional' etc
Faceting works on the indexed terms - if the field has stemming applied,
the facets will be stemmed.
If you need to have stemming in some cases and the direct string in
other cases, you can use <copyField ...>