On 3/28/07, Theodan <[EMAIL PROTECTED]> wrote:
For what it's worth, I started with an existing Lucene index and modified Solr's schema.xml so that I could just use the Lucene index in Solr. That Lucene index had about 230K docs. I then used your "post.jar" to post another 10K docs to the index after starting up the server. Those 10K docs only had 7 of the 30 fields that the original 230K docs had. Could that be the problem? I am noticing that the docs that I'm having problems with are from the original 230K-doc index, not from my subsequent 10K-doc post. The 10K docs seem to be findable by any of their 7 fields.
This is almost certainly due to a mismatch between the index- and query-time analysis of the fields. For instance, your schema defines the title field to be "string" (unanalyzed), but it is likely that some tokenization (perhaps via StandardAnalyzer) occurred in the original index. -Mike