Hi,

The way I understand it, having multiple documents in a single xml file and posting that to solr is faster then posting a single xml for each document (even with a single commit after all separate xml files). It makes sence, and some simple tests indicates that this is true. However, if one of the documents triggers an error then solr tells me about the error, but it doesn't say what document it was.

For example, I generated an xml file with 500 documents to be indexed, and for document number 251 I deliberately added a field that doesn't exist in the schema.xml. When I tried to make solr index this xml I got the error:
ERROR:unknown field 'myNonExistantField'
And that is just great. But what document was it? I need to know this, because the next step is to log an error like "document with id 251" could not be indexed, error: unknown field 'myNonExistantField'". Then I want to continue indexing document nr 252-500, and if a get another error I repeat these steps. If the number of errors exceed a certain level (for example 10 documents with errors) I abort the indexing and log an error.

Is there a way to get this information out of solr?

Regards
/Jimi

Reply via email to