For xml 1.1 documents, you can view if any of your documents have these restricted characters defined here: http://www.w3.org/TR/2006/REC-xml11-20060816/#NT-RestrictedChar
If they are, you'll have to remove them. didier On Sun, Jul 18, 2010 at 11:16 AM, robert mena <robert.m...@gmail.com> wrote: > Hi, > > I am doing some tests with solr 1.4.1. > > I've created a XML file with the documents I'd like to index. With a few > items (1000) everything went fine. > > When I went to a more representative import (around 60000) I got error > > java -jar example/exampledocs/post.jar doc.xml > SimplePostTool: version 1.2 > SimplePostTool: WARNING: Make sure your XML documents are encoded in UTF-8, > other encodings are not currently supported > SimplePostTool: POSTing files to http://localhost:8983/solr/update.. > SimplePostTool: POSTing file add.xml > SimplePostTool: FATAL: Solr returned an error: > Illegal_character_CTRLCHAR_code_27__at_rowcol_unknownsource_37022847 > > I've tried to track where this problem is located without luck. > > Any ideas? >