Thanks,I detected that same problem. I have CP 1252 system file encoding and was recording data-config.xml file in UTF-8. DIH was reading using the default encoding. One possible workarround was using InputStream and OutputStream like DIH, but the files won't be in UTF-8 if the system has different encoding (not really good for XML files). I will get the latest 1.4 build and maintain the files in UTF-8.
On Fri, Mar 27, 2009 at 9:37 PM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > On Sat, Mar 28, 2009 at 12:51 AM, Shalin Shekhar Mangar < > shalinman...@gmail.com> wrote: > > > > > I see that you are specifying the topologyname's value in the query > itself. > > It might be a bug in DataImportHandler because it reads the data-config > as a > > string from an InputStream. If your default platform encoding is not > UTF-8, > > this may be the cause. > > > > I've opened SOLR-1090 to fix this issue. > > https://issues.apache.org/jira/browse/SOLR-1090 > > -- > Regards, > Shalin Shekhar Mangar. >