Thanks for your reply, this is one of the example where it fail. POST by using charset=utf-8 or other charset didn't help that CTRL-CHAR "^" error found in the title field, I hope solr can simply skip this record and go ahead to index the rest data.
<add> <doc> <field name="id">9780373773244</field> <field name="isbn13">9780373773244</field> <field name="title">Missing: Innocent By Association^Zachary's Law (Hqn Romance) </field> <field name="author">Lisa_Jackson </field> </doc> </add> curl "http://localhost:7070/solr/searchinfo/update?update.chain=tolerant-chain&maxErrors=100" -H 'Content-Type: text/xml; charset=utf-8' -d @data <?xml version="1.0" encoding="UTF-8"?> <response> <lst name="responseHeader"> <arr name="errors"/> <int name="maxErrors">100</int> <int name="status">400</int> <int name="QTime">0</int> </lst> <lst name="error"> <lst name="metadata"> <str name="error-class">org.apache.solr.common.SolrException</str> <str name="root-error-class">com.ctc.wstx.exc.WstxUnexpectedCharException</str> </lst> <str name="msg">Illegal character ((CTRL-CHAR, code 26)) at [row,col {unknown-source}]: [1,225]</str> <int name="code">400</int> </lst> </response> ________________________________ From: Thomas Corthals <tho...@klascement.net> Sent: Tuesday, June 9, 2020 2:12 PM To: solr-user@lucene.apache.org <solr-user@lucene.apache.org> Subject: Re: Fw: TolerantUpdateProcessorFactory not functioning If your XML or JSON can't be parsed, your content never makes it to the update chain. It looks like you're trying to index non-UTF-8 data. You can set the encoding of your XML in the Content-Type header of your POST request. -H 'Content-Type: text/xml; charset=GB18030' JSON only allows UTF-8, UTF-16 or UTF-32. Best, Thomas Op di 9 jun. 2020 07:11 schreef Hup Chen <chai...@hotmail.com>: > Any idea? > I still won't be able to get TolerantUpdateProcessorFactory working, solr > exited at any error without any tolerance, any suggestions will be > appreciated. > curl " > http://localhost:7070/solr/mycore/update?update.chain=tolerant-chain&maxErrors=100" > -d @data.xml > > <?xml version="1.0" encoding="UTF-8"?> > <response> > > <lst name="responseHeader"> > <arr name="errors"/> > <int name="maxErrors">100</int> > <int name="status">400</int> > <int name="QTime">1</int> > </lst> > <lst name="error"> > <lst name="metadata"> > <str name="error-class">org.apache.solr.common.SolrException</str> > <str name="root-error-class">com.ctc.wstx.exc.WstxEOFException</str> > </lst> > <str name="msg">Unexpected EOF; was expecting a close tag for element > <field> > at [row,col {unknown-source}]: [1,8191]</str> > <int name="code">400</int> > </lst> > </response> > > > ________________________________ > From: Hup Chen > Sent: Friday, May 29, 2020 7:29 PM > To: solr-user@lucene.apache.org <solr-user@lucene.apache.org> > Subject: TolerantUpdateProcessorFactory not functioning > > Hi, > > My solr indexing did not tolerate bad record but simply exited even I have > configured TolerantUpdateProcessorFactory in solrconfig.xml. > Please advise how could I get TolerantUpdateProcessorFactory to be > working? > > solrconfig.xml: > > <updateRequestProcessorChain name="tolerant-chain"> > <processor class="solr.TolerantUpdateProcessorFactory"> > <int name="maxErrors">100</int> > </processor> > <processor class="solr.RunUpdateProcessorFactory" /> > </updateRequestProcessorChain> > > restarted solr before indexing: > service solr stop > service solr start > > curl " > http://localhost:7070/solr/mycore/update?update.chain=tolerant-chain&maxErrors=100" > -d @test.json > > The first record is a bad record in test.json, the rest were not indexed. > > { > "responseHeader":{ > "errors":[{ > "type":"ADD", > "id":"0007264097", > "message":"ERROR: [doc=0007264097] Error adding field > 'usedshipping'='' msg=empty String"}], > "maxErrors":100, > "status":400, > "QTime":0}, > "error":{ > "metadata":[ > "error-class","org.apache.solr.common.SolrException", > "root-error-class","org.apache.solr.common.SolrException"], > "msg":"Cannot parse provided JSON: Expected key,value separator ':': > char=\",position=1240 AFTER='isbn\":\"4032171203\", \"sku\":\"\", > \"title\":\"ãã³ãã¡ã¡ããã³ã \"author\"' BEFORE=':\"Sachiko > OÃtomo\", ãã, \"ima'", > "code":400}} > >