Re: Unable to index rich-text documents in Solr Cloud

2015-03-19 Thread Zheng Lin Edwin Yeo
Hi Shawn, Yes, I'm using the /update/extract handler. I'm not sure about the shards.qt parameter either. Regards, Edwin On 19 March 2015 at 13:18, Shawn Heisey wrote: > On 3/18/2015 1:22 AM, Zheng Lin Edwin Yeo wrote: > > I'm having some issues with indexing rich-text documents from the Solr > >

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Zheng Lin Edwin Yeo
Hi Charlee, I've followed the setup from the Solr In Action book and assigned port 8983 to shard1 and port 8984 to shard2. Will that cause any issues? Regards, Edwin On 19 March 2015 at 13:02, Charlee Chitsuk wrote: > The http://192.168.2.2:8984/solr/ > < > http://192.168.2.2:8984/solr/logmill/up

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Zheng Lin Edwin Yeo
Oh ya. The previous log was from shard1. This log is from shard2. INFO - 2015-03-18 15:06:51.019; org.apache.solr.update.processor.LogUpdateProcessor; [logmill] webapp=/solr path=/update params={distrib.from=http://192.168.2.2:8983/solr/logmill/&update.distrib=TOLEADER&wt=javabin&version=2} {} 0

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Shawn Heisey
On 3/18/2015 1:22 AM, Zheng Lin Edwin Yeo wrote: > I'm having some issues with indexing rich-text documents from the Solr > Cloud. When I tried to index a pdf or word document, I get the following > error: > > > org.apache.solr.common.SolrException: Bad Request > > > > request: > http://192.1

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Charlee Chitsuk
In http://192.168.2.2:8984/solr/ , port 8984 may be an HTTPS port. The HTTP port should be 8983. Hope this helps. -- Best Regards,

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Damien Kamerman
It sounds like https://issues.apache.org/jira/browse/SOLR-5551 Have you checked the solr.log for all nodes? On 19 March 2015 at 14:43, Zheng Lin Edwin Yeo wrote: > This is the logs that I got from solr.log. I can't seems to figure out > what's wrong with it. Does anyone knows? > > > > ERROR - 20

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Zheng Lin Edwin Yeo
These are the logs that I got from solr.log. I can't seem to figure out what's wrong. Does anyone know? ERROR - 2015-03-18 15:06:51.019; org.apache.solr.update.StreamingSolrClients$1; error org.apache.solr.common.SolrException: Bad Request request: http://192.168.2.2:8984/solr/logmill

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Damien Kamerman
I suggest you check your solr logs for more info as to the cause. On 19 March 2015 at 12:58, Zheng Lin Edwin Yeo wrote: > Hi Erick, > > No, the PDF file is a testing file which only contains 1 sentence. > > I've managed to get it to work by removing startup="lazy" in > the ExtractingRequestHandl

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Zheng Lin Edwin Yeo
Hi Erick, No, the PDF file is a test file that contains only one sentence. I've managed to get it to work by removing startup="lazy" from the ExtractingRequestHandler and adding the following lines: ignored_ true links ignored_ Does the presence of startup="lazy" affect th
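
[Editor's sketch] The archive stripped the markup from the lines quoted in the message above, leaving only the values (ignored_, true, links, ignored_). Those four values line up with the defaults of the /update/extract handler in the stock Solr 4.x example solrconfig.xml, so the added lines were probably close to the fragment below. This is a reconstruction under that assumption, not the poster's actual config: the parameter names uprefix, captureAttr, fmap.a and fmap.div are taken from the stock example, not from the message.

```xml
<!-- Assumed reconstruction: parameter names come from the stock Solr 4.x
     example solrconfig.xml, not from the original message. -->
<!-- startup="lazy" is omitted here, as described in the message. -->
<requestHandler name="/update/extract"
                class="solr.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <!-- prefix extracted fields that are not defined in the schema -->
    <str name="uprefix">ignored_</str>
    <!-- capture attributes of extracted elements into separate fields -->
    <str name="captureAttr">true</str>
    <!-- map <a> content to "links"; send <div> content to an ignored field -->
    <str name="fmap.a">links</str>
    <str name="fmap.div">ignored_</str>
  </lst>
</requestHandler>
```

If this matches what was added, uprefix is the setting most likely to matter: it routes Tika metadata fields that the schema does not define to names starting with ignored_, which the example schema handles with an ignored_* dynamic field.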

Re: Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Erick Erickson
Shot in the dark, but is the PDF file significantly larger than the others? Perhaps you're simply exceeding the packet limits for the servlet container? Best, Erick On Wed, Mar 18, 2015 at 12:22 AM, Zheng Lin Edwin Yeo wrote: > Hi everyone, > > I'm having some issues with indexing rich-text docume

Unable to index rich-text documents in Solr Cloud

2015-03-18 Thread Zheng Lin Edwin Yeo
Hi everyone, I'm having some issues with indexing rich-text documents in Solr Cloud. When I try to index a PDF or Word document, I get the following error: org.apache.solr.common.SolrException: Bad Request request: http://192.168.2.2:8984/solr/logmill/update?update.distrib=TOLEADER&d