Re: Problems for indexing large documents on SolrCloud

2014-09-22 Thread Olivier
Hi, First, thanks for your advice. I ran several tests and was finally able to index all the data on my SolrCloud cluster. The error was on the client side; it is documented in this post: http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201406.mbox/%3ccfc09ae1.94f8%25rebecca.t...@ucsf.edu%3E

Re: Problems for indexing large documents on SolrCloud

2014-09-10 Thread Shawn Heisey
On 9/10/2014 2:05 PM, Erick Erickson wrote:
> bq: org.apache.solr.common.SolrException: Unexpected end of input
> block; expected an identifier
>
> This is very often an indication that your packets are being
> truncated by "something in the chain". In your case, make sure
> that Tomcat is configur

Re: Problems for indexing large documents on SolrCloud

2014-09-10 Thread Erick Erickson
't rule out time problems of course. But I'd look at the Tomcat settings for maximum packet size first.
Best,
Erick
On Wed, Sep 10, 2014 at 9:11 AM, Olivier wrote:
> Hi,
>
> I have some problems for indexing large documents in a SolrCloud cluster of
> 3 servers (Solr 4.8.1)

Problems for indexing large documents on SolrCloud

2014-09-10 Thread Olivier
Hi, I am having problems indexing large documents in a SolrCloud cluster of 3 servers (Solr 4.8.1) with 3 shards and 2 replicas for each shard on Tomcat 7. One specific document (with 300K values in a multivalued field) could not be indexed on SolrCloud, but I could do it in a s

Re: Indexing large documents

2014-03-19 Thread Tom Burton-West
icine.com> wrote:
>
> > Hi Solr Users,
> >
> > I'm looking for advice on best practices when indexing large documents
> > (100's of MB or even 1 to 2 GB text files). I've been hunting around on
> > google and the mailing list, and have found some

Re: Indexing large documents

2014-03-19 Thread Alexei Martchenko
18 23:52 GMT-03:00 Stephen Kottmann < stephen_kottm...@h3biomedicine.com>:
> Hi Solr Users,
>
> I'm looking for advice on best practices when indexing large documents
> (100's of MB or even 1 to 2 GB text files). I've been hunting around on
> google and the m

Re: Indexing large documents

2014-03-18 Thread Otis Gospodnetic
Tue, Mar 18, 2014 at 10:52 PM, Stephen Kottmann < stephen_kottm...@h3biomedicine.com> wrote:
> Hi Solr Users,
>
> I'm looking for advice on best practices when indexing large documents
> (100's of MB or even 1 to 2 GB text files). I've been hunting around on
>

Indexing large documents

2014-03-18 Thread Stephen Kottmann
Hi Solr Users, I'm looking for advice on best practices when indexing large documents (100's of MB or even 1 to 2 GB text files). I've been hunting around on google and the mailing list, and have found some suggestions of splitting the logical document up into multiple solr docume
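The splitting approach mentioned in this message can be sketched roughly as follows. This is a minimal illustration, not the method any poster in the thread used: the function name `split_document`, the field names `parent_id`, `chunk_no`, and `text`, and the character-based chunk size are all assumptions made for the example, and a real schema would need matching fields.

```python
def split_document(doc_id, text, max_chars=1_000_000):
    """Split one logical document into smaller Solr-ready dicts.

    All field names here are hypothetical; adapt them to your schema.
    Chunks are cut at fixed character offsets, so a cut may fall mid-word.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    chunks = []
    for n, start in enumerate(range(0, len(text), max_chars)):
        chunks.append({
            "id": f"{doc_id}_{n}",      # unique id per chunk
            "parent_id": doc_id,        # lets results be grouped back together
            "chunk_no": n,              # preserves the original order
            "text": text[start:start + max_chars],
        })
    return chunks


if __name__ == "__main__":
    parts = split_document("book1", "abcdefghij", max_chars=4)
    print([p["id"] for p in parts])
```

Each chunk would then be posted as its own Solr document, with `parent_id` available for grouping hits back to the logical document at query time.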

Re: Is indexing large documents still an issue?

2013-05-06 Thread Bai Shen
> >> can anyone give a general idea of what field/document size solr CAN
> >> handle?
> >>
> >> thanks.
> >>
> >>
> >>
> >> --
> >> View this message in context:
> >> > http://lucene.472066.n3.nabble.com/Is-in

Re: Is indexing large documents still an issue?

2013-05-02 Thread adfel70
document size solr CAN
>> handle?
>>
>> thanks.
>>
>>
>>
>> --
>> View this message in context:
>> http://lucene.472066.n3.nabble.com/Is-indexing-large-documents-still-an-issue-tp4060425.html
>> Sent from the Solr - User mailing list archive

Re: Is indexing large documents still an issue?

2013-05-02 Thread Bai Shen
a of what field/document size solr CAN handle?
>
> thanks.
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Is-indexing-large-documents-still-an-issue-tp4060425.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Is indexing large documents still an issue?

2013-05-02 Thread adfel70
hanks.
--
View this message in context: http://lucene.472066.n3.nabble.com/Is-indexing-large-documents-still-an-issue-tp4060425.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Indexing large documents

2007-08-20 Thread Fouad Mardini
Thanks, I reindexed the documents and now it works; it seems there was an issue with text extraction. I also changed the maxFieldLength, and it must have helped. Thanks
On 8/20/07, Pieter Berkel <[EMAIL PROTECTED]> wrote:
>
> You will probably need to increase the value of maxFieldLength in your
>

Re: Indexing large documents

2007-08-20 Thread Pieter Berkel
You will probably need to increase the value of maxFieldLength in your solrconfig.xml. The default value is 10000, which might explain why your documents are not being completely indexed. Piete
On 20/08/07, Peter Manis <[EMAIL PROTECTED]> wrote:
>
> The that should show some errors if something
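To see why a per-field term limit such as maxFieldLength can make a word unsearchable, here is a toy simulation. It does not call Solr or Lucene; the function name `indexed_terms` and the whitespace tokenizer are assumptions for illustration only. The point it sketches is that terms past the limit are dropped silently, so a search for them finds nothing.

```python
def indexed_terms(text, max_field_length):
    """Return the set of terms that survive a per-field term limit.

    Toy model only: lowercases, splits on whitespace, and keeps the
    first max_field_length tokens, discarding the rest without error.
    """
    tokens = text.lower().split()
    return set(tokens[:max_field_length])


if __name__ == "__main__":
    doc = "alpha beta gamma foobar"
    # With a limit of 3 terms, "foobar" (the 4th term) is never indexed.
    print("foobar" in indexed_terms(doc, 3))
    # With a larger limit, the same document matches.
    print("foobar" in indexed_terms(doc, 10))
```

In this model, raising the limit and reindexing is what makes the late term searchable again, which matches the fix reported in this thread.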

Re: Indexing large documents

2007-08-20 Thread Peter Manis
The that should show some errors if something goes wrong; if not, the console usually will. The errors will look like Java stack trace output. Did increasing the heap do anything for you? Changing mine to 256 MB max worked fine for all of our files.
On 8/20/07, Fouad Mardini <[EMAIL PROTECTED]>

Re: Indexing large documents

2007-08-20 Thread Fouad Mardini
Well, I am using the Java textmining library to extract text from documents, then I do a POST to Solr. I do not have an error log; I only have *.request.log files in the logs directory. Thanks
On 8/20/07, Peter Manis <[EMAIL PROTECTED]> wrote:
>
> Fouad,
>
> I would check the error log or console f

Re: Indexing large documents

2007-08-20 Thread Peter Manis
Fouad, I would check the error log or console for any possible errors first. They may not show up; it really depends on how you are processing the Word document (custom Solr, feeding the text to it, etc.). We are using a custom version of Solr with PDF, DOC, XLS, etc. text extraction and I have suc

RE: Indexing large documents

2007-08-20 Thread praveen jain
@lucene.apache.org
Subject: Indexing large documents
Hello, I am using Solr to index text extracted from Word documents, and it is working really well. Recently I started noticing that some documents are not indexed; that is, I know that the word foobar is in a document, but when I search for foobar the id

Indexing large documents

2007-08-20 Thread Fouad Mardini
Hello, I am using Solr to index text extracted from Word documents, and it is working really well. Recently I started noticing that some documents are not indexed; that is, I know that the word foobar is in a document, but when I search for foobar the id of that document is not returned. I suspect