I am posting json using curl. On Wed, Aug 17, 2016 at 4:41 AM, Alexandre Rafalovitch <arafa...@gmail.com> wrote:
> What format are those documents? Solr XML? Custom JSON? > > Or are you sending PDF/binary documents to Solr's extract handler and > asking it to do the extraction of the useful stuff? If later, you > could take that step out of Solr with a custom client using Tika (what > Solr has under the hood) and only send to Solr the processed output. > > Regards, > Alex. > ---- > Newsletter and resources for Solr beginners and intermediates: > http://www.solr-start.com/ > > > On 16 August 2016 at 22:49, kshitij tyagi <kshitij.shopcl...@gmail.com> > wrote: > > 400kb is size of single document and i am sending 100 documents per > request. > > solr heap size is 16gb and running on multithread. > > > > On Tue, Aug 16, 2016 at 5:10 PM, Emir Arnautovic < > > emir.arnauto...@sematext.com> wrote: > > > >> Hi, > >> > >> 400KB/doc * 100doc = 40MB. If you are running it single threaded, Solr > >> will be idle while accepting relatively large request. Or is 400KB 100 > doc > >> bulk that you are sending? > >> > >> What is Solr's heap size? I would try increasing number of threads and > >> monitor Solr's heap/CPU/IO to see where is the bottleneck. > >> > >> How complex is fields' analysis? > >> > >> Regards, > >> Emir > >> > >> > >> On 16.08.2016 13:25, kshitij tyagi wrote: > >> > >>> hi, > >>> > >>> we are sending about 100 documents per request for indexing? we have > >>> autocmmit set to false and commit only when 10000 documents are > >>> present.solr and the machine sending request are in same pool. > >>> > >>> > >>> > >>> On Tue, Aug 16, 2016 at 4:51 PM, Emir Arnautovic < > >>> emir.arnauto...@sematext.com> wrote: > >>> > >>> Hi, > >>>> > >>>> Do you send one doc per request? How frequently do you commit? Where > is > >>>> Solr running? What is network connection between your machine and > Solr? > >>>> What are JVM settings? Is 10-30s for entire indexing or single doc? > >>>> > >>>> Regards, > >>>> Emir > >>>> > >>>> > >>>> On 16.08.2016 11:34, kshitij tyagi wrote: > >>>> > >>>> Hi alexandre, > >>>>> > >>>>> 1 document of 400kb size is taking approx 10-30 sec and this is > >>>>> varying. I > >>>>> am posting document using curl > >>>>> > >>>>> On Tue, Aug 16, 2016 at 2:11 PM, Alexandre Rafalovitch < > >>>>> arafa...@gmail.com> > >>>>> wrote: > >>>>> > >>>>> How many records is that and what is 'slow'? Also is this standalone > or > >>>>> > >>>>>> cluster setup? > >>>>>> > >>>>>> On 16 Aug 2016 6:33 PM, "kshitij tyagi" < > kshitij.shopcl...@gmail.com> > >>>>>> wrote: > >>>>>> > >>>>>> Hi, > >>>>>> > >>>>>>> I am indexing a lot of data about 8GB, but it is taking a lot of > >>>>>>> time. I > >>>>>>> have read about maxBufferedDocs, ramBufferSizeMB, merge policy > ,etc in > >>>>>>> solrconfig file. > >>>>>>> > >>>>>>> It would be helpful if someone could help me out tune the segtting > for > >>>>>>> faster indexing speeds. > >>>>>>> > >>>>>>> *I have read the docs but not able to get what exactly means > changing > >>>>>>> > >>>>>>> these > >>>>>> > >>>>>> configs.* > >>>>>>> > >>>>>>> > >>>>>>> *Regards,* > >>>>>>> *Kshitij* > >>>>>>> > >>>>>>> > >>>>>>> -- > >>>> Monitoring * Alerting * Anomaly Detection * Centralized Log Management > >>>> Solr & Elasticsearch Support * http://sematext.com/ > >>>> > >>>> > >>>> > >> -- > >> Monitoring * Alerting * Anomaly Detection * Centralized Log Management > >> Solr & Elasticsearch Support * http://sematext.com/ > >> > >> >