R: SolrCloud become unresponsive after huge pivot facet query

2019-01-31 Thread Matteo Diarena
Hi Erick, first of all thanks a lot for your response! I suppose that in my case is happening exactly what you describe as "GC Hell" because I see continuous GC cycles and solr is not showing OOM errors. I absolutely agree with you that this is a bad query but I was wondering if there is any s

Re: by: java.util.zip.DataFormatException: invalid distance too far back reported by Solr API

2019-01-31 Thread Jan Høydahl
Hi This is Apache Tika that cannot parse a zip file or possibly a zip formatted office file. You have to post the full stack trace (which you'll find in the solr.log on server side) if you want help in locating the source of the issue, you may be able to configure Tika Have you tried to specif

Alternative for DIH

2019-01-31 Thread Srinivas Kashyap
Hello, As we all know DIH is single threaded and has it's own issues while indexing. Got to know that we can write our own API's to pull data from DB and push it into solr. One such I heard was Apache Kafka being used for the purpose. Can any of you send me the links and guides to use apache ka

Re: Alternative for DIH

2019-01-31 Thread Jörn Franke
I recommend to look at the underlying problem that you try to solve. Writing an own loader requires thorough technical design (eg recoverability in case of errors, stoping in case user requested it, proper multithreading without overloading the cluster etc) - I have not seen many that were well

Asynchronous Calls to Backup/Restore Collections ignoring errors

2019-01-31 Thread Steffen Moldenhauer
Hi all, we are using the collection API backup and restore to transfer collections from a pre-prod to a production system. We are currently using Solr version 6.6.5 But sometimes that automated process fails and collections are not working on the production system. It seems that the asynchronou

Re: Alternative for DIH

2019-01-31 Thread Mikhail Khludnev
Hello, I did this deck some time ago. It might be useful for choosing one. https://docs.google.com/presentation/d/e/2PACX-1vQzi3QOZAwLh_t3zs1gH9EGCB2HKUgiN3WJRGHpULyA-GleCrQ41dIOINa18h_XG64BX5D_ZG6jKmXL/pub?start=false&loop=false&delayms=3000 Note, as far as I understand Lucidworks' answer to this

Re: Alternative for DIH

2019-01-31 Thread Alexandre Rafalovitch
Apache NiFi may also be something of interest: https://nifi.apache.org/ Regards, Alex. On Thu, 31 Jan 2019 at 11:15, Mikhail Khludnev wrote: > > Hello, > > I did this deck some time ago. It might be useful for choosing one. > https://docs.google.com/presentation/d/e/2PACX-1vQzi3QOZAwLh_t3zs1g

Using the terms component in Solr Cloud gives random result

2019-01-31 Thread Markus Kalkbrenner
Hi, I tried to use the terms component with the techproducts example in cloud mode and was surprised, that the results toggle for each request. The response to http://localhost:8983/solr/techproducts/terms?terms=true&terms.fl=name permanently toggles between these two results: { "responseHead

Re: SolrCloud become unresponsive after huge pivot facet query

2019-01-31 Thread Ruchir Choudhry
Hello Erick, Wanted to start working on Solr bugs, will appreciate if you or some can allocate me with some minor bugs. Warm Regards, Ruchir On Wed, Jan 30, 2019 at 8:53 AM Erick Erickson wrote: > My suggestion is "don't do that" ;). > > Ok, seriously. Conceptually what you have is an N-dim

Re: SolrCloud become unresponsive after huge pivot facet query

2019-01-31 Thread Shawn Heisey
On 1/31/2019 12:11 PM, Ruchir Choudhry wrote: Wanted to start working on Solr bugs, will appreciate if you or some can allocate me with some minor bugs. It doesn't work like that. Issues are not handed out, it's a strictly volunteer system. You'll need to find the issues you want to work on

Re: Solr Size Limitation upto 32 kb limitation

2019-01-31 Thread Walter Underwood
Solr is not a database. It won’t store arbitrary length data. Put the file content in a database and put the key in Solr. I’m dropping the CC to d...@lucene.apache.org, because this does not belong on that list. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my