Re: deploy solr on cloud providers

2016-07-05 Thread Steven Bower
Looking deeper into zookeeper as truth mode I was wrong about existing replicas being recreated once storage is gone.. Seems there is intent for the type of behavior based upon existing tickets.. We'll look at creating a patch for this too.. Steve On Tue, Jul 5, 2016 at 6:00 PM Tomás Fernández Löb

Re: Getting a hit on "the}" but not on "the" or "}"

2016-07-05 Thread Erick Erickson
Either that's a typo or your problem is it should be terms.fl, not terms.f1 (lower case ell as opposed to the number one). You should be seeing the raw terms in your index with TermsComponent, similar to the "load terms" in the schema browser except it allows you to query specific terms starting wi

Re: Getting a hit on "the}" but not on "the" or "}"

2016-07-05 Thread Steven White
Hi Erick, By TermsCoponent, I think you meant me to try the following? http://vottopg15.ottawa.ibm.com:8983/solr/testdata/terms?terms.f1=ALL_FIELDS&terms.prefix=the If so, I tried it and I'm getting 0 hits: 0 0 In fact, I'm getting 0 hits on anything I pass to

Re: Getting a hit on "the}" but not on "the" or "}"

2016-07-05 Thread Steven White
Thanks for the quick reply Erick. Here is the analyzer I'm using: If in fact it is my analyzer, what part of it is causing this? If not, I'm not clear about the "TermsComponent" that you suggested having me look into. How do

Re: Getting a hit on "the}" but not on "the" or "}"

2016-07-05 Thread Erick Erickson
My guess is that your field analysis isn't stripping the various non alpha-num characters, thus "the]" is actually a token in your index, square bracket and all. If that's true, it certainly doesn't match the stopword "the". You can check by using the TermsComponent, pointing it at your field and

Re: deploy solr on cloud providers

2016-07-05 Thread Tomás Fernández Löbbe
The leader will do the replication before responding to the client, so lets say the leader gets to update it's local copy, but it's terminated before sending the request to the replicas, the client should get either an HTTP 500 or no http response. From the client code you can take action (log, ret

Getting a hit on "the}" but not on "the" or "}"

2016-07-05 Thread Steven White
HI Everyone, I'm trying to understand why I get a hit when I search for "the}" but not when I search for "the" (searches are done without the quotes and "the" is a stopword in my case). Here is the debugQuery output using "the}": "debug": { "rawquerystring": "the}", "querystring": "the}

ImplicitSnitch Documentation for querying Multi-DataCenter replicas using preferredNodes

2016-07-05 Thread Susheel Kumar
Hello, Can someone help me to clarify and document how to use ImplicitSnitch preferredNodes rule to implement scenario where search queries executed from data center DC1 client, uses all dc1 replica's and data center DC2 client, uses all dc2 replica's. The only source I see is the discussion from

simple setup help (UNCLASSIFIED)

2016-07-05 Thread Musshorn, Kris T CTR USARMY RDECOM ARL (US)
CLASSIFICATION: UNCLASSIFIED Can someone walk a noob through setting up a dataimport handler? I need to index a coldfusion website Thanks, Kris ~~ Kris T. Musshorn FileMaker Developer - Contractor – Catapult Technology Inc. US Army Research Lab Aberdeen Proving Gro

Solr more like this

2016-07-05 Thread sara hajili
Hi I indexed pdf files yo solr.and now I wanna to know is there any way to uplaod a pdf file and solr return related pdf in result? I mean I don't want to index pdf file (the file that I wanna to get pdf more like this for this pdf).and just upload pdf file and get mlt result.can I do this??

Re: deploy solr on cloud providers

2016-07-05 Thread Lorenzo Fundaró
@Tomas and @Steven I am a bit skeptical about this two statements: If a node just disappears you should be fine in terms of data > availability, since Solr in "SolrCloud" replicates the data as it comes it > (before sending the http response) and > > You shouldn't "need" to move the storage as

RE: deploy solr on cloud providers

2016-07-05 Thread Davis, Daniel (NIH/NLM) [C]
Lorenzo, this probably comes late, but my systems guys just don't want to give me real disk. Although RAID-5 or LVM on-top of JBOD may be better than Amazon EBS, Amazon EBS is still much closer to real disk in terms of IOPS and latency than NFS ;)I even ran a mini test (not an official ben

RE: Access Solr via Apache's mod_proxy_balancer or mod_jk (AJP)

2016-07-05 Thread Davis, Daniel (NIH/NLM) [C]
Because access to Solr is typically to an API, rather than to webapps having images and static files that can be served directly, I think you can use mod_proxy_http just as well as mod_jk. I would suggest you not pursue trying to get AJP to work. mod_proxy_balancer will work with mod_proxy_ht

Re: deploy solr on cloud providers

2016-07-05 Thread Tomás Fernández Löbbe
I think there are two parts to this question: * If a node just disappears you should be fine in terms of data availability, since Solr in "SolrCloud" replicates the data as it comes it (before sending the http response). Even if the leader disappears and never comes back as long as you have one rep

Re: Can we directly move the solr instance running as standalone to SOLR CLOUD?

2016-07-05 Thread Erick Erickson
Solr/Lucene indexes are identical between SolrCloud and stand-alone, so yes you can move them around. I'm assuming your existing stand-alone is NOT sharded. 1> bring up a new SolrCloud collection with a single replica (i.e. replicationFactor=1). 2a> use the replication API fetchindex (https://cwik

Re: deploy solr on cloud providers

2016-07-05 Thread Steven Bower
You shouldn't "need" to move the storage as SolrCloud will replicate all data to the new node and anything in the transaction log will already be distributed through the rest of the machines.. One option to keep all your data attached to nodes might be to use Amazon EFS (pretty new) to store your

Re: stateless solr ?

2016-07-05 Thread Steven Bower
The ticket in question is https://issues.apache.org/jira/browse/SOLR-9265 We are working on a patch now... will update when we have a working patch / tests.. Shawn is correct that when adding a new node to a SolrCloud cluster it will not automatically add replicas/etc.. The idea behind this patc

Re: stateless solr ?

2016-07-05 Thread Lorenzo Fundaró
Yes, as a matter of fact I ended up doing all those steps you described. When a new node joins I check if a shard is missing a replica, if it does then I add a live node not being used by other shards using the ADDREPLICA call. Same thing applies for when I found down nodes, I just delete them usin

Re: deploy solr on cloud providers

2016-07-05 Thread Lorenzo Fundaró
On 5 July 2016 at 15:55, Shawn Heisey wrote: > On 7/5/2016 1:19 AM, Lorenzo Fundaró wrote: > > Hi Shawn. Actually what im trying to find out is whether this is the best > > approach for deploying solr in the cloud. I believe solrcloud solves a > lot > > of problems in terms of High Availability b

Grouping performance with MLT

2016-07-05 Thread Robert Brown
Hi All, I have an index with 10m documents. When performing an MLT query and grouping by a field, response times are roughly 20s. The group field is currently populated with unique values, as we now start to manually group documents (hence using MLT). The group field has docValues turned o

Re: deploy solr on cloud providers

2016-07-05 Thread Shawn Heisey
On 7/5/2016 1:19 AM, Lorenzo Fundaró wrote: > Hi Shawn. Actually what im trying to find out is whether this is the best > approach for deploying solr in the cloud. I believe solrcloud solves a lot > of problems in terms of High Availability but when it comes to storage > there seems to be a limitat

Re: stateless solr ?

2016-07-05 Thread Shawn Heisey
On 7/4/2016 7:46 AM, Lorenzo Fundaró wrote: > I am trying to run Solr on my infrastructure using docker containers > and Mesos. My problem is that I don't have a shared filesystem. I have > a cluster of 3 shards and 3 replicas (9 nodes in total) so if I > distribute well my nodes I always have 2 fa

Re: solr shutdown error

2016-07-05 Thread Shawn Heisey
On 7/2/2016 2:26 AM, Kent Mu wrote: > I do understand your explaination, but another question accurs, When I > create CloudSolrServer Connection by singleton pattern like this, and > I call shutdown, but it's OK. the solr version is 4.9.0. that's why? If SolrJ 4.9.0 allows you to call shutdown on

Re: Spell check suggestions because of case

2016-07-05 Thread Kalpana
Hello Thanks for your reply. When I search for Diabetes I get the results back and also suggestion for diabetes - which is just a lower case version of Diabetes. This is true for any search words. When I type in Health - I get results back and also suggestion for health. I have applied the necess

RE: Memory issues when indexing

2016-07-05 Thread Mads Tomasgård Bjørgan
Another update: After creating a new certificate, properly specified for its use of context, do we still end up in the described situation. Thus, it seems SSL itself is the underlying reason for the leak - -Original Message- From: Mads Tomasgård Bjørgan [mailto:m...@dips.no] Sent: tir

Re: CDCR (Solr6.x) does not start

2016-07-05 Thread Renaud Delbru
Hi Uwe, At first look, your configuration seems correct, see my comments below. On 28/06/16 15:36, Uwe Reh wrote: 9. Start CDCR http://SOURCE:s_port/solr/scoll/cdcr?action=start&wt=json {"responseHeader":{"status":0,"QTime":13},"status":["process","started","buffer","enabled"]} ! (not even a

Re: Solr6 CDCR issue with a 3 cloud design

2016-07-05 Thread Renaud Delbru
Hi Dmitry, On 28/06/16 13:19, dmitry.medve...@barclays.com wrote: No ERRORS and queue size is equal to 0. Should I extend the logging lever to Max maybe? Currently it's default. How can I know, if a commit operation has been sent to the 2 target clusters after the replication? What command sho

Re: Suggester Issue

2016-07-05 Thread Rajesh Kapur
Hi, Thanks for the reply. Yes I am getting same result with and without suggest.cfq parameter. Thanks, Rajesh Kapur On Tue, Jul 5, 2016 at 2:08 PM, Alessandro Benedetti wrote: > Hi Rajesh, > I will take a look to your config as soon as I have 5 minutes, in the > meantime, can you specify what

Re: Suggester Issue

2016-07-05 Thread Alessandro Benedetti
Hi Rajesh, I will take a look to your config as soon as I have 5 minutes, in the meantime, can you specify what is the output you get and what is the output you expect ? You simply don't get the filter applied ? Cheers On Tue, Jul 5, 2016 at 5:10 AM, Rajesh Kapur wrote: > Hi, > > > > I tried to

RE: Memory issues when indexing

2016-07-05 Thread Mads Tomasgård Bjørgan
Hi again, We turned off SSL - and now everything works as normal. The certificate is not originally meant for being used on the current servers- but we would like to keep it as the certificate has been deployed already and used by our customers. Thus we need to launch the cloud with "-Dsolr.ssl

RE: Can we directly move the solr instance running as standalone to SOLR CLOUD?

2016-07-05 Thread Preeti Bhat
HI, Can some one guide me on this? I have a standalone SOLR instance running as a service in Windows, I would like to move towards the SOLR CLOUD, as it provides various benefits like DR, scalability etc. I would like to use the same instance or core if possible from the standalone instance to

Memory issues when indexing

2016-07-05 Thread Mads Tomasgård Bjørgan
Hello, We're struggling with memory-issues when posting documents to Solr - and unsure for which reason the problem occurs. The documents are indexed in a SolrCloud running Solr 6.1.0 on top of Zookeeper 3.4.8, utilizing three VMs running CentOS 7 and JRE 1.8.0. After various attempts with diff

Re: stateless solr ?

2016-07-05 Thread Lorenzo Fundaró
Hi guys. Thank you for your answers. @Upayavira I'm interested on your presentation. @steven im also interested on this patch you're working on. I actuallly believe that a solution is can be something like a supervisor or sidekick that knows when the containers die and provisions the new ones with

Re: deploy solr on cloud providers

2016-07-05 Thread Lorenzo Fundaró
Hi Shawn. Actually what im trying to find out is whether this is the best approach for deploying solr in the cloud. I believe solrcloud solves a lot of problems in terms of High Availability but when it comes to storage there seems to be a limitation that can be workaround of course but it's a bit