Re: SOLR 7.1 Top level element for similarity factory

2018-07-16 Thread Chris Hostetter
: So I have the following at the bottom of my schema.xml file : : : : : : The documentation says "top level element" - so should that actually be outside the schema tag? No, the schema tag is the "root" level element, it's direct children are the "top level elements" (the wording m

SOLR 7.1 Top level element for similarity factory

2018-07-16 Thread Hodder, Rick
I'm using SOLR 7.1 and I'm trying to set the similarity factory back to ClassicSimilarityFactory so that it will behave like SOLR 6 or before. In the document https://lucene.apache.org/solr/guide/7_1/other-schema-elements.html#similarity It says This default behavior can be overridden by decla

Re: Preferred PHP Client Library

2018-07-16 Thread John Blythe
We have envious using Solarium On Mon, Jul 16, 2018 at 14:19 Zimmermann, Thomas wrote: > Hi, > > We're in the midst of our first major Solr upgrade in years and are trying > to run some cleanup across all of our client codebases. We're currently > using the standard PHP Solr Extension when commu

RE: 7.3 appears to leak

2018-07-16 Thread Markus Jelsma
Hello Thomas, To be absolutely sure you suffer from the same problem as one of our collections, can you confirm that your Solr cores are leaking a SolrIndexSearcher instance on each commit? If not, there may be a second problem. Also, do you run any custom plugins or apply patches to your Solr

Preferred PHP Client Library

2018-07-16 Thread Zimmermann, Thomas
Hi, We're in the midst of our first major Solr upgrade in years and are trying to run some cleanup across all of our client codebases. We're currently using the standard PHP Solr Extension when communicating with our cluster from our Wordpress installs. http://php.net/manual/en/book.solr.php F

Re: Hardware-Aware Solr Coud Sharding?

2018-07-16 Thread Michael Braun
Ended up working well with nodeset EMPTY and placing all replicas manually. Thank you all for the assistance! On Thu, Jun 14, 2018 at 9:28 AM, Jan Høydahl wrote: > You could also look into the Autoscaling stuff in 7.x which can be > programmed to move shards around based on system load and HW sp

Re: SolrCloud and Kubernetes

2018-07-16 Thread ssivashunm
Hi Vincenzo, I used the repo, but encountering following hurdles and trying to solve them. I increased the replicas to 3 for both solr and zookeeper. I dont want to expose the nodeport directly for inter communication hence created a headless service and used FQDN of the solr-ss-0 pod for the

Re: Solr cloud in kubernetes

2018-07-16 Thread Paweł Ruciński
Hi, I am trying to achieve same, to host Solr on k8s. For now, I successfully created ZK as a statefulset (3 instances) with a headless service. Apart of that created deployment objects for storing Solr pods (again 3 instances). For each solr pod I have manually created persistent volume. Now I am

Re: terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Ok, I got it, thank you very much. On Mon, Jul 16, 2018 at 6:25 PM Erick Erickson wrote: > Terms are already sorted when you use TermsComponent. So you fetch the > first 1,000 from each > field and compare... if you're starting with the same prefix for both > fields the lists should be > compara

Re: terms present within fields

2018-07-16 Thread Erick Erickson
Terms are already sorted when you use TermsComponent. So you fetch the first 1,000 from each field and compare... if you're starting with the same prefix for both fields the lists should be comparable in a straightforward manner. Best, Erick On Mon, Jul 16, 2018 at 9:10 AM, Vincenzo D'Amore wrot

Re: terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Hi Alexandre, well... you're right. Sooner or later I had to create a collection with synthetic data where run my test. Well I have SolrCloud, I'm curious, could you please suggest me an example with the streaming expression you're talking? On Mon, Jul 16, 2018 at 4:50 PM Alexandre Rafalovitch w

Re: terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Thanks Erick, at first glance I didn't understood your suggestion. But trying to sort the terms per index it make sense, absolutely make sense :))) Thanks for the suggestion, adding the prefix it very easy to implement. On Mon, Jul 16, 2018 at 4:34 PM Erick Erickson wrote: > There's no real wa

Re: terms present within fields

2018-07-16 Thread Alexandre Rafalovitch
For the test, can't you just use synthetic data where you know the terms from the start? Otherwise maybe something from streaming expressions will help, but it needs SolrCloud. Regards, Alex On Mon, Jul 16, 2018, 10:22 AM Vincenzo D'Amore, wrote: > Hi all, > > I have a question for you, So

Re: terms present within fields

2018-07-16 Thread Erick Erickson
There's no real way I know of to do what you want except to use TermsComponent. Note that you don't have to extract all of them, just advance the two lists until you find enough terms in long_title that aren't in short_title, extract, say, 1,000 terms at a time. You can also start with various pr

terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Hi all, I have a question for you, Solr Gurus :) there is an index where there are two fields: short_title and long_title. As the field names suggest, this two fields are very similar, the long title has just more terms in it. So, looking at all the documents I have in the index, I would like to

Learning to rank

2018-07-16 Thread Akshay Patil
Hi I am student. for my master thesis I am working on the Learning To rank. As I did research on it. I found solution provided by the Bloomberg. But I would like to ask. With the example that you have provided It always shows the error of Bad Request. Do you have running example of it. So i can

Re: 7.3 appears to leak

2018-07-16 Thread Thomas Scheffler
Hi, we noticed the same problems here in a rather small setup. 40.000 metadata documents with nearly as much files that have „literal.*“ fields with it. While 7.2.1 has brought some tika issues the real problems started to appear with version 7.3.0 which are currently unresolved in 7.4.0. Memor

Re: Learning to rank - Bad Request

2018-07-16 Thread Diego Ceccarelli (BLOOMBERG/ LONDON)
Hi Akshay, did you run solr enabling learning to rank? ./bin/solr -e techproducts -Dsolr.ltr.enabled=true if you don't pass -Dsolr.ltr.enabled=true ltr will not be available. Cheers, Diego From: solr-user@lucene.apache.org At: 07/16/18 09:00:39To: solr-user@lucene.apache.org Subject: Re:

Re: Learning to rank - Bad Request

2018-07-16 Thread akshaypatil
Hi, I am using apache solr 7.4.0. I am trying to use learning to rank using the python script and related data provided by the lucene. which can be found at the Github repository of the lucene solr.