Re: terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Ok, I got it, thank you very much. On Mon, Jul 16, 2018 at 6:25 PM Erick Erickson wrote: > Terms are already sorted when you use TermsComponent. So you fetch the > first 1,000 from each > field and compare... if you're starting with the same prefix for both > fields the lists should be > compara

Re: terms present within fields

2018-07-16 Thread Erick Erickson
Terms are already sorted when you use TermsComponent. So you fetch the first 1,000 from each field and compare... if you're starting with the same prefix for both fields the lists should be comparable in a straightforward manner. Best, Erick On Mon, Jul 16, 2018 at 9:10 AM, Vincenzo D'Amore wrot

Re: terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Hi Alexandre, well... you're right. Sooner or later I had to create a collection with synthetic data where run my test. Well I have SolrCloud, I'm curious, could you please suggest me an example with the streaming expression you're talking? On Mon, Jul 16, 2018 at 4:50 PM Alexandre Rafalovitch w

Re: terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Thanks Erick, at first glance I didn't understood your suggestion. But trying to sort the terms per index it make sense, absolutely make sense :))) Thanks for the suggestion, adding the prefix it very easy to implement. On Mon, Jul 16, 2018 at 4:34 PM Erick Erickson wrote: > There's no real wa

Re: terms present within fields

2018-07-16 Thread Alexandre Rafalovitch
For the test, can't you just use synthetic data where you know the terms from the start? Otherwise maybe something from streaming expressions will help, but it needs SolrCloud. Regards, Alex On Mon, Jul 16, 2018, 10:22 AM Vincenzo D'Amore, wrote: > Hi all, > > I have a question for you, So

Re: terms present within fields

2018-07-16 Thread Erick Erickson
There's no real way I know of to do what you want except to use TermsComponent. Note that you don't have to extract all of them, just advance the two lists until you find enough terms in long_title that aren't in short_title, extract, say, 1,000 terms at a time. You can also start with various pr

terms present within fields

2018-07-16 Thread Vincenzo D'Amore
Hi all, I have a question for you, Solr Gurus :) there is an index where there are two fields: short_title and long_title. As the field names suggest, this two fields are very similar, the long title has just more terms in it. So, looking at all the documents I have in the index, I would like to