Re: Index size issue in SOLR-6.5.1

2018-10-07 Thread SOLR4189
Which details are you asking about? Yesterday we restarted all our Solr services, and the index size on serverX decreased from 82 GB to 60 GB, while the index size on serverY didn't change (49 GB). -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

SOLR Partial search

2018-10-07 Thread Rathor, Piyush (US - Philadelphia)
Hi all, I am trying to use “KeywordTokenizerFactory” so that special characters are taken into account in searches. But partial search does not work well with “KeywordTokenizerFactory”. The partial match results are better with “StandardTokenizerFactory”. Field type – text_general

Re: Deciding on the number of Shards and Replica

2018-10-07 Thread Shawn Heisey
On 10/7/2018 7:28 PM, Sourav Moitra wrote: I am a Solr newbie. I am trying to set up three servers running both a Zookeeper ensemble and Solr in cloud mode. Each server has 4 cores and 16 GB of RAM. To start with I have given an Xmx value of 6144M to Zookeeper and an Xmx value of 2048 to Solr. We have created 3

Re: matches missing highlight information

2018-10-07 Thread Zheng Lin Edwin Yeo
Have you tried setting hl.maxAnalyzedChars? If it is not set, the highlighter will not look past the first 51,200 characters of each document for snippets. You can check the guide for more information: https://lucene.apache.org/solr/guide/7_5/highlighting.html Regards, Edwin On Mon, 1 Oct 2018 at 20:
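
A minimal sketch of how the parameter might be passed on a select request. The query, the field name `content`, and the limit of 1,000,000 characters are illustrative assumptions, not values from the thread; only the `hl`, `hl.fl`, and `hl.maxAnalyzedChars` parameter names come from the Solr highlighting guide.

```python
from urllib.parse import urlencode


def highlight_params(query, field, max_chars):
    """Build the query string for a highlighted Solr select request.

    `query`, `field` and `max_chars` are caller-supplied; the values used
    in the example call below are hypothetical.
    """
    return urlencode({
        "q": query,
        "hl": "true",                      # turn highlighting on
        "hl.fl": field,                    # field to generate snippets from
        "hl.maxAnalyzedChars": max_chars,  # raise the 51,200-character default
    })


# Hypothetical usage: append the result to <solr-base>/<collection>/select?
print(highlight_params("content:solr", "content", 1000000))
```

Raising the limit makes the highlighter analyze more text per document, so it is probably worth setting it only as high as the longest documents require.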

Re: Boolean clauses in ComplexPhraseQuery

2018-10-07 Thread Zheng Lin Edwin Yeo
You may refer to this documentation: https://lucene.apache.org/solr/guide/7_5/other-parsers.html Regards, Edwin On Fri, 5 Oct 2018 at 15:48, Mikhail Khludnev wrote: > Why not? > > On Thu, Oct 4, 2018 at 6:52 PM Chuming Chen wrote: > > > Hi All, > > > > Does Solr supports boolean clauses inside

Deciding on the number of Shards and Replica

2018-10-07 Thread Sourav Moitra
Hello all, I am a Solr newbie. I am trying to set up three servers running both a Zookeeper ensemble and Solr in cloud mode. Each server has 4 cores and 16 GB of RAM. To start with I have given an Xmx value of 6144M to Zookeeper and an Xmx value of 2048 to Solr. We have created 3 shards and 3 replicas each. The si

Re: Apache SOLR upgrade from 5.2.1 to 7.x

2018-10-07 Thread Erick Erickson
IIUC the 5x->7x upgrade won't immediately fail like a 6x->8x upgrade would, but neither is it really expected to produce the results you expect due to differences in the index formats. On Sat, Oct 6, 2018 at 1:31 PM Jan Høydahl wrote: > > Shawn, my impression is that you WILL be able to upgrade a

SV: DIH for different levels of XML

2018-10-07 Thread Martin Frank Hansen (MHQ)
Hi Alex, Thanks for your answer. I think I made it work. The problem was actually in the schema.xml, where the field "Journalnummer" should have multiValued="true". Martin Frank Hansen Lautrupparken 40-42, DK-2750 Ballerup E-mail m...@kmd.dk Web www.kmd.dk Mobil +4525571418 -Oprindel
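
The fix described above was made by editing schema.xml directly. For a collection that uses a managed schema instead, the same change could be sketched as a Schema API `replace-field` command. The field type `string` is an assumption (the thread does not state the type of "Journalnummer"), and the helper name is hypothetical.

```python
import json


def replace_field_command(name, field_type, multi_valued=True):
    """Return the JSON body for a Schema API `replace-field` command.

    `field_type` must match the field's real type in the schema; the value
    used in the example call below is an assumption.
    """
    return json.dumps({
        "replace-field": {
            "name": name,
            "type": field_type,
            "multiValued": multi_valued,
        }
    })


# Hypothetical usage: POST this body to <solr-base>/<collection>/schema
print(replace_field_command("Journalnummer", "string"))
```

Documents indexed before the change would most likely need to be reindexed for the field to hold multiple values.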

Re: DIH for different levels of XML

2018-10-07 Thread Alexandre Rafalovitch
If your ID field comes from one XML level and your record details from another, they are processed as two separate records. Have a look at the atom example that ships with the DIH example set. Specifically, the commonField parameter may be useful for you: https://lucene.apache.org/solr/guide/7_4/uploadi

DIH for different levels of XML

2018-10-07 Thread Martin Frank Hansen (MHQ)
Hi, I am having some difficulties adding data from different levels of an XML document. The XML can be as simple as this: 2165432 5 10 The data-config file looks like this. The result is the following: { "respon

Re: Index size issue in SOLR-6.5.1

2018-10-07 Thread Dominique Bejean
Hi, What do the core segment details in the admin UI show? Are there more deleted documents on one server? Regards Dominique On Sun, 7 Oct 2018 at 08:22, SOLR4189 wrote: > Hi all, > > We use SOLR-6.5.1 and we have a very strange issue. In our collection index > size is very different from server to server (33gb
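
One rough way to compare replicas on this point is to compute the share of deleted documents from the numDocs and maxDoc values that the admin UI reports for each core. This is a sketch only; the numbers in the example call are made up and do not come from the thread.

```python
def deleted_fraction(num_docs, max_doc):
    """Return the fraction of documents in a core that are marked deleted.

    num_docs counts live documents; max_doc also counts deleted documents
    that have not yet been merged away, so the difference is the number of
    deletions still occupying space in the index.
    """
    if max_doc <= 0:
        return 0.0
    return (max_doc - num_docs) / max_doc


# Hypothetical values for one core
print(deleted_fraction(num_docs=750000, max_doc=1000000))
```

A noticeably higher fraction on one server than another would be consistent with index sizes that differ for the same data and shrink after segments are merged.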

Using function in filter query

2018-10-07 Thread skanth2...@gmail.com
Hi, I need help using a custom function in a filter query. Can anyone help me get it working? Below is the problem statement. The documents have a date field stored as a long and a buffer time in milliseconds, which can vary. startTime: 153886680 bufferTime: 86400 Need to query for docs wh