RE: Atomic updates with nested documents

2020-06-06 Thread Kaminski, Adi
Hi Ludger, We had the same issue with Solr 7.6, and after discussing with the community we've found out that this partial update of parent document without "harm" parent-child association can work only on Solr 8.1 or higher, and It also requires some prerequisites. See the below item and it's la

Prefix for IndexBasedSpellChecker in Solr

2020-06-06 Thread Amrit Sarkar
Hi Solr folks, been a while. I was experimenting with Spell Checkers and adopted IndexBasedSpellChecker. It works well, except there is no way to configure *minimum** prefix* like DirectSolrSpellChecker (it obviously doesn't need any auxiliary index). I looked at the implementation and understoo

Re: Limit Solr Disk IO

2020-06-06 Thread Erick Erickson
New segments are created when 1> the RAMBufferSizeMB is exceeded or 2> a commit happens. The maximum segment size defaults to 5G, but TieredMergePolicy can be configured in solrconfig.xml to have larger max sizes by setting maxMergedSegmentMB Depending on your indexing rate, requiring commits e

Re: Limit Solr Disk IO

2020-06-06 Thread Anshuman Singh
Hi Eric, We are looking into TLOG/PULL replicas. But I have some doubts regarding segments. Can you explain what causes creation of a new segment and how large it can grow? And this is my index config: maxMergeAtOnce - 20 segmentsPerTier - 20 ramBufferSizeMB - 512 MB Can I configure these setting

Re: SolrClient.query take a 'collection' argument

2020-06-06 Thread Jim Anderson
Erick, Thanks for the clarification on the JVM heap space. I will invoke java as you advise. The program that I am writing is a java example that I took off the internet. The intent of the example is to read an existing core stored in solr. I created the core using instructions that I found in a

Re: SolrClient.query take a 'collection' argument

2020-06-06 Thread Erick Erickson
I’m not talking about how much memory your machine has, the critical bit it’s how much heap space is allocated to the JVM to run your app. You can increase it by specifying -Xmx2G say when you invoke Java. The version difference is suspicious indeed. I’m a little confused here. Exactly _what_

Re: SolrClient.query take a 'collection' argument

2020-06-06 Thread Jim Anderson
Erick, Thanks for the suggestion. I will keep it in the back of my mind for now. My PC has 8 G-bytes of memory and has roughly 4 G-bytes in use. If the forefront, I'm looking at the recommended solr/nutch combinations. I'm using Solr 8.5.1 with nutch 1.16. The recommendation is to use nutch 1.17

Re: SolrClient.query take a 'collection' argument

2020-06-06 Thread Erick Erickson
I’d look for an OutOfMemory problem before going too much farther. The simplest way to see if that’s in the right direction would be to run your SolrJ program with a massive memory size. Perhaps monitor your program with jconsole or similar to see if there’s any clues about memory usage. OOMs lead

Re: SolrClient.query take a 'collection' argument

2020-06-06 Thread Jim Anderson
Shawn, Thanks for the explanation. Very good response. The first paragraph helped clarify what a collection is. I have read quite about about Solr. There is so much to absorb that it is slowly sinking in. Your 2nd paragraph definitely answered my question, i.e. passing a core name should be ok wh

Re: Faster Vector Highlight

2020-06-06 Thread Yasufumi Mizoguchi
Hi, Kaya. How about using hl.maxAnalyzedChars parameter ? Thanks, Yasufumi > 2020/06/06 午後5:56、Kayak28 のメール: > > Hello, Solr Community: > > I have a question about FasterVectorHighlight. > I know Solr highlight does not return highlighted text if the text in the > highlighted field is too lo

Faster Vector Highlight

2020-06-06 Thread Kayak28
Hello, Solr Community: I have a question about FasterVectorHighlight. I know Solr highlight does not return highlighted text if the text in the highlighted field is too long. What is the good way to treat long text highlights? -- Sincerely, Kaya github: https://github.com/28kayak