Is this a bug or just something I am doing wrong?

2019-03-13 Thread wclarke
I have been running a data import of local files using Tika. It runs fine until it gets to a language it doesn't recognize, and then it stops processing and won't auto-commit. It is happening in tika-app-1.17 through 1.20 at least. It threw errors in 1.3, but kept running and did the commit. I a

Re: Help with a DIH config file

2019-03-13 Thread wclarke
Got each one working individually, but not multiples. Is it possible? Please see attached files. Thanks!!! tika-data-config-2.xml solrconfig.xml -- Sen

Re: Score field NaN in the expanded Results

2019-03-13 Thread Raveendra Yerraguntla
Found the issue. I am using Solr version 5.4.0. In ExpandComponent, the TermFieldCollector is created with TotalDefaultScores and TotalDefaultMaxScores passed as false. As a result, any field sorting (other than score) causes the score to be returned as NaN. By passing either or both of

Re: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-03-13 Thread Zheng Lin Edwin Yeo
Hi Paul, Thanks for your reply. So far we have not found cases of punctuation being removed. Our aim is to reduce runs of newlines (\n) to 2, and they are not likely to have any punctuation in between. Do you know if this pattern (\n\W*){2,} that we are using is ok? Or would the ot

RE: RequestHandlerBase java.lang.NullPointerException Error

2019-03-13 Thread Rathor, Piyush (US - Philadelphia)
What Solr version? Solr 7.3.1. What configuration? SolrCloud. What is the document you are indexing and how do you send it to Solr? We are using the middleware to update the data. Also, the issue started happening yesterday. Thanks & Regards Piyush

Re: RequestHandlerBase java.lang.NullPointerException Error

2019-03-13 Thread Jan Høydahl
What Solr version? What configuration? What is the document you are indexing and how do you send it to Solr? I think this may be a known bug that is already fixed. Jan Høydahl > On 13 Mar 2019 at 17:21, Rathor, Piyush (US - Philadelphia) wrote: > > Facing following error suddenly for data up

solr search Ontology based data set

2019-03-13 Thread Jie Luo
Hi all, I have several ontology-based data sets and would like to use Solr as the search engine. A Solr document is flat, so I would like to know the best way to handle the search. Simple search is fine. One possible search will need to retrieve the ontology tree or graph. Best reg

RequestHandlerBase java.lang.NullPointerException Error

2019-03-13 Thread Rathor, Piyush (US - Philadelphia)
We are suddenly seeing the following error on data update: null:java.lang.NullPointerException at org.apache.solr.update.processor.AddSchemaFieldsUpdateProcessorFactory$AddSchemaFieldsUpdateProcessor.mapValueClassesToFieldType(AddSchemaFieldsUpdateProcessorFactory.java:509) at org.apache.solr.update.proce

Re: solr 7 optimize with Tlog/Pull replicas

2019-03-13 Thread Aroop Ganguly
Thanks Erick! Great details as always :) > On Mar 13, 2019, at 8:48 AM, Erick Erickson wrote: > > Wei: > > Right. You should count on the _entire_ index being replicated from the > leader, but only after the optimize is done. Pre 7.5, this would be a single > segment, 7.5+ it would be a bunc

Re: solr 7 optimize with Tlog/Pull replicas

2019-03-13 Thread Erick Erickson
Wei: Right. You should count on the _entire_ index being replicated from the leader, but only after the optimize is done. Pre 7.5, this would be a single segment; 7.5+ it would be a bunch of 5G files unless you specified that the optimize create some number of segments. But unless you 1> have

Re: Using solr graph to traverse N relationships

2019-03-13 Thread Pratik Patel
Problem #1 can probably be solved by using the "fetch" function (https://lucene.apache.org/solr/guide/6_6/stream-decorators.html#fetch). Problems #2 and #3 can be solved by normalizing the graph connections and by applying cartesianProduct on the multi-valued field, as described here. http://lucene.472066
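
To illustrate the normalization idea mentioned in this reply, here is a minimal sketch in plain Python. It is only an analogy for what a cartesian product over a multi-valued field does: it does not call Solr or its streaming API, and the record and field names (`id`, `links_to`) are hypothetical, not taken from the thread.

```python
from itertools import product


def cartesian_product(doc, fields):
    """Expand one record into one record per combination of values in `fields`."""
    # Treat single values as one-element lists so every field can be iterated.
    value_lists = [doc[f] if isinstance(doc[f], list) else [doc[f]] for f in fields]
    rows = []
    for combo in product(*value_lists):
        row = dict(doc)                 # copy the untouched fields
        row.update(zip(fields, combo))  # replace each list with a single value
        rows.append(row)
    return rows


# Hypothetical node whose outgoing connections live in one multi-valued field.
node = {"id": "A", "links_to": ["B", "C", "D"]}
edges = cartesian_product(node, ["links_to"])
for edge in edges:
    print(edge)
```

Each output record carries exactly one connection, which is the shape a graph traversal over single-valued from/to fields generally expects.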

Using solr graph to traverse N relationships

2019-03-13 Thread Nightingale, Jonathan A (US)
Hi, I originally posted this question on Stack Overflow, and it was suggested that I use this mailing list instead, so I'm sending it here as well. Here's my original link if you want to answer there too; I have also copied the question into the body of the email. https://stackoverflow.com/ques

Score field NaN in the expanded Results

2019-03-13 Thread Raveendra Yerraguntla
All, I am using the collapse plugin with the following query params: text_field:( apple OR apples) ) ) fq: {!collapse field=string_field sort='numeric_field asc, score desc' nullPolicy=expand} Raw params: expand=true&expand.rows=600&expand.sort=numeric_field asc The above query result
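
As a rough sketch of how the parameters quoted in this message fit together in one request, the snippet below assembles them into a URL query string with Python's standard library. The `q` value is a simplified assumption (the original query is cut off in the snippet above), and no Solr host or collection is implied.

```python
from urllib.parse import urlencode, parse_qs

params = {
    # Assumed, simplified main query; the full original is not shown in the thread snippet.
    "q": "text_field:(apple OR apples)",
    # Collapse filter as quoted in the message.
    "fq": "{!collapse field=string_field sort='numeric_field asc, score desc' nullPolicy=expand}",
    # Expand parameters as quoted in the message.
    "expand": "true",
    "expand.rows": "600",
    "expand.sort": "numeric_field asc",
}

# urlencode percent-escapes braces, quotes, and spaces so the local-params
# syntax survives transport as a single fq value.
query_string = urlencode(params)
print(query_string)

# Round-trip check: decoding gives back the original filter text.
decoded = parse_qs(query_string)
print(decoded["fq"][0])
```

Encoding the parameters programmatically avoids hand-escaping the `{!collapse ...}` local-params block, which is easy to get wrong when pasted into a URL.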

Re: Help with a DIH config file

2019-03-13 Thread wclarke
I didn't know I could do an updateProcessorChain and call it in the config file. I tried doing it in the solrconfig, but it just wouldn't take. I will try this though! Thanks. The value is the file path in id/url.

Re: Help with a DIH config file

2019-03-13 Thread wclarke
Absolutely! I attached it to the original message, but I can post it here too. I am VERY new to Solr and am winging it; while the documentation has been a little helpful, I just need more complex examples. tika-data-config-2.xml

AW: RegexReplaceProcessorFactory pattern to detect multiple \n

2019-03-13 Thread paul.dodd
Hi Edwin, With \W you will also replace non-word characters such as punctuation. If that's OK, fine. Otherwise you need to identify the white space characters that are causing the problem. From: Zheng Lin Edwin Yeo Sent: Wednesday, 13 March 2019 03:25:39 To: sol
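
The difference described in this reply can be seen in a small sketch. It uses Python's `re` module as a stand-in; Solr's processor uses Java regular expressions, whose `\W` and `\s` classes may differ in detail (for example in Unicode handling), so treat this as an illustration only. The whitespace-only pattern and the sample text are assumptions, not something proposed in the thread.

```python
import re

# Pattern quoted in the thread: a newline followed by any run of
# non-word characters, repeated two or more times.
BROAD = re.compile(r"(\n\W*){2,}")

# Hypothetical narrower alternative: only whitespace may follow each newline.
NARROW = re.compile(r"(\n\s*){2,}")


def collapse(pattern, text):
    """Replace every match of `pattern` with exactly two newlines."""
    return pattern.sub("\n\n", text)


# Made-up sample: three newlines, a line of punctuation, then a
# newline / space / two newlines run.
sample = "first line\n\n\n...!\nsecond line\n \n\nthird line"

broad_result = collapse(BROAD, sample)
narrow_result = collapse(NARROW, sample)

print(repr(broad_result))   # the "...!" line is swallowed by \W*
print(repr(narrow_result))  # the "...!" line is kept
```

With the `\W` version the punctuation-only line disappears along with the extra newlines, which is the side effect the reply warns about; the whitespace-only version leaves it in place.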

Re: NPE deleting expired docs (SOLR-13281)

2019-03-13 Thread Hendrik Haddorp
We have the same issue on Solr 7.6. On 12.03.2019 16:05, Gerald Bonfiglio wrote: Has anyone else observed NPEs attempting to have expired docs removed? I'm seeing the following exceptions: 2019-02-28 04:06:34.849 ERROR (autoExpireDocs-30-thread-1) [ ] o.a.s.u.p.DocExpirationUpdateProcessorFa

Re: solr 7 optimize with Tlog/Pull replicas

2019-03-13 Thread Aroop Ganguly
Hi Erick, a related question: is optimize then ill-advised for a bulk indexer post Solr 7.5? Especially in a situation where an index is being modified over many days? Thanks Aroop > On Mar 12, 2019, at 9:30 PM, Wei wrote: > > Thanks Erick, it's very helpful. So for bulking indexing in a