Re: Arabic words search in solr

2017-02-14 Thread mohanmca01
Hi Steve, As per your suggestion,I added ICUFoldingFilterFactory in schema.xml as below: I attached expecting result document in previous mail thread for your references. Kindly check and let me know. Thanks -- View this mess

Re: Can SOLR-5730 patch be backported to Solr 5.5.3

2017-02-14 Thread Erick Erickson
Don't know, give it a try and see? But you're in uncharted/unsupported territory so it's really chancy. Best, Erick On Tue, Feb 14, 2017 at 1:45 AM, Sahil Agarwal wrote: > Can the patch for jira issue SOLR-5730 be backported to solr 5.5.3?? ie. > Can Lucene's SortingMergePolicy and EarlyTerminat

Re: Continual garbage collection loop

2017-02-14 Thread Erick Erickson
Yeah, 512M is the default if for Java, but Solr _really_ likes memory. These two lines are "smoking guns" Max heap after conc GC: 488.7M (99.6%) Max heap after full GC: 490M (99.9%) So what's happening (I think) is that you're spending a lot of cycles recovering a very little bit of memory and th

Re: Changing "configSetBaseDir" leads to, Can't find resource 'solrconfig.xml' in classpath error while creating core

2017-02-14 Thread Erick Erickson
Works for me if I camel-case configset, i.e. configSet. Is this a misunderstanding on your part or are there docs with it all lowercase that we should fix? See: https://cwiki.apache.org/confluence/display/solr/Config+Sets I did get the same error you did with an all lower-case 'configset'. Best,

Changing "configSetBaseDir" leads to, Can't find resource 'solrconfig.xml' in classpath error while creating core

2017-02-14 Thread saiks
Hi All, I have a core "core1" created with custom config {solr.solr.home}/configsets/custom_config I changed configSetBaseDir to a different directory in solr.xml and copied the folders over to the new dir and deleted the old configs ${configSetBaseDir:/xxx/Desktop/changed-configset} Now, if I r

Re: Getting "Error getting file length for [segments_5]" warnings in Solr 6.4.0

2017-02-14 Thread Shawn Heisey
On 2/14/2017 9:57 AM, Peter Matthew Eichman wrote: > I am running Solr 6.4.0, and while I am attempting to index my Fedora > 4 data, I keep getting warning messages in my solr.log: "WARN > (qtp401424608-18) [ x:fedora4] o.a.s.h.a.LukeRequestHandler Error > getting file length for [segments_5]". And

Re: alerting system with Solr's Streaming Expressions

2017-02-14 Thread Susheel Kumar
Hello Joel, I took a bigger trainingSet around 200K documents (amazon reviews) and it worked out well. I verified the feature terms extracted and classify function was able to output correct probability of reviews being negative or positive. Big thanks for adding this. I wonder what you have ne

Re: NumericDocValues only supports long?

2017-02-14 Thread Tomás Fernández Löbbe
I think you should use FloatFieldSource. Solr uses Float.floatToIntBits(floatValue) when adding the DV field, so you could use Float.intBitsToFloat((int)longValue) when reading (See TrieField.createFields(...)), but FloatFieldSource is already doing that for you. On Tue, Feb 14, 2017 at 10:37 AM,

NumericDocValues only supports long?

2017-02-14 Thread Ugo Matrangolo
Hi, I have a corpus where each document contains a field of type Float. I'm trying to write a PostFilter that returns a DelegatingCollector to filter all the docs where the value of a function applied to this float value is lower than a given threshold. I can't precompute/index anything here. I

Re: Getting "Error getting file length for [segments_5]" warnings in Solr 6.4.0

2017-02-14 Thread Peter Matthew Eichman
It is not. This is happening for other segment files, not just segment_5, too. On Tue, Feb 14, 2017 at 12:04 PM, David Hastings < hastings.recurs...@gmail.com> wrote: > well, if you look at the error: > NoSuchFileException > > its looking for the segment file for generation 5, are you sure its >

Getting "Error getting file length for [segments_5]" warnings in Solr 6.4.0

2017-02-14 Thread Peter Matthew Eichman
Hello all, I am running Solr 6.4.0, and while I am attempting to index my Fedora 4 data, I keep getting warning messages in my solr.log: "WARN (qtp401424608-18) [ x:fedora4] o.a.s.h.a.LukeRequestHandler Error getting file length for [segments_5]". And after that, the indexing stops, and the cor

Re: Getting "Error getting file length for [segments_5]" warnings in Solr 6.4.0

2017-02-14 Thread David Hastings
well, if you look at the error: NoSuchFileException its looking for the segment file for generation 5, are you sure its present? On Tue, Feb 14, 2017 at 11:57 AM, Peter Matthew Eichman wrote: > Hello all, > > I am running Solr 6.4.0, and while I am attempting to index my Fedora 4 > data, I keep

Re: Solr6.3.0 SolrJ API for Basic Authentication

2017-02-14 Thread Bryan Bende
Hello, The exception you are getting looks more like you can't connect to the IP address from where your SolrJ code is running, but not sure. For the basic credentials, rather than trying to do something with the http client, you can provide them on the request like this: QueryRequest req = new

Re: Continual garbage collection loop

2017-02-14 Thread Leon STRINGER
> > On 14 February 2017 at 15:49 Walter Underwood > wrote: > > > Yes, 512 MB is far too small. I’m surprised it even starts. We run with 8 > Gb. > Thanks, in fairness 512 MB was the default and we're new to this. We'll look at what we're allocating to Solr to tune this. > > wund

Re: Continual garbage collection loop

2017-02-14 Thread Walter Underwood
Yes, 512 MB is far too small. I’m surprised it even starts. We run with 8 Gb. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 14, 2017, at 7:39 AM, Leon STRINGER wrote: > >> >>On 14 February 2017 at 14:44 Michael Kuhlmann wrote: >> >> >

Re: Continual garbage collection loop

2017-02-14 Thread Leon STRINGER
> > On 14 February 2017 at 14:44 Michael Kuhlmann wrote: > > > Wow, running 36 cores with only half a gigabyte of heap memory is > *really* optimistic! > > I'd raise the heap size to some gigabytes at least and see how it's > working then. > I'll try increasing the heap s

Re: Solr Search - Issue

2017-02-14 Thread Alexandre Rafalovitch
Where do the sentence boundaries come from? Solr would not be able to automatically identify those, but if you can feed this text as a multi-valued field with each sentence separate, you may be able to use something like SurroundQueryParser: https://cwiki.apache.org/confluence/display/solr/Other+Pa

RE: Issues with Solr Morphline reading RFC822 files

2017-02-14 Thread Anatharaman, Srinatha (Contractor)
>From the original email below lines are not indexed, These are metadata >appears before the actual email > Received: from resqmta-po-08v.sys..net ([196.114.154.167]) >by csp-imta02.westchester.pa.bo..net with bizsmtp >id EClZ1u0013cy81c01E9enp; Wed, 30 Nov 2016 14:09:38 +

Re: Arabic words search in solr

2017-02-14 Thread Steve Rowe
Hi Mohan, Did you change the order of the filters as I suggested? -- Steve eww.lucidworks.com On Tue, Feb 14, 2017 at 8:05 AM mohanmca01 wrote: > Hi Steve, > > any update on this .???.. I am waiting for your inputs.. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.c

Solr Search - Issue

2017-02-14 Thread Neeraj Kumar
Hi Team, I am new to solr and need your help. My problem statement is as below I have uploaded document in solr as below. #sb# represents sentence begining and #se# represents senetence ending. Now I want to search terms which occur in same sentence . If I search for q=text:"Federer Wimble

Re: Continual garbage collection loop

2017-02-14 Thread Michael Kuhlmann
Wow, running 36 cores with only half a gigabyte of heap memory is *really* optimistic! I'd raise the heap size to some gigabytes at least and see how it's working then. -Michael Am 14.02.2017 um 15:23 schrieb Leon STRINGER: > Further background on the environment: > > There are 36 cores, with a

Re: Continual garbage collection loop

2017-02-14 Thread Leon STRINGER
Further background on the environment: There are 36 cores, with a total size of 131 MB (based on the size reported by "Master (Searching)" in the web console). The Java memory parameters in use are: -Xms512m -Xmx512m. > > On 14 February 2017 at 05:45 Erick Erickson > wrote: > > GCView

Re: Arabic words search in solr

2017-02-14 Thread mohanmca01
Hi Steve, any update on this .???.. I am waiting for your inputs.. -- View this message in context: http://lucene.472066.n3.nabble.com/Arabic-words-search-in-solr-tp4317733p4320253.html Sent from the Solr - User mailing list archive at Nabble.com.

Solr6.3.0 SolrJ API for Basic Authentication

2017-02-14 Thread vrindavda
Hello , I am trying to connect SolrCloud using SolrJ API using following code : String zkHostString = "localhost:9983"; String USER = "solr"; String PASSWORD = "SolrRocks"; CredentialsProvider credentialsProvider = new BasicCredentialsProvider();

Re: Performance degradation after upgrading from 6.2.1 to 6.4.1

2017-02-14 Thread Ere Maijala
It might be . --Ere 14.2.2017, 11.52, Henrik Brautaset Aronsen kirjoitti: We are seeing performance degradation on our SolrCloud instances after upgrading to 6.4.1. Here are a couple of graphs. As you can see, 6.4.1 was introduced 2/10 1200:

Re: Performance degradation after upgrading from 6.2.1 to 6.4.1

2017-02-14 Thread Dorian Hoxha
Did you see the other thread ? It looked like a problem with logging. On Tue, Feb 14, 2017 at 10:52 AM, Henrik Brautaset Aronsen < henrik.aron...@gmail.com> wrote: > We are seeing performance degradation on our SolrCloud instances after > upgrading to 6.4.1. > > > Here are a couple of graphs. As

Performance degradation after upgrading from 6.2.1 to 6.4.1

2017-02-14 Thread Henrik Brautaset Aronsen
We are seeing performance degradation on our SolrCloud instances after upgrading to 6.4.1. Here are a couple of graphs. As you can see, 6.4.1 was introduced 2/10 1200: https://www.dropbox.com/s/qrc0wodain50azz/solr1.png?dl=0 https://www.dropbox.com/s/sdk30imm8jlomz2/solr2.png?dl=0 These are

Can SOLR-5730 patch be backported to Solr 5.5.3

2017-02-14 Thread Sahil Agarwal
Can the patch for jira issue SOLR-5730 be backported to solr 5.5.3?? ie. Can Lucene's SortingMergePolicy and EarlyTerminatingSortingCollector configurable in Solr 5.5.3 too?? https://issues.apache.org/jira/browse/SOLR-5730 The SortingMergePolicy and EarlyTerminatingSortCollector are both availabl

Re: Continual garbage collection loop

2017-02-14 Thread Leon STRINGER
Why is this a problem? CPU load and log file size. The CPU load reported by top increases from it's normal 0-1% resting state to 25-50% continually, and higher when searching, indexing, etc. With 6.1.0 we just got a huge GC log file. With 6.4.1 the GC log is now cycled before it grows too large,

Re: Help with design choice: join or multiValued field

2017-02-14 Thread Karl Kildén
Thanks for the reply! We did it this way and it's seems to turn out really well On 6 February 2017 at 22:57, Fuad Efendi wrote: > Correct: multivalued field with 1 shop IDs. Use case: shopping network > in U.S. for example for a big brand such as Walmart, when user implicitly > provides IP a