Re: user field(uf) not working with Post filter

2018-09-11 Thread Zheng Lin Edwin Yeo
Hi Shruti, Which version of Solr are you using? Maybe you can take a look at the below guide first if you haven't done so: https://lucene.apache.org/solr/guide/7_4/common-query-parameters.html#fq-filter-query-parameter Regards, Edwin On Wed, 5 Sep 2018 at 21:17, shruti suri wrote: > Hi, > > I

Re: Error casting to PointField

2018-09-11 Thread Erick Erickson
People usually just use a string field in place of longs etc.. On Tue, Sep 11, 2018 at 9:15 PM Zahra Aminolroaya wrote: > > Thanks Erick. We used to use TrieLongField for our unique id and in the > document it is said that all Trie* fieldtypes are casting to > *pointfieldtypes. What would be the a

RE: 6.x to 7.x differences

2018-09-11 Thread Preeti Bhat
Hi John, Please check the solrQueryParser option, it was removed in 7.4 version, so you will need to provide AND in solrconfig.xml or give the q.op option while querying to solve this problem. By default solr makes it an "OR" operation leading to too many results. Old Way: In Managed-schema or

Re: Error casting to PointField

2018-09-11 Thread Zahra Aminolroaya
Thanks Erick. We used to use TrieLongField for our unique id and in the document it is said that all Trie* fieldtypes are casting to *pointfieldtypes. What would be the alternative solution? Best, Zahra -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Data Import Handler with Solr Source behind Load Balancer

2018-09-11 Thread Zimmermann, Thomas
We have a Solr v7 Instance sourcing data from a Data Import Handler with a Solr data source running Solr v4. When it hits a single server in that instance directly, all documents are read and written correctly to the v7. When we hit the load balancer DNS entry, the resulting data import handler

Re: Docker and Solr Indexing

2018-09-11 Thread Shawn Heisey
On 9/11/2018 9:20 PM, solrnoobie wrote: So what we did is we upgraded the instances to 16 gigs and we rarely encounter this now. So what we did was to increase the batch size to 500 instead of 50 and it worked for our test data. But when we tried 1000 batch size, the invalid content type error r

Re: error render solr data spatial from geoserver

2018-09-11 Thread Zheng Lin Edwin Yeo
Hi, Which version of Solr are you using? And are your different shard on the same machine or different machine? Regards, Edwin On Tue, 4 Sep 2018 at 18:04, tkg_cangkul wrote: > Hi i wanna try to rendering solr data spatial from geoserver layer. > when i try to render it from single shard solr

Re: parent/child rows in solr

2018-09-11 Thread John Smith
On Tue, Sep 11, 2018 at 11:05 PM Walter Underwood wrote: > Have you tried modeling it with multivalued fields? > > That's an interesting idea, but I don't think that would work. We would lose the concept of "rows". So let's say child1 has col "a" and col "b", both are turned into multi-value fiel

Re: Docker and Solr Indexing

2018-09-11 Thread solrnoobie
Thank you all for the kind and timely reply. So what we did is we upgraded the instances to 16 gigs and we rarely encounter this now. So what we did was to increase the batch size to 500 instead of 50 and it worked for our test data. But when we tried 1000 batch size, the invalid content type err

Re: parent/child rows in solr

2018-09-11 Thread John Smith
On Tue, Sep 11, 2018 at 11:00 PM Shawn Heisey wrote: > On 9/11/2018 8:35 PM, John Smith wrote: > > The problem is that the math isn't a simple case of adding up all the row > > counts. These are "left outer join"s. In sql, it would be this query: > > I think we'll just have to conclude that I do

Re: parent/child rows in solr

2018-09-11 Thread Walter Underwood
Have you tried modeling it with multivalued fields? Also, why do you think Solr is a good solution? What is the problem? wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Sep 11, 2018, at 7:35 PM, John Smith wrote: > > On Tue, Sep 11, 2018 at 9:32 PM

Re: parent/child rows in solr

2018-09-11 Thread Shawn Heisey
On 9/11/2018 8:35 PM, John Smith wrote: The problem is that the math isn't a simple case of adding up all the row counts. These are "left outer join"s. In sql, it would be this query: I think we'll just have to conclude that I do not understand what you are doing.  I have no idea what "left ou

Re: LIBLINEAR model lacks weight(s) when training for SolrFeatures in LTR

2018-09-11 Thread Zheng Lin Edwin Yeo
I have found that it is due to insufficient training data that are related to that feature. After I add more entries that are related to that features to the training data, the issues did not occur. Regards, Edwin On Tue, 28 Aug 2018 at 15:56, Zheng Lin Edwin Yeo wrote: > Hi, > > I am using Sol

Implementing NeuralNetworkModel RankNet in Solr LTR

2018-09-11 Thread Zheng Lin Edwin Yeo
Hi, I am working on to implementing Solr LTR in Solr 7.4.0 by using the NeuralNetworkModel for the feature selection and model training, and I have found this site which uses RankNet: https://github.com/airalcorn2/Solr-LTR#RankNet Has anyone tried on this before? And what is the format of the tra

Re: parent/child rows in solr

2018-09-11 Thread John Smith
On Tue, Sep 11, 2018 at 9:32 PM Shawn Heisey wrote: > On 9/11/2018 7:07 PM, John Smith wrote: > > header: 223,580 > > > > child1: 124,978 > > child2: 254,045 > > child3: 127,917 > > child4:1,009,030 > > child5: 225,311 > > child6: 381,561 > > child7: 438,315

6.x to 7.x differences

2018-09-11 Thread John Blythe
hi, all. we recently migrated to cloud. part of that migration jumped us from 6.1 to 7.4. one example query between our old solr instance and our new cloud instance produces 42 results and 19k results. the analyzer is the same aside from WordDelimiterFilterFactory moving over to the graph variat

Re: parent/child rows in solr

2018-09-11 Thread Shawn Heisey
On 9/11/2018 7:07 PM, John Smith wrote: header: 223,580 child1: 124,978 child2: 254,045 child3: 127,917 child4:1,009,030 child5: 225,311 child6: 381,561 child7: 438,315 child8: 18,850 Trying to index that into solr with a flatfile schema, blows up i

Re: parent/child rows in solr

2018-09-11 Thread John Smith
> > On 9/7/2018 7:44 PM, John Smith wrote: > > Thanks Shawn, for your comments. The reason why I don't want to go flat > > file structure, is due to all the wasted/duplicated data. If a department > > has 100 employees, then it's very wasteful in terms of disk space to > repeat > > the header data

Re: any way to post json document to a MoreLikeThisHandler?

2018-09-11 Thread Alexandre Rafalovitch
Hmm. I guess the issue is that the handler is the one doing parsing, so the input document can be in XML or JSON or CSV. And MLT as a handler is then a competing end point. So you actually want to use it later in a pipeline but with a document constructed on the fly and not stored. This may not

Re: any way to post json document to a MoreLikeThisHandler?

2018-09-11 Thread Matt Work Coarr
Thanks Alex. Yes, I've been using the MoreLikeThisHandler, but that takes a block of text as input posted to the request, not the structured json that corresponds to the fields. On Tue, Sep 11, 2018 at 10:14 AM Alexandre Rafalovitch wrote: > There are three ways to trigger MLT: > https://lucene

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Erick Erickson
OK, you just don't want to group as Shawn says. The group.main=true just flattens the list but still returns the docs in groups does all the work of grouping. So instead of getting value=1 [ doc31 doc64 ] value=2 [ doc 98 doc 6 ] you get doc31 doc64 doc98 doc6 Best, Erick On Tue,

Re: Solr RSIZE memory overusage

2018-09-11 Thread Erick Erickson
bq. We're using NRTCachingDirectoryFactory Which uses MMapDirectory under the covers. The file handle counts will vary. During merging, files are held open and while segments are merged so new and old segments are open. Once merged, the files in the old segment will be deleted so some variance is

Re: Update partial document

2018-09-11 Thread Vincenzo D'Amore
Hi Mikhail, Shawn, thanks for your prompt answer. The problem is that the indexed documents have dozen of fields and usually they are different for each document. For example document id 1 has few generic fields like title, description and all the attributes like attr_1224, attr_4343, attr_4454,

Re: Solr RSIZE memory overusage

2018-09-11 Thread Boris Pasko
On Tue, 2018-09-11 at 12:43 -0600, Shawn Heisey wrote: > On 9/11/2018 12:14 PM, Boris Pasko wrote: > > > > > > > > Run top, press shift-M to sort by memory usage, then grab a > > atop: http://oi68.tinypic.com/10pokkk.jpg > > top: http://oi63.tinypic.com/msbpfp.jpg > Looking at the second one: > > T

Re: Docker and Solr Indexing

2018-09-11 Thread Jan Høydahl
You have not shed any light on what the reason for the container restart was, and there is too little information about your setup and Solr usage to guess what goes on. Whether 4Gb is sufficient or not depends on how much data and queries you plan for each shard to handle, how much heap you give

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Shawn Heisey
On 9/11/2018 12:00 PM, Venkateswarlu Bommineni wrote: What i am expecting is (it might be silly) if i put group.main=true and sort by price then the results are: { "priceValueGLP_usd_double":32015.0, "sapRank_int":446}, { "priceValueGLP_usd_double":32015.

Re: Solr RSIZE memory overusage

2018-09-11 Thread Shawn Heisey
On 9/11/2018 12:14 PM, Boris Pasko wrote: Run top, press shift-M to sort by memory usage, then grab a atop: http://oi68.tinypic.com/10pokkk.jpg top: http://oi63.tinypic.com/msbpfp.jpg Looking at the second one: The SHR value is showing 90GB. Your Java process is in actuality only using in th

Solr RSIZE memory overusage

2018-09-11 Thread Boris Pasko
Hi. We're running Solr 6.6.1 (SolrCloud, 3 clusters). Recently I noticed it became significantly slower to respond and did some basic checks on servers. There is little IO, a bit of CPU usage (110% user, 3090% idle), but one thing is very strange - the resident memory usage of the Solr. Despite th

Re: Solr RSIZE memory overusage

2018-09-11 Thread Boris Pasko
>Run top, press shift-M to sort by memory usage, then grab a atop: http://oi68.tinypic.com/10pokkk.jpg top: http://oi63.tinypic.com/msbpfp.jpg – The information contained in this message and any attachments may be confidential and/or restricted and protected f

Re: Solr RSIZE memory overusage

2018-09-11 Thread Boris Pasko
On Tue, 2018-09-11 at 10:26 -0700, Erick Erickson wrote: > The memory usage is probably MMapDirectory, see: > http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.ht > ml, > that's not a problem I'd guess. We're using NRTCachingDirectoryFactory > The open file handles... and ones bei

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Venkateswarlu Bommineni
Erik and Shawn, Sorry for the confusion. Yes Solr is sorting inside the grouped results.but not on all the results. Example : we have got 6 records by solr response. if i do *sort=price desc , it is sorting indise each group.* *But the requirement is to sort on all the results.* *Current result

Re: Update partial document

2018-09-11 Thread Shawn Heisey
On 9/11/2018 10:23 AM, Vincenzo D'Amore wrote: I suppose to be able to remove attr_1 and add attr_3 with one atomic update. Like this: curl -X POST -H 'Content-Type: application/json' ' http://localhost:8983/solr/gettingstarted/update?versions=true&commit=true' --data-binary ' [ {

Re: Solr RSIZE memory overusage

2018-09-11 Thread Shawn Heisey
On 9/11/2018 11:07 AM, Boris Pasko wrote: Hi. We're running Solr 6.6.1 (SolrCloud, 3 nodes). Recently I noticed it became significantly slower to respond and did some basic checks on servers. There is little IO, a bit of CPU usage (110% user, 3090% idle), but one thing is very strange - the resid

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Erick Erickson
How this all works will be much clearer if you don't use "group.main=true" But you still haven't _shown_ us what you _expect_. In the second query, Solr is doing exactly what you're telling it to. Return groups of up to three docs lowest-priced docs in each group and ordering the groups by the lo

Re: Solr RSIZE memory overusage

2018-09-11 Thread Erick Erickson
The memory usage is probably MMapDirectory, see: http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html, that's not a problem I'd guess. The open file handles... and ones being open many times. 1> how many _total_ segment files do you have in all the replicas in the JVM? A single

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Shawn Heisey
On 9/11/2018 10:14 AM, Venkateswarlu Bommineni wrote: Please find the resonse and query when grouping and sorting by rank : I see no evidence of grouping happening in either of those responses.  They look like standard responses do when grouping is not enabled. Here's an example of a grouped

Re: Solr RSIZE memory overusage

2018-09-11 Thread Boris Pasko
> Same picture on Solr 6.6.2, tested on various Oracle JVMs ranging > from 1.8.0_171 to 1.8.0_171 From 1.8.0_171 to 1.8.0_181 – The information contained in this message and any attachments may be confidential and/or restricted and protected from disclosure.

Re: Error while creating a new solr core

2018-09-11 Thread Christopher Schultz
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 Shalvak, On 9/11/18 01:51, Shalvak Mittal (UST, ) wrote: > I have recently installed solr 7.2.1 in my ubuntu 16.04 system. > While creating a new core, the solr logging shows an error saying > > > " Caused by: org.apache.solr.common.SolrException:

Solr RSIZE memory overusage

2018-09-11 Thread Boris Pasko
Hi. We're running Solr 6.6.1 (SolrCloud, 3 nodes). Recently I noticed it became significantly slower to respond and did some basic checks on servers. There is little IO, a bit of CPU usage (110% user, 3090% idle), but one thing is very strange - the resident memory usage of the Solr. Despite the -

Re: local "q.op=AND" ignored for edismax query

2018-09-11 Thread Shawn Heisey
On 9/10/2018 5:45 PM, dshih wrote: Based on what you said, is my query supposed to work as is if I set luceneMatchVersion=7.1.0? It does not appear to. It does look like a luceneMatchVersion check was added to the change in SOLR-11501, so I would expect that to work.  Setting luceneMatchVersi

Re: Error while creating a new solr core

2018-09-11 Thread Shawn Heisey
On 9/10/2018 11:51 PM, Shalvak Mittal (UST, ) wrote: I have recently installed solr 7.2.1 in my ubuntu 16.04 system. While creating a new core, the solr logging shows an error saying " Caused by: org.apache.solr.common.SolrException: fips module was not loaded." I have never heard of a module

Re: Nutch 1.15 Indexing

2018-09-11 Thread Shawn Heisey
On 9/11/2018 12:22 AM, Bineesh wrote: Need help on Nutch 1.15 indexing issues. We are using Nutch 1.15 and Solr 7.3.1 in our setup 1 : Is there a way i can mention multiple collections in the Nutch 1.15 indexwriters.xml file for the same ? I see collection works fine if i hardcoded the collec

Re: Update partial document

2018-09-11 Thread Mikhail Khludnev
Hello, Vincenzo. What about adding 1 into "attr_" : [ "set" : null ], ? On Tue, Sep 11, 2018 at 7:23 PM Vincenzo D'Amore wrote: > Hi Solr gurus :) > > I have a delicious question (that I'm struggling with), really hope that > someone can help me. > > There is a document with many fields bu

Update partial document

2018-09-11 Thread Vincenzo D'Amore
Hi Solr gurus :) I have a delicious question (that I'm struggling with), really hope that someone can help me. There is a document with many fields but I have to modify only few of them. I thought to use atomic update but it seems that I cannot replace an entire list of dynamic fields. Here I t

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Venkateswarlu Bommineni
Please find the resonse and query when grouping and sorting by rank : http://localhost:8983/solr/master_shaneco_Product_flip/select?indent=on&q=rank:[1%20TO%20*]&wt=json&group=true&group.main=true&group.field=rank&rows=50&fl=code_String,price,rank&sort=rank+asc&group.limit=3 { "responseHeader":{

Re: Docker and Solr Indexing

2018-09-11 Thread Walter Underwood
4 Gb is very small for Solr. Solr is not designed for Dockerized, fail-often use. We use a LOT of Docker ECS, but all of our Solr servers are on EC2 instances. That’s about sixty instances in several clusters. We run an 8 Gb heap for all our Solr instances. Instances in our biggest cluster (in t

Re: group.limit>1 and sorting is not working as expected

2018-09-11 Thread Erick Erickson
This is still confusing: bq. But the requirement is to sort on all the results we show to the customer. What does grouping have to do with that statement? Would it be served by just _not_ grouping at all? If not, why not? Please provide a small set of example documents and what you want to show

Re: 504 timeout

2018-09-11 Thread John Blythe
ah, great thought. didn't even think of that. we already have a couple ngram-based fields. will send over to the stakeholder who was attempting this. thanks! -- John Blythe On Sun, Sep 9, 2018 at 11:31 PM Erick Erickson wrote: > First of all, wildcards are evil. Be sure that the reason people

Speakers needed for Apache DC Roadshow

2018-09-11 Thread Rich Bowen
We need your help to make the Apache Washington DC Roadshow on Dec 4th a success. What do we need most? Speakers! We're bringing a unique DC flavor to this event by mixing Open Source Software with talks about Apache projects as well as OSS CyberSecurity, OSS in Government and and OSS Career

Re: Error casting to PointField

2018-09-11 Thread Erick Erickson
point-based fields cannot be used for , see: https://issues.apache.org/jira/browse/SOLR-10829 This should be documented better in the ref guide On Tue, Sep 11, 2018 at 5:53 AM Zahra Aminolroaya wrote: > > We read that in Solr 7, Trie* fields are deprecated, so we decided to change > all of ou

Re: any way to post json document to a MoreLikeThisHandler?

2018-09-11 Thread Alexandre Rafalovitch
There are three ways to trigger MLT: https://lucene.apache.org/solr/guide/7_4/morelikethis.html MoreLikeThisHandler allows to supply text externally. Unfortunately, I can't find the specific example demonstrating it, so not sure if it just a blob of text or a document. Regards, Alex. On 11 Se

any way to post json document to a MoreLikeThisHandler?

2018-09-11 Thread Matt Work Coarr
Hello, Using a MoreLikeThisHandler, I was hoping to be able to pass in in the post body a json document (the same format as a document indexed in my core, but the document in the request is not and should not be added to the core). I'm thinking it would handle an incoming document similar to how

Error casting to PointField

2018-09-11 Thread Zahra Aminolroaya
We read that in Solr 7, Trie* fields are deprecated, so we decided to change all of our Trie* fields to *pointtype Fields. Our unique key field type is long, and we changed our long field type something like below; We get the error uniqueKey field can not be configured to use a Points based Fi

Error while creating a new solr core

2018-09-11 Thread Shalvak Mittal (UST, )
Hi, I have recently installed solr 7.2.1 in my ubuntu 16.04 system. While creating a new core, the solr logging shows an error saying " Caused by: org.apache.solr.common.SolrException: fips module was not loaded." I have downloaded the necessary jar files like cryptoj.jar and copied them in

Nutch 1.15 Indexing

2018-09-11 Thread Bineesh
Hi Team, Need help on Nutch 1.15 indexing issues. We are using Nutch 1.15 and Solr 7.3.1 in our setup 1 : Is there a way i can mention multiple collections in the Nutch 1.15 indexwriters.xml file for the same ? I see collection works fine if i hardcoded the collection name in indexer_solr_1 2

Re: Potential bug? maxConnectionsPerHost on requestHandler configuration

2018-09-11 Thread Greg Roodt
This is expected behaviour. The shardHandlerFactory element is configured in solr.xml, not solrconfig.xml See: https://lucene.apache.org/solr/guide/7_4/format-of-solr-xml.html On Tue, 11 Sep 2018 at 11:55, Ash Ramesh wrote: > Hi, > > I tried setting up a bespoke ShardHandlerFactory configura