Re: adding documents to a secured solr server.

2017-11-01 Thread Shawn Heisey
On 11/1/2017 10:04 PM, Phil Scadden wrote: For testing, I changed to HttpSolrClient and specifying the core on process and commit instead of opening it as server/core. This time worked... sort of. Despite deleting the entire index with deletebyquery and seeing that it was empty in the coreAdmi

RE: adding documents to a secured solr server.

2017-11-01 Thread Phil Scadden
Requested a reload and now it indexes with the secure server using HttpSolrClient. Phew. I now look to see if I can optimize and get concurrentupdate server to work. At least I can get the index back now. -Original Message- From: Phil Scadden [mailto:p.scad...@gns.cri.nz] Sent: Thursday, 2 Nov

Re: Sum area polygon solr

2017-11-01 Thread David Smiley
Hi, Ah, no -- sorry. If you want to roll up your sleeves and write a Solr plugin (a ValueSource in this case, perhaps) then you could lookup the index polygon and then call out to JTS to compute the intersection and then ask it for the area. But that's going to be a very heavyweight computation

RE: adding documents to a secured solr server.

2017-11-01 Thread Phil Scadden
For testing, I changed to HttpSolrClient and specifying the core on process and commit instead of opening it as server/core. This time worked... sort of. Despite deleting the entire index with deletebyquery and seeing that it was empty in the coreAdmin, I get : possible analysis error: cannot c

Re: max docs, deleted docs optimization

2017-11-01 Thread kshitij tyagi
Thanks Erick for your prompt response, it was really helpful. On Tue, Oct 31, 2017 at 8:30 PM, Erick Erickson wrote: > 1> 2 lakh at most. If the standard background merging is going on it > may be less than that. > > 2> Some, but whether you notice or not is an open question. In an > index with on

RE: adding documents to a secured solr server.

2017-11-01 Thread Phil Scadden
So the real error is authentication, (the version is spurious) but why that when authentication is being set on the updateRequest? -Original Message- From: Shawn Heisey [mailto:elyog...@elyograg.org] Sent: Thursday, 2 November 2017 3:55 p.m. To: solr-user@lucene.apache.org Subject: Re: ad

Re: adding documents to a secured solr server.

2017-11-01 Thread Shawn Heisey
On 11/1/2017 8:13 PM, Phil Scadden wrote: 14:52:45,962 DEBUG ConcurrentUpdateSolrClient:177 - starting runner: org.apache.solr.client.solrj.impl.ConcurrentUpdateSolrClient$Runner@6eeba4a 14:52:46,224 WARN ConcurrentUpdateSolrClient:343 - Failed to parse error response from http://online-dev.gn

Re: adding documents to a secured solr server.

2017-11-01 Thread Shawn Heisey
On 11/1/2017 7:59 PM, Phil Scadden wrote: After some digging, I tried this approach... solr = new ConcurrentUpdateSolrClient.Builder(solrUrl) .withQueueSize(20) .build(); SolrInputDocument up = new SolrInputDocument(); up.addFiel

Re: Advice on Stemming in Solr

2017-11-01 Thread Zheng Lin Edwin Yeo
Hi Emir, We do have quite a lot of words that should not be stemmed. Currently, the KStemFilterFactory is stemming all the non-English words that end with "ing" as well. There are quite a lot of places and names which end in "ing", and all these are being stemmed as well, which leads to an inaccur

RE: Stateless queries to secured SOLR server.

2017-11-01 Thread Phil Scadden
Thanks for that Shawn. What I am doing is working fine now. I need the middle proxy to audit and modify what client sends to solr (based on user rights) not to mention keeping solr from direct exposure to internet. -Original Message- From: Shawn Heisey [mailto:elyog...@elyograg.org] Sent

Re: Stateless queries to secured SOLR server.

2017-11-01 Thread Shawn Heisey
On 11/1/2017 4:22 PM, Phil Scadden wrote: Except that I am using solrj in an intermediary proxy and passing the response directly to a javascript client. It expects json or csv depending on what it passes in wt= That's a different use case than I had imagined. Thanks for the detail. My st

RE: adding documents to a secured solr server.

2017-11-01 Thread Phil Scadden
And my security.json looks like: { "authentication":{ "class":"solr.BasicAuthPlugin", "blockUnknown":true, "credentials":{ "solrAdmin":" a hash ", "solrGuest":"another hash"}, "":{"v":0}}, "authorization":{ "class":"solr.RuleBasedAuthorizationPlugin", "permis

Re: Making a String field case-insensitive

2017-11-01 Thread Zheng Lin Edwin Yeo
Hi Emir, Thanks for your advice. This works. Regards, Edwin On 1 November 2017 at 18:08, Emir Arnautović wrote: > Hi, > You can use KeywordTokenizer and LowerCaseTokenFilterFactory. > > HTH, > Emir > -- > Monitoring - Log Management - Alerting - Anomaly Detection > Solr & Elasticsearch Consul

RE: adding documents to a secured solr server.

2017-11-01 Thread Phil Scadden
After some digging, I tried this approach... solr = new ConcurrentUpdateSolrClient.Builder(solrUrl) .withQueueSize(20) .build(); SolrInputDocument up = new SolrInputDocument(); up.addField("id",f.getCanonicalPath()); up.add

Re: Upgrade path from 5.4.1

2017-11-01 Thread Yonik Seeley
On Wed, Nov 1, 2017 at 2:36 PM, Erick Erickson wrote: > I _always_ prefer to reindex if possible. Additionally, as of Solr 7 > all the numeric types are deprecated in favor of points-based types > which are faster on all fronts and use less memory. They are a good step forward in general, and fast

Error executing a SQL query sorting on both a field and an expression

2017-11-01 Thread Fabio Corneti
Hello, I'm doing some tests against the SQL handler on a single node SolrCloud 7.1.0 installation. When running this query: SELECT Org_Type, COUNT(*) FROM obesity GROUP BY Org_Type ORDER BY Org_Type DESC, COUNT(*) ASC I get the following error: Failed to execute sqlQuery : java.io.IOExcepti

Re: App Studio

2017-11-01 Thread Vincenzo D'Amore
Hi, I'm interested too. On Thu, Nov 2, 2017 at 12:46 AM, Kojo wrote: > I would like to try that! > > > On 1 Nov 2017 18:04, "Will Hayes" wrote: > > There is a community edition of App Studio for Solr and Elasticsearch being > released by Lucidworks in November. Drop me a line if you wo

Re: App Studio

2017-11-01 Thread Kojo
I would like to try that! On 1 Nov 2017 18:04, "Will Hayes" wrote: There is a community edition of App Studio for Solr and Elasticsearch being released by Lucidworks in November. Drop me a line if you would like to get a preview release. -wh -- Will Hayes | CEO | Lucidworks direct. +1

Re: Streaming Expression - cartesianProduct

2017-11-01 Thread Kojo
Pratik's information answered the question. Thanks! On 1 Nov 2017 19:45, "Amrit Sarkar" wrote: Following Pratik's spot-on comment and not really related to your question, even the "partitionKeys" parameter needs to be set to the "over" field when using "parallel" streaming. Am

adding documents to a secured solr server.

2017-11-01 Thread Phil Scadden
Solrj QueryRequest object has a method to set basic authorization username/password but what is the equivalent way to pass authorization when you are adding new documents to an index? ConcurrentUpdateSolrClient solr = new ConcurrentUpdateSolrClient(solrProperties.getServer(),10,2); ...
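
Editor's note on the question above: Basic authentication comes down to an Authorization header on every HTTP request, whichever SolrJ client sends it. The sketch below is not SolrJ code; it uses only the JDK to show how that header value is formed. The class and method names (BasicAuthHeader, basicAuthValue) are made up for illustration, and the credentials in any call are placeholders.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Illustrative helper only; not part of SolrJ.
final class BasicAuthHeader {

    /**
     * Builds the value of an HTTP "Authorization" header for Basic
     * authentication: the word "Basic", a space, then base64("user:password").
     */
    public static String basicAuthValue(String user, String password) {
        String pair = user + ":" + password;
        byte[] bytes = pair.getBytes(StandardCharsets.UTF_8);
        return "Basic " + Base64.getEncoder().encodeToString(bytes);
    }
}
```

A server with blockUnknown set to true rejects any request, update or query, that arrives without such a header. Later messages in this thread set the credentials on the update request itself; the exact call should be checked against the SolrJ documentation for the version in use.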

RE: Stateless queries to secured SOLR server.

2017-11-01 Thread Phil Scadden
Except that I am using solrj in an intermediary proxy and passing the response directly to a javascript client. It expects json or csv depending on what it passes in wt= -Original Message- From: Shawn Heisey [mailto:apa...@elyograg.org] Sent: Thursday, 2 November 2017 2:48 a.m. To: sol

RE: App Studio

2017-11-01 Thread Kris Musshorn
Yes please -Original Message- From: Will Hayes [mailto:w...@lucidworks.com] Sent: Wednesday, November 1, 2017 4:04 PM To: solr-user@lucene.apache.org Subject: Re: App Studio There is a community edition of App Studio for Solr and Elasticsearch being released by Lucidworks in November. D

Re: Streaming Expression - cartesianProduct

2017-11-01 Thread Amrit Sarkar
Following Pratik's spot-on comment and not really related to your question, even the "partitionKeys" parameter needs to be set to the "over" field when using "parallel" streaming. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidwor

Re: Streaming Expression - cartesianProduct

2017-11-01 Thread Pratik Patel
Roll up needs documents to be sorted by the "over" field. Check this for more details http://lucene.472066.n3.nabble.com/Streaming-Expressions-rollup-function-returning-results-with-duplicate-tuples-td4342398.html On Wed, Nov 1, 2017 at 3:41 PM, Kojo wrote: > Wrap cartesianProduct function with
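
Editor's note: the sorting requirement follows from how a streaming rollup works. It reads tuples one at a time and closes a group as soon as the key changes, so equal keys that are not adjacent end up in separate groups. The sketch below is a plain-Java illustration of that behaviour, not Solr's implementation; the names RollupSketch and rollupCount are hypothetical, and counting stands in for whatever metric is being rolled up.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: mimics a one-pass rollup over a stream of keys.
final class RollupSketch {

    /**
     * Counts runs of adjacent equal keys, emitting "key=count" whenever
     * the key changes. Correct totals require input sorted by key.
     */
    public static List<String> rollupCount(List<String> keys) {
        List<String> out = new ArrayList<>();
        String current = null;
        int count = 0;
        for (String key : keys) {
            if (current != null && !key.equals(current)) {
                out.add(current + "=" + count); // key changed: close the group
                count = 0;
            }
            current = key;
            count++;
        }
        if (current != null) {
            out.add(current + "=" + count); // close the final group
        }
        return out;
    }
}
```

With the input a, a, b the sketch returns one group per key. With a, b, a it returns three groups, two of them for a, which is the duplicate-tuple symptom described in the linked thread.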

Re: App Studio

2017-11-01 Thread Will Hayes
There is a community edition of App Studio for Solr and Elasticsearch being released by Lucidworks in November. Drop me a line if you would like to get a preview release. -wh -- Will Hayes | CEO | Lucidworks direct. +1.415.997.9455 | email. w...@lucidworks.com On Wed, Nov 1, 2017 at 12:54 PM, Dav

App Studio

2017-11-01 Thread David Hastings
Hey all, at the conference it was mentioned that Lucidworks would release App Studio as its own free project. Is that still the case?

Streaming Expression - cartesianProduct

2017-11-01 Thread Kojo
Wrapping the cartesianProduct function with the fetch function works as expected. But the rollup function over cartesianProduct doesn't aggregate on a returned field of the cartesianProduct. The field "id_researcher" below is a multivalued field: This one works: fetch(reasercher, cartesianProduct(

Re: Upgrade path from 5.4.1

2017-11-01 Thread Erick Erickson
I _always_ prefer to reindex if possible. Additionally, as of Solr 7 all the numeric types are deprecated in favor of points-based types which are faster on all fronts and use less memory. However, to use this functionality you'll need to re-index anyway. Solr 7 will still support Trie types, but

Re: SOLR-11504: Provide a config to restrict number of indexing threads

2017-11-01 Thread Nawab Zada Asad Iqbal
Well, the reason I want to control the number of indexing threads is to restrict the number of "segments" being created at one time in RAM. One indexing thread in Lucene corresponds to one segment being written. I need fine control over the number of segments. Less than that, and I will not be fully u

Upgrade path from 5.4.1

2017-11-01 Thread Petersen, Robert (Contr)
Hi Guys, I just took over the care and feeding of three poor neglected solr 5.4.1 cloud clusters at my new position. While spinning up new collections and supporting other business initiatives I am pushing management to give me the green light on migrating to a newer version of solr. The last

Re: Solr streaming questions

2017-11-01 Thread Erick Erickson
Perhaps if you bothered to explain your use-case we could suggest alternatives. Streaming is built to handle very large result sets in a divide-and-conquer manner, thus the ability to specify worker nodes each of which handles a sub-set of the results. Partitioning the output streams requires a w

Re: Automatic creation of indexes

2017-11-01 Thread Emir Arnautović
>Emir, your message did not actually include anything related to the >presentation you mentioned. Ups - seems I forgot to paste: https://www.youtube.com/watch?v=1gzwAgrk47c Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & El

Re: Solr streaming questions

2017-11-01 Thread Webster Homer
I know that /select supports score. However, I don't want to have to page the results, I want to use stream to stream the results of a search, but I cannot sort by the relevancy of the result. This seems like a MAJOR deficit for the streaming API /select wants to do paging which in my case I don't

Using bits in multi tenant document routing index in single shard

2017-11-01 Thread Ketan Thanki
Hi, I have 4 shards and 4 replicas and I use composite document routing for my unique field 'Id' as described below. e.g.: projectId:158380 modelId:3606, where the tenant bits are used as a projectId/NumBits!modelId/NumBits! prefix on the Id. NumBits are distributed as described below: 3 bits would spread the tenant o
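
Editor's note: the two-level prefix described above is just a string convention on the document id. The sketch below only assembles such an id. The class and method names are hypothetical, and apart from the 3 tenant bits mentioned in the message, the bit counts passed in are arbitrary example values, not recommendations.

```java
// Illustrative only: builds a two-level composite id of the form
// projectId/bits!modelId/bits!id, as described in the message above.
final class CompositeIdSketch {

    public static String compositeId(String projectId, int projectBits,
                                     String modelId, int modelBits,
                                     String id) {
        return projectId + "/" + projectBits + "!"
             + modelId + "/" + modelBits + "!"
             + id;
    }
}
```

For example, compositeId("158380", 3, "3606", 2, "doc1") yields 158380/3!3606/2!doc1. How the bits map documents onto shards is decided by Solr's router, which this sketch does not model.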

Re: SOLR-11504: Provide a config to restrict number of indexing threads

2017-11-01 Thread Shawn Heisey
On 10/31/2017 4:57 PM, Nawab Zada Asad Iqbal wrote: I hit this issue https://issues.apache.org/jira/browse/SOLR-11504 while migrating to solr6 and locally working around it in Lucene code. I am thinking to fix it properly and hopefully patch back to Solr. Since, Lucene code does not want to keep

Re: Automatic creation of indexes

2017-11-01 Thread Shawn Heisey
On 10/31/2017 5:32 AM, Jokin Cuadrado wrote: Hi, I'm using solr to store time series data, log events etc. Right now I use a solr cloud collection and cleaning it deleting documents via queries, but I would like to know what approaches are other people using. Is there a way to create a collectio

Re: Stateless queries to secured SOLR server.

2017-11-01 Thread Shawn Heisey
On 10/31/2017 2:08 PM, Phil Scadden wrote: Thanks Shawn. I have done it with SolrJ. Apart from needing the NoopResponseParser to handle the wt=, it was pretty painless. This is confusing to me, because with SolrJ, you do not need to be concerned with the response format *AT ALL*. You don't n

Re: Solr response with original value

2017-11-01 Thread Shawn Heisey
On 10/31/2017 1:38 PM, Venkateswarlu Bommineni wrote: Thanks for the reply Shawn. But I am a little confused about faceting on one field and returning the result of another field. Could you please give a sample query? Thanks a lot in advance!!! I really don't know what you mean. Facets do not have res

Re: Automatic creation of indexes

2017-11-01 Thread Emir Arnautović
Hi Jokin, Here is a presentation by my colleagues about using Solr for logs. Regards, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 31 Oct 2017, at 12:32, Jokin Cuadrado wrote: > > Hi, I'm

Re: Query regarding to multi tenant composite ID document routing

2017-11-01 Thread Emir Arnautović
Hi Ketan, Here is a blog post explaining how to use routing: https://sematext.com/blog/solrcloud-large-tenants-and-routing/ HTH, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Suppo

Re: Advice on Stemming in Solr

2017-11-01 Thread Emir Arnautović
Hi Edwin, If the number of words that should not be stemmed is not high, you could use KeywordMarkerFilterFactory to flag those words as keywords, which should prevent the stemmer from changing them. Depending on what you want to achieve, you might not be able to avoid using a stemmer at indexing time.

Re: Making a String field case-insensitive

2017-11-01 Thread Emir Arnautović
Hi, You can use KeywordTokenizer and LowerCaseTokenFilterFactory. HTH, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 1 Nov 2017, at 09:50, Zheng Lin Edwin Yeo wrote: > > Hi, > > Would like to fi
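
Editor's note: the effect of that suggestion is that the whole field value stays one token and only its case is folded, at both index and query time. The plain-Java sketch below approximates that normalization for illustration; it is not Solr's analysis chain, and the names CaseInsensitiveKey and normalize are hypothetical.

```java
import java.util.Locale;

// Illustrative only: one value in, one lower-cased value out.
// Nothing is split on whitespace and no characters are removed.
final class CaseInsensitiveKey {

    public static String normalize(String value) {
        // Locale.ROOT avoids locale-specific surprises (e.g. Turkish dotless i).
        return value.toLowerCase(Locale.ROOT);
    }
}
```

Two values match case-insensitively when their normalized forms are equal, for example "New York-City #1" and "NEW YORK-CITY #1".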

LatLonPointSpatialField, sorting : sort param could not be parsed as a query, and is not a field that exists in the index

2017-11-01 Thread Clemens Wyss DEV
Context: Solr 6.6.0. I'm switching my schemas from the deprecated solr.LatLonType to solr.LatLonPointSpatialField. Now my sort query (which used to work with solr.LatLonType): sort=geodist(b4_location__geo_si,47.36667,8.55) asc raises the error "sort param could not be parsed as a query, and is not

Advice on Stemming in Solr

2017-11-01 Thread Zheng Lin Edwin Yeo
Hi, We are currently using KStemFilterFactory in Solr, but we found that it is actually stemming non-English words like "ximenting", which it stems to "ximent". This is not what we wanted. Another option is to use the HunspellStemFilterFactory, but there are some English words like "runni

Making a String field case-insensitive

2017-11-01 Thread Zheng Lin Edwin Yeo
Hi, I would like to find out the best way to lower-case a String index in Solr, to make it case-insensitive, while preserving the structure of the string (i.e. it should not break into different tokens at spaces, and should not remove any characters or symbols). I found that solr.StrField does