A different result with filters

2018-10-25 Thread Владислав Властовский
Hi, I use 7.5.0 Solr Why do I get two different results for similar requests? First req/res: { "query": "*:*", "limit": 0, "filter": [ "{!parent which=kind_s:edition}condition_s:0", "{!parent which=kind_s:edition}price_i:[* TO 75]" ] } { "response": { "numFound": 453,

Re: Edismax query returning the same number of results using AND as it does with OR

2018-10-25 Thread Zheng Lin Edwin Yeo
Hi, What is your full query path or URL that you pass for the query? And how is your setting like for the edismax in your solrconfig.xml? Regards, Edwin On Fri, 26 Oct 2018 at 06:24, Nicky Mastin wrote: > > Oddity with edismax and queries involving boolean operators. Here's the > "parsedquery

Re: Solr Cell Input Parameter tika.config

2018-10-25 Thread Yasufumi Mizoguchi
Hello, I could not find the process that parse tika.config parameter from solr request. Maybe, tika.config parameter can only be defined in solrconfig.xml as following. tika-config.xml true ignored_ true links ignored_ Thanks, Yasufumi 2018年10月26日(金) 7:07 Robertson

Re: Internal Solr communication question

2018-10-25 Thread Erick Erickson
preferLocalShards is a bit of a misnomer. I usually think of it as "don't go to another Solr node if possible". On Thu, Oct 25, 2018 at 10:46 AM Fernando Otero wrote: > > Thanks Emir! > I was already looking at preferLocalShards but I wasn't sure it'll help > with only 1 shard, I'll give it a

Edismax query returning the same number of results using AND as it does with OR

2018-10-25 Thread Nicky Mastin
Oddity with edismax and queries involving boolean operators. Here's the "parsedquery_toString" from two different queries: input: "dog AND kiwi": https://apaste.info/gaQl input: "dog OR kiwi": https://apaste.info/sBwa Both queries return the same number of results (389). The query with OR was

Solr Cell Input Parameter tika.config

2018-10-25 Thread Robertson, Eric J
Hello all, Currently trying to define a tika config to use when posting a pdf to Solr Cell as we may want to override the default tika configuration depending on type of document being ingested. In the docs it lists tika.config as an input parameter to the Solr Cell endpoint. Though in my test

Fuzzy search expansion problem on 6.6.3

2018-10-25 Thread Ryan Wilson
Hello all, I am running a solr 6.6.3 3-shard cloud with one main collection that contains 587,371,821 rows of data. One of the fields in this collection is names. We are currently running into an issue with fuzzy searches on name where it seems unable to get all possible values for a number of dif

Re: Reading data using Tika to Solr

2018-10-25 Thread Tim Allison
If you’re processing actual msg (not eml), you’ll also need poi and poi-scratchpad and their dependencies, but then those msgs could have attachments, at which point, you may as just add tika-app. :D On Thu, Oct 25, 2018 at 2:46 PM Martin Frank Hansen (MHQ) wrote: > Hi Erick and Tim, > > Thanks

RE: Reading data using Tika to Solr

2018-10-25 Thread Martin Frank Hansen (MHQ)
Hi Erick and Tim, Thanks for your answers, I can see that my mail got messed up on the way through the server. It looked much more readable at my end 😉 The attachment simply included my build-path. @Erick I am compiling the program using Netbeans at the moment. I updated to tika-1.7 but that d

Re: Does ConcurrentUpdateSolrClient apply for SolrCloud ?

2018-10-25 Thread Jason Gerlowski
One comment to complicate Erick's already-good advice. > If a doc that needs to go to shard2 is received by a replica on shard1, it > must be forwarded to the leader of shard1, introducing an extra hop. Definitely true, but I don't think that's the only factor in the relative performance of CUSC

Re: Score relevancy

2018-10-25 Thread Walter Underwood
Use a bf of 10 * RANK. That will give the same ordering as dividing the score by 10 and adding RANK. There are problems with additive boosts, so I strongly recommend looking at the “boost” parameter, which is a multiplicative boost. That is more stable over a wide range of score values. wunder

Re: Reading data using Tika to Solr

2018-10-25 Thread Tim Allison
To follow up w Erick’s point, there are a bunch of transitive dependencies from tika-parsers. If you aren’t using maven or similar build system to grab the dependencies, it can be tricky to get it right. If you aren’t using maven, and you can afford the risks of jar hell, consider using tika-app or

Re: Score relevancy

2018-10-25 Thread Amjad Khan
We use ranking below 100 and yes it is float. > On Oct 25, 2018, at 1:08 PM, David Hastings > wrote: > > is this RANK value stored as a float/integer? and whats the range? one > idea is you could use edismax and have a really possible long boost query: > RANK:[1 TO 2]^10 OR RANK:[3 TO 4]^9 >

Re: Internal Solr communication question

2018-10-25 Thread Fernando Otero
Thanks Emir! I was already looking at preferLocalShards but I wasn't sure it'll help with only 1 shard, I'll give it a try On Thu, Oct 25, 2018 at 11:26 AM Emir Arnautović < emir.arnauto...@sematext.com> wrote: > Hi Fernando, > I did not look at code and not sure if there is special handling

Re: Score relevancy

2018-10-25 Thread David Hastings
is this RANK value stored as a float/integer? and whats the range? one idea is you could use edismax and have a really possible long boost query: RANK:[1 TO 2]^10 OR RANK:[3 TO 4]^9 but this isnt actually a great idea and gets sloppy fast. you could apply boost at index time, or a function query

Re: Reading data using Tika to Solr

2018-10-25 Thread Erick Erickson
Martin: The mail server is pretty aggressive about stripping attachments, your png didn't come though. You might also get a more informed answer on the Tika mailing list. That said (and remember I can't see your png so this may be a silly question), how are you executing the program .vs. compilin

Score relevancy

2018-10-25 Thread Amjad Khan
Hi Is there a way to achieve the following - We have a RANK field in each document, and essentially, I would like my score to be influenced by this RANK as follows - score = score*0.1 + RANK How can I achieve this with function queries? Thanks!

Re: Solr 7.5/skg

2018-10-25 Thread David Hastings
Yup, thats the one. Thanks. On Thu, Oct 25, 2018 at 11:54 AM Alexandre Rafalovitch wrote: > Probably this one: https://issues.apache.org/jira/browse/SOLR-9418 > > I am not sure if that's documented yet. > > Regards, >Alex. > On Thu, 25 Oct 2018 at 11:08, David Hastings > wrote: > > > > Alt

Re: Solr 7.5/skg

2018-10-25 Thread Alexandre Rafalovitch
Probably this one: https://issues.apache.org/jira/browse/SOLR-9418 I am not sure if that's documented yet. Regards, Alex. On Thu, 25 Oct 2018 at 11:08, David Hastings wrote: > > Although another of Treys examples, the semantic query parser, Doesn't seem > to have documentation unless im missi

RE: TLOG replica stucks

2018-10-25 Thread Vadim Ivanov
Thanks Erick for you attention! My comments below, but supposing that the problem resides in zookeeper I'll collect more information from zk logs and solr logs and be back soon. > bq. I've noticed that some replicas stop receiving updates from the > leader without any visible signs from the clus

Re: Solr 7.5/skg

2018-10-25 Thread David Hastings
Although another of Treys examples, the semantic query parser, Doesn't seem to have documentation unless im missing something? On Thu, Oct 25, 2018 at 10:41 AM David Hastings < hastings.recurs...@gmail.com> wrote: > Wow, thanks for that. Will do some research and come back with the > inevitable

Re: Index fetch failed. Exception: Server refused connection

2018-10-25 Thread Walter Underwood
A 1 Gb heap is probably too small on the master. Run with 8 Gb like the slaves. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Oct 24, 2018, at 10:20 PM, Bharat Yadav wrote: > > Hello Team, > > We are now a days frequently facing below issue on o

Re: Query to multiple collections

2018-10-25 Thread Atita Arora
Hi, This kind of was one of the problems I was facing recently. While in my use case I am supposed to be showing spellcheck suggestions (collated) from two different collections. To also mention both these collections are using the same schema while they need to be segregated as for the business n

Re: Solr 7.5/skg

2018-10-25 Thread David Hastings
Wow, thanks for that. Will do some research and come back with the inevitable questions I will have. On Thu, Oct 25, 2018 at 10:37 AM Alexandre Rafalovitch wrote: > See > https://www.slideshare.net/arafalov/searching-for-ai-leveraging-solr-for-classic-artificial-intelligence-tasks > , slides 19

Re: Solr 7.5/skg

2018-10-25 Thread Alexandre Rafalovitch
See https://www.slideshare.net/arafalov/searching-for-ai-leveraging-solr-for-classic-artificial-intelligence-tasks , slides 19+ But it is not a fully-supported usage, due to https://issues.apache.org/jira/browse/SOLR-12569 . So, at your own risk. Regards, Alex. On Thu, 25 Oct 2018 at 10:32,

Re: Solr 7.5/skg

2018-10-25 Thread David Hastings
Another skg question. the significantTerms says it queries a solrcloud collection, only, is there any way to have it work on standalone solr/cores as well? the MLT function works fine on standalone, was really hoping this would as well. On Thu, Oct 25, 2018 at 10:25 AM Alexandre Rafalovitch wr

Re: Internal Solr communication question

2018-10-25 Thread Emir Arnautović
Hi Fernando, I did not look at code and not sure if there is special handling in case of a single shard collection, but Solr does not have to choose local shard to query. It assumes that one node will receive all requests and that it needs to balance. What you can do is add preferLocalShards=tru

Re: Solr 7.5/skg

2018-10-25 Thread Alexandre Rafalovitch
That's being worked on as well. We've migrated the documentation from Confluence to standalone setup, so not all the pieces are in place yet. Regards, Alex. On Thu, 25 Oct 2018 at 10:12, David Hastings wrote: > > Thanks very much! for being a search product, the documentation isn't very > sea

Re: Internal Solr communication question

2018-10-25 Thread Fernando Otero
Hey Shawn Thanks for your answer!. I changed the config to 1 shard with 7 replicas but I still see communication between nodes, is that expected? Each node has 1 shard so it should have all the data needed to compute, I don't get why I'm seeing communication between them. Thanks On Tue, Oct 2

Re: Solr 7.5/skg

2018-10-25 Thread David Hastings
Thanks very much! for being a search product, the documentation isn't very search friendly :) On Thu, Oct 25, 2018 at 9:29 AM Alexandre Rafalovitch wrote: > I think you are looking for: > > http://lucene.apache.org/solr/guide/7_5/json-facet-api.html#semantic-knowledge-graphs > > Or, as a second

Re: Solr Stream vs Export Request Handlers

2018-10-25 Thread Joel Bernstein
I'm not sure why /stream is exporting faster then /export. It may be that the different approaches in the client are the reason for the difference. But the /export handler would be used in both scenarios if you specify qt=/export in the search() Streaming Expression. Joel Bernstein http://joelsol

Re: Streaming rollUp vs Streaming facet

2018-10-25 Thread Joel Bernstein
Your use case is somewhat special in that it involves 10 fields. With that many nested facets the JSON facet API may or may not outperform streaming rollups. For most other cases JSON facet API will outperform rollups. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Oct 17, 2018 at 11:21

Re: Solr 7.5/skg

2018-10-25 Thread Alexandre Rafalovitch
I think you are looking for: http://lucene.apache.org/solr/guide/7_5/json-facet-api.html#semantic-knowledge-graphs Or, as a second option, http://lucene.apache.org/solr/guide/7_5/stream-source-reference.html#significantterms Regards, Alex. On Thu, 25 Oct 2018 at 08:47, David Hastings wrote: >

Question about SynonymGraphFilter

2018-10-25 Thread Gianpiero Sportelli
Hi, I have a question about SynonymGraphFilter. During the query parsing I expected a query phrase for multi word synonyms but the query produced is an or of all the tokens that compose the multi word. is the correct behavior? I attach a test for this question. Examples: query: "text analysis

Solr 7.5/skg

2018-10-25 Thread David Hastings
Hey all, I was going throught the Solr 7.5 documentation: http://lucene.apache.org/solr/guide/7_5/index.html and it appears to be incomplete. last week Trey Grainger gave a presentation about the skg plugin, and said it was now included in the 7.5 distribution. There are no references to using i

Re: Securying ONLY the web interface console

2018-10-25 Thread Amanda Shuman
Thanks - but I think I'm past those steps now. I set up an nginx reverse proxy through the plesk panel initially, so that is fine. Binding it to port 8983 seems to be the issue. Anyways, I think I'll try out the instructions listed here and cross my fingers..: https://talk.plesk.com/threads/unable

different query for different dictionaries

2018-10-25 Thread Dan Rosher
Hi, If I have 2 fields e.g. location and products then I might have 2 dictionaries spell_location spell_products I cannot do for example spellcheck..q e.g. spellcheck.spell_location.q=... only spellcheck.q=... Does anyone have a workaround for this limitation? Cheers Dan

Re: How to add two fields (mobile,year) in SolrCloud Collection

2018-10-25 Thread Yogendra Kumar Soni
You need to change local copy of schema.xml and upload that schema.xml to zookeeper using upconfig command. you need to specify -n ( should be same as collection name ). https://lucene.apache.org/solr/guide/6_6/solr-control-script-reference.html#SolrControlScriptReference-UploadaConfigurationSet

Reading data using Tika to Solr

2018-10-25 Thread Martin Frank Hansen (MHQ)
Hi, I am trying to read content of msg-files using Tika and index these in Solr, however I am having some problems with the OfficeParser(). I keep getting the error java.lang.NoClassDefFoundError for the OfficeParcer, even though both tika-core and tika-parsers are included in the build path.

How to add two fields (mobile,year) in SolrCloud Collection

2018-10-25 Thread kbmanikanta90
Hello Everyone, I need small help, we have trying to migrate Solr Standalone to Solr Cloud. During this journey,we have done R and D a lot but we have unable to find any article related to our problem. Can anyone please help out this problem We have run zookeeper separately in local and star

Re: Solr Stream vs Export Request Handlers

2018-10-25 Thread Kamal Kishore Aggarwal
Any update on this. Regards Kamal On Thu, Oct 18, 2018 at 11:50 AM Kamal Kishore Aggarwal < kkroyal@gmail.com> wrote: > Hi, > > Thanks again Joel for your reply. I have noted your suggestions. > > I observed one more thing while using solrj to fetch the data using > /stream with export and d