Hi, I use 7.5.0 Solr
Why do I get two different results for similar requests?
First req/res:
{
"query": "*:*",
"limit": 0,
"filter": [
"{!parent which=kind_s:edition}condition_s:0",
"{!parent which=kind_s:edition}price_i:[* TO 75]"
]
}
{
"response": {
"numFound": 453,
Hi,
What is your full query path or URL that you pass for the query?
And how is your setting like for the edismax in your solrconfig.xml?
Regards,
Edwin
On Fri, 26 Oct 2018 at 06:24, Nicky Mastin wrote:
>
> Oddity with edismax and queries involving boolean operators. Here's the
> "parsedquery
Hello,
I could not find the process that parse tika.config parameter from solr
request.
Maybe, tika.config parameter can only be defined in solrconfig.xml as
following.
tika-config.xml
true
ignored_
true
links
ignored_
Thanks,
Yasufumi
2018年10月26日(金) 7:07 Robertson
preferLocalShards is a bit of a misnomer. I usually think of it as
"don't go to another Solr node if possible".
On Thu, Oct 25, 2018 at 10:46 AM Fernando Otero wrote:
>
> Thanks Emir!
> I was already looking at preferLocalShards but I wasn't sure it'll help
> with only 1 shard, I'll give it a
Oddity with edismax and queries involving boolean operators. Here's the
"parsedquery_toString" from two different queries:
input: "dog AND kiwi":
https://apaste.info/gaQl
input: "dog OR kiwi":
https://apaste.info/sBwa
Both queries return the same number of results (389). The query with OR was
Hello all,
Currently trying to define a tika config to use when posting a pdf to Solr Cell
as we may want to override the default tika configuration depending on type of
document being ingested.
In the docs it lists tika.config as an input parameter to the Solr Cell
endpoint. Though in my test
Hello all,
I am running a solr 6.6.3 3-shard cloud with one main collection that
contains 587,371,821 rows of data. One of the fields in this collection is
names. We are currently running into an issue with fuzzy searches on name
where it seems unable to get all possible values for a number of dif
If you’re processing actual msg (not eml), you’ll also need poi and
poi-scratchpad and their dependencies, but then those msgs could have
attachments, at which point, you may as just add tika-app. :D
On Thu, Oct 25, 2018 at 2:46 PM Martin Frank Hansen (MHQ)
wrote:
> Hi Erick and Tim,
>
> Thanks
Hi Erick and Tim,
Thanks for your answers, I can see that my mail got messed up on the way
through the server. It looked much more readable at my end 😉 The attachment
simply included my build-path.
@Erick I am compiling the program using Netbeans at the moment.
I updated to tika-1.7 but that d
One comment to complicate Erick's already-good advice.
> If a doc that needs to go to shard2 is received by a replica on shard1, it
> must be forwarded to the leader of shard1, introducing an extra hop.
Definitely true, but I don't think that's the only factor in the
relative performance of CUSC
Use a bf of 10 * RANK. That will give the same ordering as dividing the score
by 10 and adding RANK.
There are problems with additive boosts, so I strongly recommend looking at the
“boost” parameter, which is a multiplicative boost. That is more stable over a
wide range of score values.
wunder
To follow up w Erick’s point, there are a bunch of transitive dependencies
from tika-parsers. If you aren’t using maven or similar build system to
grab the dependencies, it can be tricky to get it right. If you aren’t
using maven, and you can afford the risks of jar hell, consider using
tika-app or
We use ranking below 100 and yes it is float.
> On Oct 25, 2018, at 1:08 PM, David Hastings
> wrote:
>
> is this RANK value stored as a float/integer? and whats the range? one
> idea is you could use edismax and have a really possible long boost query:
> RANK:[1 TO 2]^10 OR RANK:[3 TO 4]^9
>
Thanks Emir!
I was already looking at preferLocalShards but I wasn't sure it'll help
with only 1 shard, I'll give it a try
On Thu, Oct 25, 2018 at 11:26 AM Emir Arnautović <
emir.arnauto...@sematext.com> wrote:
> Hi Fernando,
> I did not look at code and not sure if there is special handling
is this RANK value stored as a float/integer? and whats the range? one
idea is you could use edismax and have a really possible long boost query:
RANK:[1 TO 2]^10 OR RANK:[3 TO 4]^9
but this isnt actually a great idea and gets sloppy fast. you could apply
boost at index time, or a function query
Martin:
The mail server is pretty aggressive about stripping attachments, your
png didn't come though. You might also get a more informed answer on
the Tika mailing list.
That said (and remember I can't see your png so this may be a silly
question), how are you executing the program .vs. compilin
Hi
Is there a way to achieve the following -
We have a RANK field in each document, and essentially, I would like my score
to be influenced by this RANK as follows -
score = score*0.1 + RANK
How can I achieve this with function queries?
Thanks!
Yup, thats the one. Thanks.
On Thu, Oct 25, 2018 at 11:54 AM Alexandre Rafalovitch
wrote:
> Probably this one: https://issues.apache.org/jira/browse/SOLR-9418
>
> I am not sure if that's documented yet.
>
> Regards,
>Alex.
> On Thu, 25 Oct 2018 at 11:08, David Hastings
> wrote:
> >
> > Alt
Probably this one: https://issues.apache.org/jira/browse/SOLR-9418
I am not sure if that's documented yet.
Regards,
Alex.
On Thu, 25 Oct 2018 at 11:08, David Hastings
wrote:
>
> Although another of Treys examples, the semantic query parser, Doesn't seem
> to have documentation unless im missi
Thanks Erick for you attention!
My comments below, but supposing that the problem resides in zookeeper
I'll collect more information from zk logs and solr logs and be back soon.
> bq. I've noticed that some replicas stop receiving updates from the
> leader without any visible signs from the clus
Although another of Treys examples, the semantic query parser, Doesn't seem
to have documentation unless im missing something?
On Thu, Oct 25, 2018 at 10:41 AM David Hastings <
hastings.recurs...@gmail.com> wrote:
> Wow, thanks for that. Will do some research and come back with the
> inevitable
A 1 Gb heap is probably too small on the master. Run with 8 Gb like the slaves.
wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)
> On Oct 24, 2018, at 10:20 PM, Bharat Yadav wrote:
>
> Hello Team,
>
> We are now a days frequently facing below issue on o
Hi,
This kind of was one of the problems I was facing recently.
While in my use case I am supposed to be showing spellcheck suggestions
(collated) from two different collections.
To also mention both these collections are using the same schema while they
need to be segregated as for the business n
Wow, thanks for that. Will do some research and come back with the
inevitable questions I will have.
On Thu, Oct 25, 2018 at 10:37 AM Alexandre Rafalovitch
wrote:
> See
> https://www.slideshare.net/arafalov/searching-for-ai-leveraging-solr-for-classic-artificial-intelligence-tasks
> , slides 19
See
https://www.slideshare.net/arafalov/searching-for-ai-leveraging-solr-for-classic-artificial-intelligence-tasks
, slides 19+
But it is not a fully-supported usage, due to
https://issues.apache.org/jira/browse/SOLR-12569 .
So, at your own risk.
Regards,
Alex.
On Thu, 25 Oct 2018 at 10:32,
Another skg question. the significantTerms
says it queries a solrcloud collection, only, is there any way to have it
work on standalone solr/cores as well? the MLT function works fine on
standalone, was really hoping this would as well.
On Thu, Oct 25, 2018 at 10:25 AM Alexandre Rafalovitch
wr
Hi Fernando,
I did not look at code and not sure if there is special handling in case of a
single shard collection, but Solr does not have to choose local shard to query.
It assumes that one node will receive all requests and that it needs to
balance. What you can do is add preferLocalShards=tru
That's being worked on as well. We've migrated the documentation from
Confluence to standalone setup, so not all the pieces are in place
yet.
Regards,
Alex.
On Thu, 25 Oct 2018 at 10:12, David Hastings
wrote:
>
> Thanks very much! for being a search product, the documentation isn't very
> sea
Hey Shawn
Thanks for your answer!. I changed the config to 1 shard with 7
replicas but I still see communication between nodes, is that expected?
Each node has 1 shard so it should have all the data needed to compute, I
don't get why I'm seeing communication between them.
Thanks
On Tue, Oct 2
Thanks very much! for being a search product, the documentation isn't very
search friendly :)
On Thu, Oct 25, 2018 at 9:29 AM Alexandre Rafalovitch
wrote:
> I think you are looking for:
>
> http://lucene.apache.org/solr/guide/7_5/json-facet-api.html#semantic-knowledge-graphs
>
> Or, as a second
I'm not sure why /stream is exporting faster then /export. It may be that
the different approaches in the client are the reason for the difference.
But the /export handler would be used in both scenarios if you specify
qt=/export in the search() Streaming Expression.
Joel Bernstein
http://joelsol
Your use case is somewhat special in that it involves 10 fields. With that
many nested facets the JSON facet API may or may not outperform streaming
rollups. For most other cases JSON facet API will outperform rollups.
Joel Bernstein
http://joelsolr.blogspot.com/
On Wed, Oct 17, 2018 at 11:21
I think you are looking for:
http://lucene.apache.org/solr/guide/7_5/json-facet-api.html#semantic-knowledge-graphs
Or, as a second option,
http://lucene.apache.org/solr/guide/7_5/stream-source-reference.html#significantterms
Regards,
Alex.
On Thu, 25 Oct 2018 at 08:47, David Hastings
wrote:
>
Hi,
I have a question about SynonymGraphFilter.
During the query parsing I expected a query phrase for multi word
synonyms but the query produced is an or of all the tokens that compose
the multi word. is the correct behavior? I attach a test for this
question. Examples: query: "text analysis
Hey all, I was going throught the Solr 7.5 documentation:
http://lucene.apache.org/solr/guide/7_5/index.html
and it appears to be incomplete. last week Trey Grainger gave a
presentation about the skg plugin, and said it was now included in the 7.5
distribution. There are no references to using i
Thanks - but I think I'm past those steps now. I set up an nginx reverse
proxy through the plesk panel initially, so that is fine. Binding it to
port 8983 seems to be the issue. Anyways, I think I'll try out the
instructions listed here and cross my fingers..:
https://talk.plesk.com/threads/unable
Hi,
If I have 2 fields e.g. location and products then I might have 2
dictionaries
spell_location
spell_products
I cannot do for example spellcheck..q e.g.
spellcheck.spell_location.q=... only spellcheck.q=...
Does anyone have a workaround for this limitation?
Cheers
Dan
You need to change local copy of schema.xml and upload that schema.xml to
zookeeper using upconfig command.
you need to specify -n ( should be same as collection name ).
https://lucene.apache.org/solr/guide/6_6/solr-control-script-reference.html#SolrControlScriptReference-UploadaConfigurationSet
Hi,
I am trying to read content of msg-files using Tika and index these in Solr,
however I am having some problems with the OfficeParser(). I keep getting the
error java.lang.NoClassDefFoundError for the OfficeParcer, even though both
tika-core and tika-parsers are included in the build path.
Hello Everyone,
I need small help, we have trying to migrate Solr Standalone to Solr
Cloud. During this journey,we have done R and D a lot but we have unable to
find any article related to our problem.
Can anyone please help out this problem
We have run zookeeper separately in local and star
Any update on this.
Regards
Kamal
On Thu, Oct 18, 2018 at 11:50 AM Kamal Kishore Aggarwal <
kkroyal@gmail.com> wrote:
> Hi,
>
> Thanks again Joel for your reply. I have noted your suggestions.
>
> I observed one more thing while using solrj to fetch the data using
> /stream with export and d
41 matches
Mail list logo