Hi Shruti,
Which version of Solr are you using?
Maybe you can take a look at the below guide first if you haven't done so:
https://lucene.apache.org/solr/guide/7_4/common-query-parameters.html#fq-filter-query-parameter
Regards,
Edwin
On Wed, 5 Sep 2018 at 21:17, shruti suri wrote:
> Hi,
>
> I
People usually just use a string field in place of longs etc.
On Tue, Sep 11, 2018 at 9:15 PM Zahra Aminolroaya
wrote:
>
> Thanks Erick. We used to use TrieLongField for our unique id and in the
> document it is said that all Trie* fieldtypes are casting to
> *pointfieldtypes. What would be the a
Hi John,
Please check the solrQueryParser option. It was removed in version 7.4, so you
will need to set AND as the default in solrconfig.xml or pass the
q.op option when querying to solve this problem. By default Solr uses
"OR", which leads to too many results.
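As a rough illustration of the q.op option, the sketch below builds a select URL that passes q.op=AND with the request. The base URL, core name, and helper name are hypothetical; only the q and q.op parameter names come from Solr itself.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, default_operator="AND"):
    """Build a /select URL that sets the default operator per request.

    base_url is a hypothetical core or collection URL,
    e.g. http://localhost:8983/solr/mycore.
    """
    params = {"q": query, "q.op": default_operator}
    return f"{base_url}/select?{urlencode(params)}"


print(build_select_url("http://localhost:8983/solr/mycore", "red shoes"))
```

Passing the operator per request keeps the behaviour visible in the query itself instead of relying on a server-side default.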
Old Way: In Managed-schema or
Thanks Erick. We used to use TrieLongField for our unique id, and the
documentation says that all Trie* field types are being converted to
*PointField types. What would be the alternative solution?
Best,
Zahra
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
We have a Solr v7 Instance sourcing data from a Data Import Handler with a Solr
data source running Solr v4. When it hits a single server in that instance
directly, all documents are read and written correctly to the v7. When we hit
the load balancer DNS entry, the resulting data import handler
On 9/11/2018 9:20 PM, solrnoobie wrote:
So what we did is we upgraded the instances to 16 gigs and we rarely
encounter this now.
So what we did was to increase the batch size to 500 instead of 50 and it
worked for our test data. But when we tried 1000 batch size, the invalid
content type error r
Hi,
Which version of Solr are you using?
And are your shards on the same machine or on different machines?
Regards,
Edwin
On Tue, 4 Sep 2018 at 18:04, tkg_cangkul wrote:
> Hi, I want to try rendering Solr spatial data from a GeoServer layer.
> when i try to render it from single shard solr
On Tue, Sep 11, 2018 at 11:05 PM Walter Underwood
wrote:
> Have you tried modeling it with multivalued fields?
>
>
That's an interesting idea, but I don't think that would work. We would
lose the concept of "rows". So let's say child1 has col "a" and col "b",
both are turned into multi-value fiel
Thank you all for the kind and timely reply.
So what we did is we upgraded the instances to 16 gigs and we rarely
encounter this now.
So what we did was to increase the batch size to 500 instead of 50 and it
worked for our test data. But when we tried 1000 batch size, the invalid
content type err
On Tue, Sep 11, 2018 at 11:00 PM Shawn Heisey wrote:
> On 9/11/2018 8:35 PM, John Smith wrote:
> > The problem is that the math isn't a simple case of adding up all the row
> > counts. These are "left outer join"s. In sql, it would be this query:
>
> I think we'll just have to conclude that I do
Have you tried modeling it with multivalued fields?
Also, why do you think Solr is a good solution? What is the problem?
wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)
> On Sep 11, 2018, at 7:35 PM, John Smith wrote:
>
> On Tue, Sep 11, 2018 at 9:32 PM
On 9/11/2018 8:35 PM, John Smith wrote:
The problem is that the math isn't a simple case of adding up all the row
counts. These are "left outer join"s. In sql, it would be this query:
I think we'll just have to conclude that I do not understand what you
are doing. I have no idea what "left ou
I have found that it is due to insufficient training data related
to that feature.
After I added more entries related to that feature to the training
data, the issue did not occur.
Regards,
Edwin
On Tue, 28 Aug 2018 at 15:56, Zheng Lin Edwin Yeo
wrote:
> Hi,
>
> I am using Sol
Hi,
I am working on implementing Solr LTR in Solr 7.4.0 by using the
NeuralNetworkModel for the feature selection and model training, and I have
found this site which uses RankNet:
https://github.com/airalcorn2/Solr-LTR#RankNet
Has anyone tried this before? And what is the format of the tra
On Tue, Sep 11, 2018 at 9:32 PM Shawn Heisey wrote:
> On 9/11/2018 7:07 PM, John Smith wrote:
> > header: 223,580
> >
> > child1: 124,978
> > child2: 254,045
> > child3: 127,917
> > child4:1,009,030
> > child5: 225,311
> > child6: 381,561
> > child7: 438,315
hi, all.
we recently migrated to cloud. part of that migration jumped us from 6.1 to
7.4.
one example query produces 42 results on our old solr instance and 19k
results on our new cloud instance.
the analyzer is the same aside from WordDelimiterFilterFactory moving over
to the graph variat
On 9/11/2018 7:07 PM, John Smith wrote:
header: 223,580
child1: 124,978
child2: 254,045
child3: 127,917
child4:1,009,030
child5: 225,311
child6: 381,561
child7: 438,315
child8: 18,850
Trying to index that into solr with a flatfile schema, blows up i
>
> On 9/7/2018 7:44 PM, John Smith wrote:
> > Thanks Shawn, for your comments. The reason why I don't want to go flat
> > file structure, is due to all the wasted/duplicated data. If a department
> > has 100 employees, then it's very wasteful in terms of disk space to
> repeat
> > the header data
Hmm.
I guess the issue is that the handler is the one doing parsing, so the
input document can be in XML or JSON or CSV. And MLT as a handler is then a
competing end point.
So you actually want to use it later in a pipeline but with a document
constructed on the fly and not stored.
This may not
Thanks Alex. Yes, I've been using the MoreLikeThisHandler, but that takes
a block of text as input posted to the request, not the structured json
that corresponds to the fields.
On Tue, Sep 11, 2018 at 10:14 AM Alexandre Rafalovitch
wrote:
> There are three ways to trigger MLT:
> https://lucene
OK, you just don't want to group, as Shawn says. group.main=true
just flattens the list, but Solr still does all the work of grouping
and returns the docs in group order. So instead of getting
value=1 [
doc31
doc64
]
value=2 [
doc98
doc6
]
you get
doc31
doc64
doc98
doc6
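The flattening described above can be sketched as follows. This is a minimal illustration, assuming the usual grouped response layout (a "groups" list whose entries each carry a "doclist" with "docs"); the helper name and sample data are hypothetical, and real docs would be field dictionaries, not bare ids.

```python
def flatten_groups(field_groups):
    """Return the docs of every group, in group order, as one flat list."""
    flat = []
    for group in field_groups["groups"]:
        flat.extend(group["doclist"]["docs"])
    return flat


# Hypothetical grouped section for one group.field, mirroring the example above.
grouped = {
    "matches": 4,
    "groups": [
        {"groupValue": 1,
         "doclist": {"numFound": 2, "start": 0, "docs": ["doc31", "doc64"]}},
        {"groupValue": 2,
         "doclist": {"numFound": 2, "start": 0, "docs": ["doc98", "doc6"]}},
    ],
}

print(flatten_groups(grouped))
```

The docs stay in group order, which is why a sort applied with grouping enabled does not look like a sort over the whole result set.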
Best,
Erick
On Tue,
bq. We're using NRTCachingDirectoryFactory
Which uses MMapDirectory under the covers.
The file handle counts will vary. During merging,
files are held open while segments are merged,
so both new and old segments are open. Once merged,
the files in the old segment will be deleted so some
variance is
Hi Mikhail, Shawn,
thanks for your prompt answer.
The problem is that the indexed documents have dozens of fields and usually
they are different for each document.
For example document id 1 has a few generic fields like title, description
and all the attributes like attr_1224, attr_4343, attr_4454,
On Tue, 2018-09-11 at 12:43 -0600, Shawn Heisey wrote:
> On 9/11/2018 12:14 PM, Boris Pasko wrote:
> >
> > >
> > > Run top, press shift-M to sort by memory usage, then grab a
> > atop: http://oi68.tinypic.com/10pokkk.jpg
> > top: http://oi63.tinypic.com/msbpfp.jpg
> Looking at the second one:
>
> T
You have not shed any light on what the reason for the container restart was,
and there is too little information about your setup and Solr usage to guess
what goes on. Whether 4Gb is sufficient or not depends on how much data and
queries you plan for each shard to handle, how much heap you give
On 9/11/2018 12:00 PM, Venkateswarlu Bommineni wrote:
What i am expecting is (it might be silly) if i put group.main=true and
sort by price then the results are:
{
"priceValueGLP_usd_double":32015.0,
"sapRank_int":446},
{
"priceValueGLP_usd_double":32015.
On 9/11/2018 12:14 PM, Boris Pasko wrote:
Run top, press shift-M to sort by memory usage, then grab a
atop: http://oi68.tinypic.com/10pokkk.jpg
top: http://oi63.tinypic.com/msbpfp.jpg
Looking at the second one:
The SHR value is showing 90GB.
Your Java process is in actuality only using in th
Hi. We're running Solr 6.6.1 (SolrCloud, 3 clusters). Recently I noticed
it became significantly slower to respond and did some basic checks on
servers. There is little IO, a bit of CPU usage (110% user, 3090% idle),
but one thing is very strange - the resident memory usage of the Solr.
Despite th
>Run top, press shift-M to sort by memory usage, then grab a
atop: http://oi68.tinypic.com/10pokkk.jpg
top: http://oi63.tinypic.com/msbpfp.jpg
–
The information contained in this message and any attachments may be
confidential and/or restricted and protected f
On Tue, 2018-09-11 at 10:26 -0700, Erick Erickson wrote:
> The memory usage is probably MMapDirectory, see:
> http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.ht
> ml,
> that's not a problem I'd guess.
We're using NRTCachingDirectoryFactory
> The open file handles... and ones bei
Erik and Shawn,
Sorry for the confusion.
Yes, Solr is sorting inside the grouped results, but not across all the results.
Example: we got 6 records in the Solr response. If I do *sort=price desc,
it sorts inside each group.*
*But the requirement is to sort on all the results.*
*Current result
On 9/11/2018 10:23 AM, Vincenzo D'Amore wrote:
I suppose I should be able to remove attr_1 and add attr_3 with one atomic update.
Like this:
curl -X POST -H 'Content-Type: application/json' \
  'http://localhost:8983/solr/gettingstarted/update?versions=true&commit=true' \
  --data-binary '
[
{
On 9/11/2018 11:07 AM, Boris Pasko wrote:
Hi. We're running Solr 6.6.1 (SolrCloud, 3 nodes). Recently I noticed
it became significantly slower to respond and did some basic checks on
servers. There is little IO, a bit of CPU usage (110% user, 3090%
idle),
but one thing is very strange - the resid
How this all works will be much clearer if you don't use "group.main=true"
But you still haven't _shown_ us what you _expect_.
In the second query, Solr is doing exactly what you're telling it to.
Return groups of up to three lowest-priced docs in each group,
ordering the groups by the lo
The memory usage is probably MMapDirectory, see:
http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html,
that's not a problem I'd guess.
The open file handles... and ones being open many times.
1> how many _total_ segment files do you have in all the replicas in
the JVM? A single
On 9/11/2018 10:14 AM, Venkateswarlu Bommineni wrote:
Please find the response and query when grouping and sorting by rank :
I see no evidence of grouping happening in either of those responses.
They look like standard responses do when grouping is not enabled.
Here's an example of a grouped
> Same picture on Solr 6.6.2, tested on various Oracle JVMs ranging
> from 1.8.0_171 to 1.8.0_171
From 1.8.0_171 to 1.8.0_181
Shalvak,
On 9/11/18 01:51, Shalvak Mittal (UST, ) wrote:
> I have recently installed solr 7.2.1 in my ubuntu 16.04 system.
> While creating a new core, the solr logging shows an error saying
>
>
> " Caused by: org.apache.solr.common.SolrException:
Hi. We're running Solr 6.6.1 (SolrCloud, 3 nodes). Recently I noticed
it became significantly slower to respond and did some basic checks on
servers. There is little IO, a bit of CPU usage (110% user, 3090%
idle),
but one thing is very strange - the resident memory usage of the Solr.
Despite the -
On 9/10/2018 5:45 PM, dshih wrote:
Based on what you said, is my query supposed to work as is if I set
luceneMatchVersion=7.1.0? It does not appear to.
It does look like a luceneMatchVersion check was added to the change in
SOLR-11501, so I would expect that to work. Setting luceneMatchVersi
On 9/10/2018 11:51 PM, Shalvak Mittal (UST, ) wrote:
I have recently installed solr 7.2.1 in my ubuntu 16.04 system. While creating
a new core, the solr logging shows an error saying
" Caused by: org.apache.solr.common.SolrException: fips module was not loaded."
I have never heard of a module
On 9/11/2018 12:22 AM, Bineesh wrote:
Need help on Nutch 1.15 indexing issues. We are using Nutch 1.15 and Solr
7.3.1 in our setup
1 : Is there a way i can mention multiple collections in the Nutch 1.15
indexwriters.xml file for the same ?
I see collection works fine if i hardcoded the collec
Hello, Vincenzo.
What about adding 1 into "attr_" : [ "set" : null ], ?
On Tue, Sep 11, 2018 at 7:23 PM Vincenzo D'Amore wrote:
> Hi Solr gurus :)
>
> I have a delicious question (that I'm struggling with), really hope that
> someone can help me.
>
> There is a document with many fields bu
Hi Solr gurus :)
I have a delicious question (that I'm struggling with), really hope that
someone can help me.
There is a document with many fields but I have to modify only a few of them.
I thought of using atomic update but it seems that I cannot replace an entire
list of dynamic fields.
Here I t
Please find the response and query when grouping and sorting by rank :
http://localhost:8983/solr/master_shaneco_Product_flip/select?indent=on&q=rank:[1%20TO%20*]&wt=json&group=true&group.main=true&group.field=rank&rows=50&fl=code_String,price,rank&sort=rank+asc&group.limit=3
{
"responseHeader":{
4 Gb is very small for Solr.
Solr is not designed for Dockerized, fail-often use.
We use a LOT of Docker ECS, but all of our Solr servers are on EC2
instances. That’s about sixty instances in several clusters.
We run an 8 Gb heap for all our Solr instances. Instances in our biggest
cluster (in t
This is still confusing:
bq. But the requirement is to sort on all the results we show to the customer.
What does grouping have to do with that statement? Would it be served
by just _not_ grouping at all? If not, why not?
Please provide a small set of example documents and what you want to
show
ah, great thought. didn't even think of that. we already have a couple
ngram-based fields. will send over to the stakeholder who was attempting
this.
thanks!
--
John Blythe
On Sun, Sep 9, 2018 at 11:31 PM Erick Erickson
wrote:
> First of all, wildcards are evil. Be sure that the reason people
We need your help to make the Apache Washington DC Roadshow on Dec 4th a
success.
What do we need most? Speakers!
We're bringing a unique DC flavor to this event by mixing Open Source
Software with talks about Apache projects as well as OSS CyberSecurity,
OSS in Government and OSS Career
Point-based fields cannot be used for uniqueKey, see:
https://issues.apache.org/jira/browse/SOLR-10829
This should be documented better in the ref guide
On Tue, Sep 11, 2018 at 5:53 AM Zahra Aminolroaya
wrote:
>
> We read that in Solr 7, Trie* fields are deprecated, so we decided to change
> all of ou
There are three ways to trigger MLT:
https://lucene.apache.org/solr/guide/7_4/morelikethis.html
MoreLikeThisHandler allows you to supply text externally. Unfortunately, I
can't find a specific example demonstrating it, so I'm not sure if it is
just a blob of text or a document.
Regards,
Alex.
On 11 Se
Hello,
Using a MoreLikeThisHandler, I was hoping to be able to pass in the post
body a json document (the same format as a document indexed in my core, but
the document in the request is not and should not be added to the core).
I'm thinking it would handle an incoming document similar to how
We read that in Solr 7, Trie* fields are deprecated, so we decided to change
all of our Trie* fields to *pointtype Fields.
Our unique key field type is long, and we changed our long field type
something like below;
We get the error uniqueKey field can not be configured to use a Points based
Fi
Hi,
I have recently installed solr 7.2.1 in my ubuntu 16.04 system. While creating
a new core, the solr logging shows an error saying
" Caused by: org.apache.solr.common.SolrException: fips module was not loaded."
I have downloaded the necessary jar files like cryptoj.jar and copied them in
Hi Team,
Need help on Nutch 1.15 indexing issues. We are using Nutch 1.15 and Solr
7.3.1 in our setup
1: Is there a way I can specify multiple collections in the Nutch 1.15
indexwriters.xml file?
I see the collection works fine if I hardcode the collection name in
indexer_solr_1
2
This is expected behaviour. The shardHandlerFactory element is configured
in solr.xml, not solrconfig.xml. See:
https://lucene.apache.org/solr/guide/7_4/format-of-solr-xml.html
On Tue, 11 Sep 2018 at 11:55, Ash Ramesh wrote:
> Hi,
>
> I tried setting up a bespoke ShardHandlerFactory configura