hll stands for https://en.wikipedia.org/wiki/HyperLogLog
You will not get the exact distinct count, but a distinct count very close to
the real number. It is very fast and memory-efficient for a large number of
distinct values.
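
To make the "close but not exact" point concrete, here is a minimal toy HyperLogLog in Python. It is only a sketch of the general technique, not Solr's implementation: the class name, the SHA-1 based hash and the default of 12 index bits are all assumptions chosen for illustration.

```python
import hashlib
import math


class HyperLogLog:
    """Toy HyperLogLog counter (illustrative only, not Solr's code)."""

    def __init__(self, p=12):
        self.p = p                    # number of index bits (assumed default)
        self.m = 1 << p               # number of registers
        self.registers = [0] * self.m

    def add(self, value):
        # Build a 64-bit hash from the first 8 bytes of SHA-1.
        digest = hashlib.sha1(str(value).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        width = 64 - self.p
        idx = h >> width                      # top p bits select a register
        rest = h & ((1 << width) - 1)         # remaining bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = width - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def count(self):
        alpha = 0.7213 / (1 + 1.079 / self.m)  # bias constant, valid for m >= 128
        raw = alpha * self.m * self.m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction (linear counting).
            return self.m * math.log(self.m / zeros)
        return raw


hll = HyperLogLog()
for i in range(50000):
    hll.add(f"user-{i}")
    hll.add(f"user-{i}")   # duplicates do not change the estimate
print(round(hll.count()))  # an estimate near 50000, not the exact count
```

The memory used is fixed by the number of registers (4096 small integers here), however many distinct values are added; that is where the memory efficiency comes from.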
> On 10.03.2020 at 00:25, Nicolas Paris wrote:
>
>
> Erick Erick
Erick Erickson writes:
> Have you looked at the HyperLogLog stuff? Here’s at least a mention of
> it: https://lucene.apache.org/solr/guide/8_4/the-stats-component.html
I am used to hll in the context of counting distinct values -- cardinality.
I have to admit that section
https://lucene.apache.o
Toke Eskildsen writes:
> JSON faceting allows you to skip the fine counting with the parameter
> refine:
I also tried the facet.refine parameter, but didn't notice any improvement.
>> I am wondering how I could filter the documents to get approximate
>> facets ?
>
> Clunky idea: Introduce a
Have you looked at the HyperLogLog stuff? Here’s at least a mention of it:
https://lucene.apache.org/solr/guide/8_4/the-stats-component.html
Best,
Erick
> On Mar 9, 2020, at 02:39, Nicolas Paris wrote:
>
> Hello,
>
>
> Environment:
> - SolrCloud 8.4.1
> - 4 shards with xmx = 120 GB and ssd
On Mon, 2020-03-09 at 10:39 +0100, Nicolas Paris wrote:
> I want to provide terms facet on a string multivalue field.
> ...
> How to improve brute performances ?
It might help to have everything in a single shard, to avoid the
secondary fine count. But your index is rather large for single-shard
s
Thanks Edwin for the information. I will try highlighting..
On Sun, Apr 14, 2019 at 7:53 PM Zheng Lin Edwin Yeo
wrote:
> You need to set the field which you want to highlight in the hl.fl
> parameter in your solrconfig.xml.
> Highlighting can work in String field.
>
> Regards,
> Edwin
>
> On Mon
son for my Solr jira bug Solr-13367.
This used to work in Solr 5.1.0 but it is broken in newer versions of Solr (at
least in 7.5.0, 7.7.1, 8.0.0)
Karl
-Original Message-
From: Zheng Lin Edwin Yeo
Sent: Sunday, April 14, 2019 9:36 PM
To: solr-user@lucene.apache.org
Subject: Re: Multival
You need to set the field which you want to highlight in the hl.fl
parameter in your solrconfig.xml.
Highlighting can work in String field.
Regards,
Edwin
On Mon, 15 Apr 2019 at 05:53, Kumaresh AK wrote:
> How would I go about changing my field type to facilitate highlighting? I
> tried hl on my
How would I go about changing my field type to facilitate highlighting? I
tried hl on my current schema and the highlight section is empty. I guess I
need to change to multivalue text. Currently it is String Field as these
are identifiers and no analysis needed on those
On Sun, Apr 14, 2019 at
Btw, can it be done by highlighting?
On Sun, Apr 14, 2019 at 3:22 PM Kumaresh AK wrote:
> Hi Mikhail,
> Thanks for the response. I see the results have the details I am looking
> for. But it is not well structured. I sense that the debug=results is for
> understanding the scoring behind the resu
Hi Mikhail,
Thanks for the response. I see the results have the details I am looking
for. But it is not well structured. I sense that the debug=results is for
understanding the scoring behind the results which can be more than what I
am looking for. Is there any other way ? The multivalued field ma
Hello, Kumaresh!
If you turn it to q you may try debug=results&debug.explain.structured=true
with some performance impact.
On Sun, Apr 14, 2019 at 2:41 AM Kumaresh AK wrote:
> Hello!
> I am new to SOLR. This is my field type definition:
>
> > > stored="true" multiValued="true" omitTermFreqAndP
Nope, got to re-index.
bq: Assuming there is a multiValued field called "Name" of type string stored
in index -
bq: I tested both cases with empty index. When I inserted the document after
changing fieldType to StandardTokenizerFactory, it worked fine with the
standard phrase query. But I was
Hi Ahmet,
I have tested this and it doesn't work for existing documents. I couldn't
make much sense of the field analysis. I didn't find an option to see
indexed terms in "Analysis" tab. Instead you feed it the value you want
analyzed and it prints index or query time analysis. Is this what you'
Hi Ethan,
With that type, a standard phrase query should work. If you paste your sample text
in analysis page, you will see indexed terms.
q=Name:"steve wonder" should work. You don't need wildcard search in this case.
Just do a phrase query. (surrounded with quotes)
Ahmet
On Tuesday, June 24,
Ahmet,
Here the xml for the field "Name" - Let me know if I need to update it.
Thanks,
E
On Mon, Jun 23, 2014 at 12:38 PM, Ahmet Arslan
wrote:
> Hi Ethan,
>
> I understand that you are dealing with a legacy system.
>
>
> Can you paste analy
Hi Ethan,
I understand that you are dealing with a legacy system.
Can you paste the analysis chain used for the already indexed docs? I mean the XML
snippet taken from the schema XML.
With this, we will figure out how that text is indexed. We will write our query
according to that info.
Ahmet
On Monday, June
Hey Ahmet, Yes, brackets, commas and quotes are part of fields value.
It's something I inherited and working on improving it.
The field is of type solr.TextField. Adding StandardTokenizer solves the
problem for the new documents. It doesn't work on already indexed docs.
Is there a solution for
Hi Ethan,
XML response is helpful, so you still have brackets, commas, quotes in field
value?
What is the field type you use for Name field?
If you tokenize it with StandardTokenizer, a simple phrase query would do the trick:
q=Name:"Steve Wonder"
Also consider cleaning up your values. Why would you s
Ahmet,
Yes, they were part of JSON output, Here is the xml response
[["Hifte", "Grop", "", ""]] []
[["Ethan",
"G", "", ""],["Steve", "Wonder", "", ""]]
The solution suggested by Jack to look up Steve Wonder doesn't work, as the
asterisk is replaced by the default search field. Any suggestions?
Thanks,
Hi,
What are these square brackets, back slashes, quotes?
Are they part of the JSON output? Can you paste the human-readable XML response
writer output?
Thanks,
Ahmet
On Friday, June 20, 2014 12:17 AM, Ethan wrote:
Ahmet,
Assuming there is a multiValued field called "Name" of type string stored
i
sky
-Original Message-
From: Ethan
Sent: Thursday, June 19, 2014 5:16 PM
To: solr-user ; Ahmet Arslan
Subject: Re: Multivalue wild card search
Ahmet,
Assuming there is a multiValued field called "Name" of type string stored
in index -
//Doc 1
"id" : 23512
"Ho
Ahmet,
Assuming there is a multiValued field called "Name" of type string stored
in index -
//Doc 1
"id" : 23512
"HotelId" : [
"12",
"23",
"12"
]
"Name" : [
"[[\"Ethan\", \"G\", \"\"],[\"Steve\", \"Wonder\", \"\"]]",
"[]",
"[[\"hifte\", \"Grop\", \"\"]]"
]
// Doc 2
"id" : 23513
"Ho
Wildcard, fuzzy, and regex query do work on multivalued fields, but on only
a single value at a time. Your match cannot occur across values.
-- Jack Krupansky
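
A small Python simulation of that per-value behaviour follows. It is only a sketch: the helper name matches_multivalued is hypothetical, and fnmatchcase stands in for Solr's wildcard matching, which it is not.

```python
from fnmatch import fnmatchcase


def matches_multivalued(values, pattern):
    """Return True if at least one single value matches the wildcard pattern."""
    # Each value is tested on its own; values are never concatenated.
    return any(fnmatchcase(v, pattern) for v in values)


codes = ["12344", "4534", "674"]
print(matches_multivalued(codes, "45*"))        # True: the single value "4534" matches
print(matches_multivalued(codes, "12344*674"))  # False: the match would span two values
```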
-Original Message-
From: Ethan
Sent: Thursday, June 5, 2014 12:52 PM
To: solr-user
Subject: Re: Multivalue wild card search
Hi Ethan,
It is hard to understand your example. Can you re-write it? Using xml?
On Friday, June 6, 2014 9:07 PM, Ethan wrote:
Bumping the thread to see if anyone has a solution.
On Thu, Jun 5, 2014 at 9:52 AM, Ethan wrote:
> Wildcard search does work on multiValued field. I was able t
Bumping the thread to see if anyone has a solution.
On Thu, Jun 5, 2014 at 9:52 AM, Ethan wrote:
> Wildcard search does work on multiValued field. I was able to pull up
> records for following multiValued field -
>
> Code : [
> "12344",
> "4534",
> "674"
> ]
>
> q=Code:45* fetched the correct do
Wildcard search does work on multiValued field. I was able to pull up
records for following multiValued field -
Code : [
"12344",
"4534",
"674"
]
q=Code:45* fetched the correct document. It doesn't work in
quotes (q="Code:45*"), however. Is there a workaround?
On Thu, Jun 5, 2014 at 9:34 AM, Et
Are you implying there is no way to look up on a multiValued field with a
substring? If so, then how is it usually handled?
On Wed, Jun 4, 2014 at 4:44 PM, Jack Krupansky
wrote:
> Wildcard, fuzzy, and regex query operate on a single term of a single
> tokenized field value or a single string f
Wildcard, fuzzy, and regex query operate on a single term of a single
tokenized field value or a single string field value.
-- Jack Krupansky
-Original Message-
From: Ethan
Sent: Wednesday, June 4, 2014 6:59 PM
To: solr-user
Subject: Multivalue wild card search
I can't seem to find a
Hi Barbra,
Solr needs to see a String for each point value, not a 2-element array.
Your doc should look like:
[{"id":"054ac6377d6ca4ad387f73b063000910","keywords":["time", "trouble",
"exactly"],"description":"a anno is an anno is an anno",
"location":["33.44844800999897,-111.98840074003"
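
If the points arrive as 2-element arrays, a conversion step along these lines could be run before posting the documents. This is a sketch only: the function name to_solr_points is made up for illustration, and it assumes the array order is latitude, then longitude.

```python
def to_solr_points(pairs):
    """Turn [[lat, lon], ...] into ["lat,lon", ...] strings."""
    # Assumes each pair is ordered latitude first, longitude second.
    return [f"{lat},{lon}" for lat, lon in pairs]


doc = {"id": "example-1", "location": [[33.5, -111.25], [40.0, -75.5]]}
doc["location"] = to_solr_points(doc["location"])
print(doc["location"])  # ['33.5,-111.25', '40.0,-75.5']
```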
Hi All,
I managed to *solve* the issue I had posted earlier with respect to
multiValued.
Here is how the query is supposed to be configured in *data-config.xml*
Description: in the below, first query has associated table images. Each
person would have many images. Here the JSON/XML would return all
Hi All,
I wanted to achieve something similar
[ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] }]
[{"person_id" : "111", "documents" :
["doc_1","doc_2","doc_3","doc4,doc5,doc6"]}]
Meaning one person could have more than one document.
how do we need to write the query, an
Try my suggested field definition and see if it helps with faceting. It
should. Try it on a small example or a fake schema.
But I would still recommend escalating the problem up the chain to an
architect or similar. Because I bet that data is stored in multiple places
(e.g. in the database) and yo
@Alexandre Rafalovitch Thanks.
yeah you got my point.
training_skill:["c", "c++", "php", "java", ".net"]
but it is not possible for me to split "php,java,.net" because data can
vary and data is very large. i mean i have to perform on 5 line data.
it might come["c++,php,java",".net","c#,
I think the problem here is that the list has 3-values, but the last one is
actually a set of several as well. Anurag seems to be able to split them
into separate values whether they came as individual array items or as part
of joint list. So, we have a mix of multiValue submission and desire to
spl
On 18 January 2013 00:31, anurag.jain wrote:
>
> [ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"]
> }
> ]
>
> actually i want to tokenize in c c++ php java .net
What do you mean by "tokenize" in this case? It has
been a while since I had occasion to use JSON input,
and
actually [ { "last_name" : "jain", "training_skill":*["c", "c++",
"php,java,.net"]* } ] training_skill is list. and if i want to store in
string field type then it will include [ and , also. so how to avoid ? or it
will not.
or do you have any other field type definition through which my wor
You mean to say that the problem is with json which is being ingested.
What you are trying to achieve is to split the values on the
basis of comma and index them as multiple values.
What problem are you facing in indexing JSON in the format Solr expects? If you
don't have control over it,
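
One way to do the comma split on the client side before indexing is sketched below. The helper name split_values is hypothetical, and it assumes a comma never appears inside a legitimate single value.

```python
def split_values(values, sep=","):
    """Flatten a list whose items may themselves be separator-joined lists."""
    out = []
    for v in values:
        # Split each item and drop empty pieces such as those from "a,,b".
        out.extend(part.strip() for part in v.split(sep) if part.strip())
    return out


doc = {"last_name": "jain", "training_skill": ["c", "c++", "php,java,.net"]}
doc["training_skill"] = split_values(doc["training_skill"])
print(doc["training_skill"])  # ['c', 'c++', 'php', 'java', '.net']
```

It does not matter whether the skills arrive as separate array items or joined in one string; the result is the same flat list, which a multiValued string field can then facet on.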
[ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] }
]
actually i want to tokenize in c c++ php java .net
so through this i can make them as facet.
but problem is in list
"training_skill":["c", "c++", *"php,java,.net"*]
--
View this message in context:
http://l
you just need to make the field as multivalued.
type should be set based on your search requirements.
On Thu, Jan 17, 2013 at 11:27 PM, anurag.jain wrote:
> my json file look like
>
> [ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] }]
>
> can u please suggest me how
hi erick,
I will be searching only on search_term.
I did exactly as u said in application layer,
I was not sure how multi-valued fields works in co-relation
--
View this message in context:
http://lucene.472066.n3.nabble.com/multivalue-or-denormalise-tp3197942p3198710.html
Sent from the Solr
I'm a little confused. Are you searching against these
different titles or is the search something else and you're really
only interested in displaying different titles for documents returned
for the query?
If it's just a display issue, you can use multivalued fields, the order
in which you put va
Yep, my schema does this all day long.
--
View this message in context:
http://lucene.472066.n3.nabble.com/MultiValue-dynamicField-and-copyField-tp965941p966536.html
Sent from the Solr - User mailing list archive at Nabble.com.
I guess the following works.
A. similar to your option 2, but using the filtercache
fq=-item_id:001 -item_id:002
B. similar to your option 3, but using the filtercache
fq=-users_excluded_field:
the advantage being that the filter is cached independently from the rest of
the query so it can be re
You should do a preprocessing step (split your document into as many documents as
there are values in your multivalue field, with principalFlag:T in the
first document) before indexing the data with that logic
Marco Martínez Bautista
http://www.paradigmatecnologico.com
Avenida de Europa, 26. Ática
hi Marco,
Thanks for quick reply..
I have another doubt: in the 2nd solution, how do I set the flag for duplicate values?
I am not sure about the number of duplicate rows (it could be a random
number..)
so how can I set the flag..
thank
On Wed, May 12, 2010 at 12:59 PM, Marco Martinez <
mmarti...@paradigmatec
Hi,
2nd solution:
Do not use multiValue fields; instead use two single-valued fields. In your
example that would be:
doc1:
dept: student1
city: city1
principalFlag:T
doc2:
dept: student2
city: city2
principalFlag:F
So, if you search without specifying any city or dept, you should put
principalFlag:T to not get dup
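
The preprocessing for this flag approach can be sketched in Python as follows. The function name flatten and the sample field values are assumptions; a real index would also need a unique key per generated document, which is left out here.

```python
def flatten(doc, paired_fields, flag_field="principalFlag"):
    """Expand one document with parallel multivalued fields into flat documents."""
    # Fields that are not multivalued are copied into every generated document.
    base = {k: v for k, v in doc.items() if k not in paired_fields}
    out = []
    # zip pairs up the i-th value of each multivalued field.
    for i, row in enumerate(zip(*(doc[f] for f in paired_fields))):
        flat = dict(base)
        flat.update(zip(paired_fields, row))
        flat[flag_field] = "T" if i == 0 else "F"  # only the first copy is principal
        out.append(flat)
    return out


source = {"name": "n1", "dept": ["student1", "student2"], "city": ["city1", "city2"]}
for d in flatten(source, ["dept", "city"]):
    print(d)
```

Because the flag is set by position, it does not matter how many values a document has; the first generated document always gets T and every other one gets F.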
Hi Marco,
I am trying to patch for collapse component support (till now no luck)..
In mean time I would like to know the 2nd and 3rd option you mentioned
(logic in solrj)..
with regards
On Thu, May 6, 2010 at 2:36 PM, Marco Martinez <
mmarti...@paradigmatecnologico.com> wrote:
> Hi Jonty,
>
> I
Hi Jonty,
I think you have three possible solutions:
1. Use the collapse component with your name field to avoid any
duplicate documents.
2. Create a simple logic in your index with flags, like one flag to
determine the first element of the same document (in your example you wil
thanks
:General solution is to index 3 different SolrDocument in your example. id
and name fields will repeat themselves. All fields will be single-valued.
if I am indexing 3 different field then if user is searching by name + dept
then it will return duplicate value.. is there any other best pos
> recently I start to work on solr, So I am still very new to
> use solr. Sorry
> if I am logically wrong.
> I have two table, parent and referenced (child).
>
> for that I set multivalue field following is my schema
> details
> stored="true" required="true"
> />
>
>
> indexed="true" stor
Have a look at UninvertedField.java. I think that might help.
On Sep 23, 2009, at 2:35 PM, Amit Nithian wrote:
Are there any good implementations of a field cache that will return
all
values of a multivalued field? I am in the process of writing one
for my
immediate needs but I was wonderin
Yes. It's totally acceptable.
2009/1/30 Bruno Aranda
> Hi, is it possible to create a dynamic field that is multi valued?
>
> Cheers,
>
> Bruno
>
--
Alexander Ramos Jardim