Re: multivalue faceting term optimization

2020-03-09 Thread Jörn Franke
hll stands for https://en.wikipedia.org/wiki/HyperLogLog You will not get the exact distinct count, but a distinct count very close to the real number. It is very fast and memory efficient for a large number of distinct values. > On 10.03.2020 at 00:25, Nicolas Paris wrote: > >  > Erick Erick
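A minimal sketch of how such an approximate distinct count could be requested through the stats component linked in this thread. It only builds the request parameters and does not contact a server. The field name "keywords" and the helper name are assumptions for illustration; the cardinality local param is the stats-component option that returns the HyperLogLog-based estimate.

```python
from urllib.parse import urlencode


def cardinality_params(field, query="*:*"):
    """Build stats-component parameters asking for an approximate
    distinct count (HyperLogLog estimate) of one field."""
    return {
        "q": query,
        "rows": 0,  # only the statistic is wanted, not the documents
        "stats": "true",
        # the cardinality local param enables the HLL-based estimate
        "stats.field": "{!cardinality=true}" + field,
    }


# "keywords" is a hypothetical field name
params = cardinality_params("keywords")
query_string = urlencode(params)
print(query_string)
```

The resulting query string would be appended to a collection's /select URL. The number returned is an estimate, as described above, not an exact count.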

Re: multivalue faceting term optimization

2020-03-09 Thread Nicolas Paris
Erick Erickson writes: > Have you looked at the HyperLogLog stuff? Here’s at least a mention of > it: https://lucene.apache.org/solr/guide/8_4/the-stats-component.html I am used to hll in the context of count distinct values -- cardinality. I have to admit that section https://lucene.apache.o

Re: multivalue faceting term optimization

2020-03-09 Thread Nicolas Paris
Toke Eskildsen writes: > JSON faceting allows you to skip the fine counting with the parameter > refine: I also tried the facet.refine parameter, but didn't notice any improvement. >> I am wondering how I could filter the documents to get approximate >> facets ? > > Clunky idea: Introduce a
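For reference, a sketch of a JSON Facet API terms facet with refinement switched off, which is the setting Toke mentions. The field name "keywords", the facet label "top_terms" and the helper name are assumptions; the code only builds the request body.

```python
import json


def terms_facet_body(field, limit=10, refine=False):
    """Build a JSON Facet API body for a terms facet.

    With refine=False the second (fine counting) phase across shards
    is skipped, so counts for some terms may be approximate.
    """
    return {
        "query": "*:*",
        "limit": 0,  # no documents, only facet counts
        "facet": {
            "top_terms": {
                "type": "terms",
                "field": field,
                "limit": limit,
                "refine": refine,
            }
        },
    }


# "keywords" is a hypothetical multivalued string field
body = terms_facet_body("keywords")
print(json.dumps(body, indent=2))
```

Whether this helps depends on the index; as reported above, changing the refine setting made no noticeable difference in this case.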

Re: multivalue faceting term optimization

2020-03-09 Thread Erick Erickson
Have you looked at the HyperLogLog stuff? Here’s at least a mention of it: https://lucene.apache.org/solr/guide/8_4/the-stats-component.html Best, Erick > On Mar 9, 2020, at 02:39, Nicolas Paris wrote: > > Hello, > > > Environment: > - SolrCloud 8.4.1 > - 4 shards with xmx = 120 GB and ssd

Re: multivalue faceting term optimization

2020-03-09 Thread Toke Eskildsen
On Mon, 2020-03-09 at 10:39 +0100, Nicolas Paris wrote: > I want to provide terms facet on a string multivalue field. > ... > How to improve brute performances ? It might help to have everything in a single shard, to avoid the secondary fine count. But your index is rather large for single-shard s

Re: Multivalue Field lookup

2019-04-15 Thread Kumaresh AK
Thanks Edwin for the information. I will try highlighting.. On Sun, Apr 14, 2019 at 7:53 PM Zheng Lin Edwin Yeo wrote: > You need to set the field which you want to highlight in the hl.fl > parameter in your solrconfig.xml. > Highlighting can work in String field. > > Regards, > Edwin > > On Mon

RE: Multivalue Field lookup

2019-04-15 Thread Wolf, Karl (NIH/NLM/LHC) [C]
son for my Solr jira bug Solr-13367. This used to work in Solr 5.1.0 but it is broken in newer versions of Solr (at least in 7.5.0, 7.7.1, 8.0.0) Karl -Original Message- From: Zheng Lin Edwin Yeo Sent: Sunday, April 14, 2019 9:36 PM To: solr-user@lucene.apache.org Subject: Re: Multival

Re: Multivalue Field lookup

2019-04-14 Thread Zheng Lin Edwin Yeo
You need to set the field which you want to highlight in the hl.fl parameter in your solrconfig.xml. Highlighting can work in String field. Regards, Edwin On Mon, 15 Apr 2019 at 05:53, Kumaresh AK wrote: > How would I go about change by field type to facilitate highlighting ? I > tried hl on my

Re: Multivalue Field lookup

2019-04-14 Thread Kumaresh AK
How would I go about changing my field type to facilitate highlighting? I tried hl on my current schema and the highlight section is empty. I guess I need to change to multivalue text. Currently it is a String field, as these are identifiers and no analysis is needed on them. On Sun, Apr 14, 2019 at

Re: Multivalue Field lookup

2019-04-14 Thread Mikhail Khludnev
Btw, can it be done by highlighting? On Sun, Apr 14, 2019 at 3:22 PM Kumaresh AK wrote: > Hi Mikhail, > Thanks for the response. I see the results have the details I am looking > for. But it is not well structured. I sense that the debug=results is for > understanding the scoring behind the resu

Re: Multivalue Field lookup

2019-04-14 Thread Kumaresh AK
Hi Mikhail, Thanks for the response. I see the results have the details I am looking for. But it is not well structured. I sense that the debug=results is for understanding the scoring behind the results which can be more than what I am looking for. Is there any other way ? The multivalued field ma

Re: Multivalue Field lookup

2019-04-14 Thread Mikhail Khludnev
Hello, Kumaresh! If you turn it to q you may try debug=results&debug.explain.structured=true with some performance impact. On Sun, Apr 14, 2019 at 2:41 AM Kumaresh AK wrote: > Hello! > I am new to SOLR. This is my field type definition: > > > > stored="true" multiValued="true" omitTermFreqAndP

Re: Multivalue wild card search

2014-06-23 Thread Erick Erickson
Nope, got to re-index. bq: Assuming there is a multiValued field called "Name" of type string stored in index - bq: I tested both cases with empty index. When I inserted the document after changing fieldType to StandardTokenizerFactory, it worked fine with the standard phrase query. But I was

Re: Multivalue wild card search

2014-06-23 Thread Ethan
Hi Ahmet, I have tested this and it doesn't work for existing documents. I couldn't make much sense of the field analysis. I didn't find an option to see indexed terms in "Analysis" tab. Instead you feed it the value you want analyzed and it prints index or query time analysis. Is this what you'

Re: Multivalue wild card search

2014-06-23 Thread Ahmet Arslan
Hi Ethan, With that type, a standard phrase query should work. If you paste your sample text in the analysis page, you will see the indexed terms. q=Name:"steve wonder" should work. You don't need wildcard search in this case. Just do a phrase query (surrounded with quotes). Ahmet On Tuesday, June 24,

Re: Multivalue wild card search

2014-06-23 Thread Ethan
Ahmet, Here the xml for the field "Name" - Let me know if I need to update it. Thanks, E On Mon, Jun 23, 2014 at 12:38 PM, Ahmet Arslan wrote: > Hi Ethan, > > I understand that you are dealing legacy system. > > > Can you paste analy

Re: Multivalue wild card search

2014-06-23 Thread Ahmet Arslan
Hi Ethan, I understand that you are dealing with a legacy system. Can you paste the analysis chain used for the already indexed docs? I mean the XML snippet taken from schema.xml. With this, we will figure out how that text is indexed. We will write our query according to that info. Ahmet On Monday, June

Re: Multivalue wild card search

2014-06-23 Thread Ethan
Hey Ahmet, Yes, brackets, commas and quotes are part of the field's value. It's something I inherited and am working on improving. The field is of type solr.TextField. Adding StandardTokenizer solves the problem for the new documents. It doesn't work on already indexed docs. Is there a solution for

Re: Multivalue wild card search

2014-06-23 Thread Ahmet Arslan
Hi Ethan, The XML response is helpful, so you still have brackets, commas, quotes in the field value? What is the field type you use for the Name field? If you tokenize it with StandardTokenizer, a simple phrase query would do the trick: q=Name:"Steve Wonder" Also consider cleaning up your values. Why would you s

Re: Multivalue wild card search

2014-06-23 Thread Ethan
Ahmet, Yes, they were part of the JSON output. Here is the XML response [["Hifte", "Grop", "", ""]] [] [["Ethan", "G", "", ""],["Steve", "Wonder", "", ""]] The solution suggested by Jack to look up Steve Wonder doesn't work, as the asterisk is replaced by the default search field. Any suggestions? Thanks,

Re: Multivalue wild card search

2014-06-20 Thread Ahmet Arslan
Hi, What are these square brackets, backslashes, quotes? Are they part of the JSON output? Can you paste human-readable XML response writer output? Thanks, Ahmet On Friday, June 20, 2014 12:17 AM, Ethan wrote: Ahmet, Assuming there is a multiValued field called "Name" of type string stored i

Re: Multivalue wild card search

2014-06-19 Thread Jack Krupansky
sky -Original Message- From: Ethan Sent: Thursday, June 19, 2014 5:16 PM To: solr-user ; Ahmet Arslan Subject: Re: Multivalue wild card search Ahmet, Assuming there is a multiValued field called "Name" of type string stored in index - //Doc 1 "id" : 23512 "Ho

Re: Multivalue wild card search

2014-06-19 Thread Ethan
Ahmet, Assuming there is a multiValued field called "Name" of type string stored in index - //Doc 1 "id" : 23512 "HotelId" : [ "12", "23", "12" ] "Name" : [ "[[\"Ethan\", \"G\", \"\"],[\"Steve\", \"Wonder\", \"\"]]", "[]", "[[\"hifte\", \"Grop\", \"\"]]" ] // Doc 2 "id" : 23513 "Ho

Re: Multivalue wild card search

2014-06-06 Thread Jack Krupansky
Wildcard, fuzzy, and regex query do work on multivalued fields, but on only a single value at a time. Your match cannot occur across values. -- Jack Krupansky -Original Message- From: Ethan Sent: Thursday, June 5, 2014 12:52 PM To: solr-user Subject: Re: Multivalue wild card search
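Jack's point that a match cannot occur across values can be illustrated with a small model in plain Python. This is not Solr code: fnmatch-style globbing stands in for a wildcard query on a string field, and the helper name is made up for the illustration. The values are the ones from the Code example elsewhere in this thread.

```python
from fnmatch import fnmatchcase


def matches_any_value(values, pattern):
    """Model of a wildcard query on a multivalued string field:
    the pattern is tested against each value separately."""
    return any(fnmatchcase(value, pattern) for value in values)


codes = ["12344", "4534", "674"]

# Matches, because the single value "4534" starts with "45".
single_value_hit = matches_any_value(codes, "45*")

# Does not match: the pattern would need text from "12344" and "674",
# which are separate values.
cross_value_hit = matches_any_value(codes, "123*674")

# Only a concatenation of the values would match, and that is not
# how multivalued fields are searched.
joined_hit = fnmatchcase("".join(codes), "123*674")

print(single_value_hit, cross_value_hit, joined_hit)
```

For tokenized text fields the same idea applies one level down: the wildcard is tested against a single term of a single value.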

Re: Multivalue wild card search

2014-06-06 Thread Ahmet Arslan
Hi Ethan, It is hard to understand your example. Can you re-write it? Using xml?  On Friday, June 6, 2014 9:07 PM, Ethan wrote: Bumping the thread to see if anyone has a solution. On Thu, Jun 5, 2014 at 9:52 AM, Ethan wrote: > Wildcard search do work on multiValued field.  I was able t

Re: Multivalue wild card search

2014-06-06 Thread Ethan
Bumping the thread to see if anyone has a solution. On Thu, Jun 5, 2014 at 9:52 AM, Ethan wrote: > Wildcard search do work on multiValued field. I was able to pull up > records for following multiValued field - > > Code : [ > "12344", > "4534", > "674" > ] > > q=Code:45* fetched the correct do

Re: Multivalue wild card search

2014-06-05 Thread Ethan
Wildcard search does work on a multiValued field. I was able to pull up records for the following multiValued field - Code : [ "12344", "4534", "674" ] q=Code:45* fetched the correct document. It doesn't work in quotes (q="Code:45*"), however. Is there a workaround? On Thu, Jun 5, 2014 at 9:34 AM, Et

Re: Multivalue wild card search

2014-06-05 Thread Ethan
Are you implying there is no way to do a lookup on a multiValued field with a substring? If so, then how is it usually handled? On Wed, Jun 4, 2014 at 4:44 PM, Jack Krupansky wrote: > Wildcard, fuzzy, and regex query operate on a single term of a single > tokenized field value or a single string f

Re: Multivalue wild card search

2014-06-04 Thread Jack Krupansky
Wildcard, fuzzy, and regex query operate on a single term of a single tokenized field value or a single string field value. -- Jack Krupansky -Original Message- From: Ethan Sent: Wednesday, June 4, 2014 6:59 PM To: solr-user Subject: Multivalue wild card search I can't seem to find a

Re: multivalue location_rpt field not indexing with JSON format

2013-05-24 Thread David Smiley (@MITRE.org)
Hi Barbra, Solr needs to see a String for each point value, not a 2-element array. Your doc should look like: [{"id":"054ac6377d6ca4ad387f73b063000910","keywords":["time", "trouble", "exactly"],"description":"a anno is an anno is an anno", "location":["33.44844800999897,-111.98840074003"
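A small sketch of the conversion David describes, done on the client before the document is sent: each 2-element array becomes one "lat,lon" string. The helper name, the sample coordinates and the assumption that the incoming pairs are ordered [lat, lon] are mine, not from the thread.

```python
def points_to_solr_strings(points):
    """Turn [[lat, lon], ...] into ["lat,lon", ...], one string per point."""
    return [f"{lat},{lon}" for lat, lon in points]


# Hypothetical document with two points given as 2-element arrays
doc = {
    "id": "doc-1",
    "location": [[33.5, -111.9], [40.0, -75.25]],
}

# Replace the arrays with one string per point before indexing
doc["location"] = points_to_solr_strings(doc["location"])
print(doc["location"])
```

The multivalued field then carries a list of strings, which is the shape shown in the reply above.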

Re: MultiValue

2013-05-13 Thread manju16832003
Hi All, I managed to *solve* the issue I had posted earlier with respect to multiValued. Here is how the query is supposed to be configured in *data-config.xml*. Description: in the below, the first query has an associated table, images. Each person would have many images. Here the JSON/XML would return all

Re: MultiValue

2013-05-12 Thread manju16832003
Hi All, I wanted to achieve something similar [ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] }] [{"person_id" : "111", "documents" : ["doc_1","doc_2","doc_3","doc4,doc5,doc6"]}] Meaning one person could have more than one document. how do we need to write the query, an

Re: MultiValue

2013-01-17 Thread Alexandre Rafalovitch
Try my suggested field definition and see if it helps with faceting. It should. Try it on a small example or a fake schema. But I would still recommend escalating the problem up the chain to an architect or similar. Because I bet that data is stored in multiple places (e.g. in the database) and yo

Re: MultiValue

2013-01-17 Thread anurag.jain
@Alexandre Rafalovitch Thanks. yeah you got my point. training_skill:["c", "c++", "php", "java", ".net"] but it is not possible for me to split "php,java,.net" because data can vary and data is very large. i mean i have to perform on 5 line data. it might come["c++,php,java",".net","c#,

Re: MultiValue

2013-01-17 Thread Alexandre Rafalovitch
I think the problem here is that the list has 3 values, but the last one is actually a set of several as well. Anurag seems to want to be able to split them into separate values whether they came as individual array items or as part of a joint list. So, we have a mix of multiValue submission and desire to spl
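One way to handle the mixed list described above is to normalise it on the client before indexing, so every skill becomes its own value. This is only a sketch of that preprocessing idea, not the field definition suggested elsewhere in the thread; the helper name is hypothetical.

```python
def split_values(values, sep=","):
    """Flatten a list whose items may themselves be separator-joined
    lists, e.g. ["c", "c++", "php,java,.net"]."""
    flat = []
    for value in values:
        for part in value.split(sep):
            part = part.strip()
            if part:  # skip empty pieces such as a trailing comma
                flat.append(part)
    return flat


doc = {"last_name": "jain", "training_skill": ["c", "c++", "php,java,.net"]}
doc["training_skill"] = split_values(doc["training_skill"])
print(doc["training_skill"])
```

It works the same way whichever item carries the joined values, so the position of the comma-separated entry in the list does not matter.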

Re: MultiValue

2013-01-17 Thread Gora Mohanty
On 18 January 2013 00:31, anurag.jain wrote: > > [ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] > } > ] > > actually i want to tokenize in c c++ php java .net What do you mean by "tokenize" in this case? It has been a while since I had occasion to use JSON input, and

Re: MultiValue

2013-01-17 Thread anurag.jain
actually [ { "last_name" : "jain", "training_skill":*["c", "c++", "php,java,.net"]* } ] training_skill is list. and if i want to store in string field type then it will include [ and , also. so how to avoid ? or it will not. or do you have any other field type definition through which my wor

Re: MultiValue

2013-01-17 Thread Dikchant Sahi
You mean to say that the problem is with json which is being ingested. What you are trying to achieve is that you want to split the values on the basis of comma and index it as multiple value. What problem you are facing in indexing json in format Solr expects. If you don't have control over it,

Re: MultiValue

2013-01-17 Thread anurag.jain
[ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] } ] actually i want to tokenize in c c++ php java .net so through this i can make them as facet. but problem is in list "training_skill":["c", "c++", *"php,java,.net"*] -- View this message in context: http://l

Re: MultiValue

2013-01-17 Thread Dikchant Sahi
you just need to make the field as multivalued. type should be set based on your search requirements. On Thu, Jan 17, 2013 at 11:27 PM, anurag.jain wrote: > my json file look like > > [ { "last_name" : "jain", "training_skill":["c", "c++", "php,java,.net"] }] > > can u please suggest me how

Re: multivalue or denormalise

2011-07-25 Thread abhayd
hi erick, I will be searching only on search_term. I did exactly as u said in application layer, I was not sure how multi-valued fields works in co-relation -- View this message in context: http://lucene.472066.n3.nabble.com/multivalue-or-denormalise-tp3197942p3198710.html Sent from the Solr

Re: multivalue or denormalise

2011-07-25 Thread Erick Erickson
I'm a little confused. Are you searching against these different titles or is the search something else and you're really only interested in displaying different titles for documents returned for the query? If it's just a display issue, you can use multivalued fields, the order in which you put va

Re: MultiValue dynamicField and copyField

2010-07-14 Thread kenf_nc
Yep, my schema does this all day long. -- View this message in context: http://lucene.472066.n3.nabble.com/MultiValue-dynamicField-and-copyField-tp965941p966536.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: MultiValue Exclusion

2010-06-04 Thread Geert-Jan Brits
I guess the following works. A. similar to your option 2, but using the filtercache fq=-item_id:001 -item_id:002 B. similar to your option 3, but using the filtercache fq=-users_excluded_field: the advantage being that the filter is cached independently from the rest of the query so it can be re
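Option A above can be sketched as a small helper that builds the exclusion filter from a list of ids. The helper name is hypothetical, and the ids are assumed to need no query escaping. Because the exclusions go into fq rather than q, they are cached as a filter separately from the main query.

```python
def exclusion_fq(field, ids):
    """Build a filter query excluding every given id, e.g.
    "-item_id:001 -item_id:002"."""
    return " ".join(f"-{field}:{item_id}" for item_id in ids)


fq = exclusion_fq("item_id", ["001", "002"])
params = {"q": "*:*", "fq": fq}
print(params)
```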

Re: multivalue fields logic required

2010-05-12 Thread Marco Martinez
You should do a preprocessing step (multiply your document into as many documents as you have values in your multivalue field, with principalFlag:T in your first document) before indexing the data with that logic Marco Martínez Bautista http://www.paradigmatecnologico.com Avenida de Europa, 26. Ática

Re: multivalue fields logic required

2010-05-12 Thread Jonty Rhods
Hi Marco, Thanks for the quick reply. I have another question about the 2nd solution: how do I set the flag for duplicate values? I am not sure about the number of duplicate rows (it could be any number), so how can I set the flag? Thanks. On Wed, May 12, 2010 at 12:59 PM, Marco Martinez < mmarti...@paradigmatec

Re: multivalue fields logic required

2010-05-12 Thread Marco Martinez
Hi, 2nd solution: Do not use multiValue fields; instead use two single-valued fields. In your example this will be: doc1: dept: student1 city: city1 principalFlag:T doc2: dept: student2 city: city2 principalFlag:F So, if you search without specifying any city or dept, you should add principalFlag:T so as not to get dup
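The flag approach can be sketched as a preprocessing step that runs before indexing: one parent row with any number of child rows becomes that many flat documents, and only the first one gets principalFlag:T. The helper name, the "name" field and the composite id scheme are assumptions for illustration.

```python
def expand_document(doc_id, name, children):
    """Turn one parent with N (dept, city) children into N flat docs.

    The first doc is flagged "T", the rest "F", however many
    children there are.
    """
    docs = []
    for index, (dept, city) in enumerate(children):
        docs.append({
            "id": f"{doc_id}-{index}",  # assumed scheme to keep ids unique
            "name": name,
            "dept": dept,
            "city": city,
            "principalFlag": "T" if index == 0 else "F",
        })
    return docs


docs = expand_document(
    "1", "jonty", [("student1", "city1"), ("student2", "city2")]
)
for doc in docs:
    print(doc)
```

Since the flag is derived from the position in the loop, the number of child rows does not need to be known in advance.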

Re: multivalue fields logic required

2010-05-12 Thread Jonty Rhods
Hi Marco, I am trying to patch for collapse component support (till now no luck).. In mean time I would like to know the 2nd and 3rd option you mentioned (logic in solrj).. with regards On Thu, May 6, 2010 at 2:36 PM, Marco Martinez < mmarti...@paradigmatecnologico.com> wrote: > Hi Jonty, > > I

Re: multivalue fields logic required

2010-05-06 Thread Marco Martinez
Hi Jonty, I think you have three possible solutions: 1. Use the collapse component with your name field so as not to have any duplicate documents. 2. Create a simple logic in your index with flags, like one flag to determine the first element of the same document (in your example you wil

Re: multivalue fields logic required

2010-05-06 Thread Jonty Rhods
thanks :General solution is to index 3 different SolrDocument in your example. id and name fields will repeat themselves. All fields will be single-valued. if I am indexing 3 different field then if user is searching by name + dept then it will return duplicate value.. is there any other best pos

Re: multivalue fields logic required

2010-05-06 Thread Ahmet Arslan
> recently I start to work on solr, So I am still very new to > use solr. Sorry > if I am logically wrong. > I have two table, parent and referenced (child). > > for that I set multivalue field following is my schema > details > stored="true" required="true" > /> > > >     indexed="true" stor

Re: Multivalue Field Cache

2009-09-24 Thread Grant Ingersoll
Have a look at UninvertedField.java. I think that might help. On Sep 23, 2009, at 2:35 PM, Amit Nithian wrote: Are there any good implementations of a field cache that will return all values of a multivalued field? I am in the process of writing one for my immediate needs but I was wonderin

Re: MultiValue DynamicFields?

2009-01-30 Thread Alexander Ramos Jardim
Yes. It's totally acceptable. 2009/1/30 Bruno Aranda > Hi, it is possible to create a dynamic field that is multi valued? > > Cheers, > > Bruno > -- Alexander Ramos Jardim