Re[4]: Block Join faceting on intermediate levels with JSON Facet API (might be related to block join rollups & SOLR-8998)

2016-05-02 Thread Alisa Z .
-04:00 from Yonik Seeley : > >On Fri, Apr 22, 2016 at 12:26 PM, Alisa Z. < prol...@mail.ru > wrote: >> Hi Yonik, >> >> Thanks a lot for your response. >> >> I have discussed this with Mikhail Khludnev already and tried this >> suggestion. Here's what

Re[2]: Block Join faceting on intermediate levels with JSON Facet API (might be related to block join rollups & SOLR-8998)

2016-04-22 Thread Alisa Z .
uery?q=2.blog-posts.body-id :* ) So I think that it's just a JSON facet API limitation...  Best, --Alisa >Friday, 22 April 2016, 9:55 -04:00 from Yonik Seeley : > >Hi Alisa, >This was a bit too hard for me to grok on a first pass... then I saw >your related blog post which include

Re[2]: how to restrict phrase to appear in same child document

2016-04-21 Thread Alisa Z .
ple, "which black driver >has a white mercedes", it is difficult to distinguish which color belongs >to which field, because there can be thousands of car brands and >professions. Is there any way to achieve the feature I stated >before? > >On Wednesday, April 20, 201

Re: pivoting with json facet api

2016-04-21 Thread Alisa Z .
Hi Yangrui, I have summarized some experiments on Solr's nesting capabilities (however, they do not cover pivoting precisely; they are more about faceting up to parents and down to children with some statistics), so maybe you could find an idea there: https://medium.com/@alisazhila/solr-s-nesting-o

Re[2]: Traversal of documents through network

2016-04-21 Thread Alisa Z .
Well, it took me 7 milliseconds to index a 100 MB dataset on a local Solr. So you could assume that for 1 GB it would take 70 ms = 0.07 s, which is still pretty fast. Yet dealing with network delays is a separate issue. 100 Wikipedia article-size documents shouldn't be a big problem. >Thursday,

Block Join faceting on intermediate levels with JSON Facet API (might be related to block join rollups & SOLR-8998)

2016-04-20 Thread Alisa Z .
Hi all, I have been stretching some of Solr's capabilities for handling nested documents and I've come up against the following issue... Let's say I have the following structure: { "blog-posts":{  //level 1     "leaf-fields":[     "date",     "author"],     "title":{

Re: how to restrict phrase to appear in same child document

2016-04-20 Thread Alisa Z .
Yangrui, First, have you indexed your documents with proper nested document structure [https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Index+Handlers#UploadingDatawithIndexHandlers-NestedChildDocuments]? From the piece of data you showed, it seems that you just put it righ

Re: Traversal of documents through network

2016-04-20 Thread Alisa Z .
Viday, No, not all of those 500 result docs will be brought to your client (browser, etc.). Only as many documents as fit into the 1st "search result page" will be brought. There is a notion of "pagination" in Solr (as well as in most search engines). The counts of occurrence might be appro
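To make the pagination point concrete, here is a minimal sketch in Python of how a client could ask for one page at a time using Solr's `start` and `rows` request parameters. The base URL, collection name, query string, and the helper name `solr_page_url` are assumptions for illustration only; nothing here is taken from the original thread.

```python
from urllib.parse import urlencode


def solr_page_url(base_url, query, page, rows=10):
    """Build a /select URL for one page of results.

    `page` is 0-based; `start` is the offset of the first document returned
    and `rows` is the page size. Only `rows` documents travel to the client,
    no matter how many documents matched the query.
    """
    params = {"q": query, "start": page * rows, "rows": rows, "wt": "json"}
    return f"{base_url}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical local collection; adjust to your own setup.
    base = "http://localhost:8983/solr/my_collection"
    for page in range(3):
        print(solr_page_url(base, "text_t:solr", page))
```

With a page size of 10, a query matching 500 documents would span 50 such pages, and each request carries only 10 documents over the network.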

Re[2]: [possible bug]: [child] - ChildDocTransformerFactory returns top level documents nested under middle level documents when queried for the middle level ones

2016-03-31 Thread Alisa Z .
iched.text.* > >You could also possibly enrich the document with depth information and use >that for filtering out. > >On Wed, Mar 30, 2016 at 11:34 AM, Alisa Z. < prol...@mail.ru > wrote: > >> I think I am observing an unexpected behavior of >> ChildDocTransformerFac

[possible bug]: [child] - ChildDocTransformerFactory returns top level documents nested under middle level documents when queried for the middle level ones

2016-03-30 Thread Alisa Z .
I think I am observing an unexpected behavior of ChildDocTransformerFactory. The query is like this: /select?q={!parent which="type_s:doc.enriched.text"}type_s:doc.enriched.text.entities +text_t:pjm +type_t:Company +relevance_tf:[0.7%20TO%20*]&fl=*,[child parentFilter=type_s:doc.enriche

Re[5]: [nesting] JSON Facet API vs. BlockJoin Faceting: need help on queries (Facet API facets by wrong doc level VS. BlockJoin Faceting does not return top 10 most frequent)

2016-03-29 Thread Alisa Z .
016, 14:38 -04:00 from Alisa Z. : > >Mikhail, > >I totally see the point: the corresponding wiki page ( >https://cwiki.apache.org/confluence/display/solr/BlockJoin+Faceting ) does not >mention it and says it's an experimental feature. > >Is it correct that no additiona

Re[4]: [nesting] JSON Facet API vs. BlockJoin Faceting: need help on queries (Facet API facets by wrong doc level VS. BlockJoin Faceting does not return top 10 most frequent)

2016-03-29 Thread Alisa Z .
",2, > >> "30",1, > >> "60",1, > >> "78xx",1, > >> "82xx",1, > >> "ab",2, > >> "access",5, > >> "account",1, > >> "accounts",1, > >>... >

Re[2]: [nesting] JSON Facet API vs. BlockJoin Faceting: need help on queries (Facet API facets by wrong doc level VS. BlockJoin Faceting does not return top 10 most frequent)

2016-03-29 Thread Alisa Z .
t_t.facet.limit ... nothing worked :( >Monday, 28 March 2016, 17:20 -04:00 from Alisa Z. : > >Ok, so for the 1st question, I think I'm getting closer:  adding facet: >{top_terms_by_doc: "unique(_root_)"} as indicated in >http://blog.griddynamics.com/sea
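For readers following the `unique(_root_)` idea, here is a rough sketch in Python of what such a JSON Facet API request body could look like. The field name `text_t`, the facet name `top_terms`, the match-all query, and the helper name are assumptions for illustration; only the `top_terms_by_doc: "unique(_root_)"` sub-facet comes from the message above.

```python
import json


def terms_facet_by_parent(field, limit=10):
    """Build a JSON Facet API request body (a sketch, not the thread's exact query).

    A terms facet on `field` counts matching (child) documents per term; the
    nested `unique(_root_)` aggregation additionally counts how many distinct
    root (top-level parent) documents those children belong to.
    """
    return {
        "query": "*:*",  # assumed match-all query
        "limit": 0,      # return facet results only, no documents
        "facet": {
            "top_terms": {  # hypothetical facet name
                "type": "terms",
                "field": field,
                "limit": limit,
                "facet": {"top_terms_by_doc": "unique(_root_)"},
            }
        },
    }


if __name__ == "__main__":
    # The printed body could be POSTed to a collection's /query handler.
    print(json.dumps(terms_facet_by_parent("text_t"), indent=2))
```

Each returned bucket would then carry `val`, `count`, and `top_terms_by_doc`, where the last value is the number of distinct root documents behind the term.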

Re: [nesting] JSON Facet API vs. BlockJoin Faceting: need help on queries (Facet API facets by wrong doc level VS. BlockJoin Faceting does not return top 10 most frequent)

2016-03-28 Thread Alisa Z .
    "val":"plan",     "count":13,     "top_terms_by_doc":3},   {     "val":"hou",     "count":12,     "top_terms_by_doc":5},   {     "val"

[nesting] JSON Facet API vs. BlockJoin Faceting: need help on queries (Facet API facets by wrong doc level VS. BlockJoin Faceting does not return top 10 most frequent)

2016-03-28 Thread Alisa Z .
Hi all, I am trying to perform faceting of parent docs by nested document fields. I've tried the 2 approaches named in the subject, yet in the first the results are not quite correct and in the 2nd I cannot get the query right. So I need help on either of them and any explanation or documentation or blogs o

Re[2]: Solr-5.5.0 doesn't recognize different types of _childDocuments_ any more --degrading since 5.3.1?

2016-03-28 Thread Alisa Z .
nior Solutions Architect >http://www.lucidworks.com > > > >>On Mar 28, 2016, at 2:04 PM, Alisa Z. < prol...@mail.ru > wrote: >>@Yonik, thank you for your response. >> >>I think that the issue is of a different kind because my upload used to work >>well on So

Re[2]: Solr-5.5.0 doesn't recognize different types of _childDocuments_ any more --degrading since 5.3.1?

2016-03-28 Thread Alisa Z .
s.apache.org/jira/browse/SOLR-7042 >It looks like you can try adding >   -format solr >to your bin/post command line to get back to normal "solr JSON" > >-Yonik > > >On Fri, Mar 25, 2016 at 8:43 PM, Yonik Seeley < ysee...@gmail.com > wrote: >> On Fri, Mar

Re: Solr-5.5.0 doesn't recognize different types of _childDocuments_ any more --degrading since 5.3.1?

2016-03-25 Thread Alisa Z .
ocalhost:8985/solr/my_collection/update... Time spent: 0:00:05.137 So now it dumps all the values of relevance_tf into one array disregarding the type of the nested field they actually belonged... It really does not seem to account for proper hierarchy handling with branches of different types.  :

Solr-5.5.0 doesn't recognize different types of _childDocuments_ any more --degrading since 5.3.1?

2016-03-25 Thread Alisa Z .
Hi all, It is partially a question, partially a discussion. I am working with documents with deep levels of nesting. The documents are in a single JSON file (see a sample below). When I was on Solr 5.3.1, solr-5.3.1$ bin/post -c my_collection ../data/data-solr.json caused no problems. Now, I

Re[2]: [nesting] Any way to return the whole hierarchical structure when doing Block Join queries?

2016-03-25 Thread Alisa Z .
>Thursday, 24 March 2016, 15:52 -04:00 from Mikhail Khludnev >: > >I think you can already kick the tires and contribute a test case into >https://issues.apache.org/jira/browse/SOLR-8208 that's already reachable >there I believe, but I am still working on core design. > >On Thu, Ma

[nesting] Any way to return the whole hierarchical structure when doing Block Join queries?

2016-03-24 Thread Alisa Z .
Hi all, I apologize for duplicating my previous message: "Solr 5.3: anything similar to ChildDocTransformerFactory that does not flatten the hierarchical structure?" However, it is still an open and interesting question: Following the example from https://dzone.com/articles/using-solr

Solr 5.3: anything similar to ChildDocTransformerFactory that does not flatten the hierarchical structure?

2016-03-22 Thread Alisa Z .
Hi all, Following the example from https://dzone.com/articles/using-solr-49-new , let's say we are given a multi-level nested structure: 1 I am the parent PARENT 1.1 I am the 1st child CHILD

date range faceting on the whole dataset

2016-03-21 Thread Alisa Z .
Hello, Is it possible to perform date range faceting on the whole dataset without indicating facet.range.start and facet.range.end? What if I have no clue about when my data starts and when it ends (might be some point in the future)? A sample query: http://localhost:8983/solr/enron-path/

Re[2]: [nested] how to specify a path for multiple nesting?

2016-03-21 Thread Alisa Z .
Thanks, Mikhail. I eventually added a distinguishing field "path" and queried unambiguously.  >Thursday, 17 March 2016, 9:46 -04:00 from Mikhail Khludnev >: > >Hello, > >Please find inline > >On Wed, Mar 16, 2016 at 10:10 PM, Alisa Z. < prol...@mail.ru

[nested] how to specify a path for multiple nesting?

2016-03-19 Thread Alisa Z .
Hi all, I have a deeply multi-level data structure (up to 6-7 levels deep) where, due to the nature of the data, some nested documents can have the same type names at various levels. How do I form a proper query on a nested field that would contain "a path" that defines that field? I'll clarify wit