Solr Search Handler Suggestion

2017-01-24 Thread Moenieb Davids
Hi Guys, Just an idea for easier config of search handlers: Would it be feasible to configure a search handler that has its own schema based on the current core, and that can also insert nested objects from cross-core queries? Example (for illustration purposes, ignore syntax :) ) htt

Re: Problems with stored/not-stored field filter queries

2017-01-24 Thread Shawn Heisey
On 1/24/2017 6:15 PM, Stanislav Sandalnikov wrote: > Thanks a lot for your valuable input. Of course you were right, the > data was changed after reindex step, I completely forgot that > categories are done by separate application and this application was > pushing empty IndexDate field after updat

Re: Problems with stored/not-stored field filter queries

2017-01-24 Thread Erick Erickson
bq: By the way is there any way to see if there is an index for some particular field of a document? Not really conveniently. To know that, you have to unwind the inverted index. The "luke" program can do this. Of course if the field is _stored_ it's easy, just return q=id:doc_id&fl=* Also note th
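A minimal sketch of the stored-field check mentioned above (q=id:doc_id&fl=*), written in Python. The base URL, the core name "mycore", and the helper name are assumptions for illustration only; the sketch just builds the request URL and does not contact a server. A real document id containing query syntax characters would also need Solr query escaping, which is not handled here.

```python
from urllib.parse import urlencode


def stored_fields_url(base_url, core, doc_id):
    """Build a /select URL that asks for every stored field of one document.

    base_url and core are hypothetical placeholders; only q and fl come
    from the message above.
    """
    params = {"q": f"id:{doc_id}", "fl": "*"}
    return f"{base_url}/{core}/select?{urlencode(params)}"


url = stored_fields_url("http://localhost:8983/solr", "mycore", "doc_42")
print(url)
```

Any field that is missing from the response for that document is either not stored or has no value, which is why this only answers the question for stored fields.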

Re: Latest advice on G1 collector?

2017-01-24 Thread Shawn Heisey
On 1/23/2017 1:00 PM, Walter Underwood wrote: > We have a workload with very long queries, and that can drive the CMS > collector into using about 20% of the CPU time. So I’m ready to try G1 on a > couple of replicas and see what happens. I’ve already upgraded to Java 8 > update 121. > > I’ve re

Re: Problems with stored/not-stored field filter queries

2017-01-24 Thread Stanislav Sandalnikov
Thanks Mikhail, didn’t know about debugQuery and explainOther, could be useful. Regarding $q, you can find this information here - https://cwiki.apache.org/confluence/display/solr/Function+Queries#FunctionQueries-AvailableFunctions

Re: Problems with stored/not-stored field filter queries

2017-01-24 Thread Stanislav Sandalnikov
Hi Shawn, Thanks a lot for your valuable input. Of course you were right, the data was changed after reindex step, I completely forgot that categories are done by separate application and this application was pushing empty IndexDate field after update, because it couldn’t extract a value from

Re: Multivalued Fields queries for Occurences.

2017-01-24 Thread Erick Erickson
You might be able to do something with termfreq here: https://cwiki.apache.org/confluence/display/solr/Function+Queries as well as some of the conditionals. You need to be sure that those functions are in your version of Solr of course... Best, Erick On Tue, Jan 24, 2017 at 12:42 PM, slee wrote

Re: Multivalued Fields queries for Occurences.

2017-01-24 Thread slee
Anyone?

Re: Single call for distributed IDF?

2017-01-24 Thread Joel Bernstein
Reading your blogs now. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Jan 24, 2017 at 3:28 PM, Joel Bernstein wrote: > Ok my mistake, I was thinking you were writing your own component and > needed a fast way to get global IDF. You're looking for fast global IDF > during the scoring it s

Re: Single call for distributed IDF?

2017-01-24 Thread Joel Bernstein
Ok my mistake, I was thinking you were writing your own component and needed a fast way to get global IDF. You're looking for fast global IDF during the scoring it sounds like. That seems like a reasonable thing to want. In the piggy backing approach you mention does the aggregator node parse the

Re: Feedback on Match Query Parser (for fixing multiterm synonyms and other things)

2017-01-24 Thread Doug Turnbull
Just throwing this back out there as a bit more official. Finally got around to documenting how I use it. You can also download the plugin jar from github http://opensourceconnections.com/blog/2017/01/23/our-solution-to-solr-multiterm-synonyms/ https://github.com/o19s/match-query-parser Enjoy! GH

Re: Single call for distributed IDF?

2017-01-24 Thread Walter Underwood
Specifically, I’m talking about this: http://observer.wunderwood.org/ (my blog) > On Jan 24, 2017, at 10:43 AM, Joel Bernstein wrote: > > Ah, I thought you were just interested in a fast way to get at IDF. This > approach does take a callback but it's really fast. > > Joel Bernstein > ht

Re: Single call for distributed IDF?

2017-01-24 Thread Joel Bernstein
Ah, I thought you were just interested in a fast way to get at IDF. This approach does take a callback but it's really fast. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Jan 24, 2017 at 1:39 PM, Walter Underwood wrote: > I know how to do it. You return df for each term and num_docs then

Re: Single call for distributed IDF?

2017-01-24 Thread Walter Underwood
I know how to do it. You return df for each term and num_docs then recalculate idf. I wrote up how we did it in Ultraseek XPA about ten years ago, though with MonkeyRank instead of global IDF. https://observer.wunderwood.org/2007/04/04/progressive-reranking/
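The recalculation described above (return df for each term plus num_docs, then recompute idf) can be sketched roughly as follows. This is only an illustration, not Solr or Ultraseek code: the per-shard response shape, the function names, and the choice of the classic Lucene-style formula 1 + ln(N / (df + 1)) are all assumptions.

```python
import math
from collections import Counter


def merge_shard_stats(shard_stats):
    """Sum num_docs and per-term document frequencies across shards.

    Each element is assumed to look like
    {"num_docs": int, "df": {term: int}} (a hypothetical shape).
    """
    total_docs = 0
    total_df = Counter()
    for stats in shard_stats:
        total_docs += stats["num_docs"]
        total_df.update(stats["df"])  # adds counts term by term
    return total_docs, dict(total_df)


def global_idf(total_docs, total_df):
    """Recompute idf per term from the merged, collection-wide statistics."""
    return {
        term: 1.0 + math.log(total_docs / (df + 1))
        for term, df in total_df.items()
    }


shards = [
    {"num_docs": 100, "df": {"solr": 10, "idf": 1}},
    {"num_docs": 300, "df": {"solr": 30}},
]
total_docs, total_df = merge_shard_stats(shards)
idf = global_idf(total_docs, total_df)
```

The point of the sketch is that the aggregator only needs two small numbers per term from each shard, so the statistics could in principle travel with the first response instead of requiring a separate round trip.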

Re: Single call for distributed IDF?

2017-01-24 Thread Joel Bernstein
This may help out: https://github.com/apache/lucene-solr/blob/master/solr/solrj/src/java/org/apache/solr/client/solrj/io/stream/ScoreNodesStream.java#L208 This points to some code that calculates global idf for a list of terms. Not sure if this matches your use case. It seems to be very fast. Joel

RE: Indexing nested documents giving back unrelated parents when asking for children

2017-01-24 Thread Fabien Renaud
But the problem is already there with only two levels. If I change the code to add documents to Solr as follows: client1.add(doc1); client1.commit(); client1.add(doc4); client1.commit(); Then things work as expected as I get the following result (as

Single call for distributed IDF?

2017-01-24 Thread Walter Underwood
I tried running with the LRUStatsCache for global IDF, but the performance penalty was pretty big. The 95th percentile response time went from 3.4 seconds to 13 seconds. Oops. We should not need a separate call to get the tf and df stats. Those are already calculated when doing the first reques

RE: Indexing nested documents giving back unrelated parents when asking for children

2017-01-24 Thread Mikhail Khludnev
Fabien, Given that you have three levels, can you update the sample code accordingly? I might have already replied to such a question earlier; IIRC the filter should enumerate all types except the particular one. On Jan 24, 2017 at 16:21, "Fabien Renaud" < fabien.ren...@findwise.com> wrote: I kno

Re: Problems with stored/not-stored field filter queries

2017-01-24 Thread Mikhail Khludnev
Hello Stanislav, Stored fields have nothing to do with findability, I believe. Usually debugQuery and explainOther are the right way to see what's going on there. What is $q? How is it supposed to work? On Jan 24, 2017 at 18:29, "Stanislav Sandalnikov" < s.sandalni...@gmail.com> wrote: > Hi ev

Re: Problems with stored/not-stored field filter queries

2017-01-24 Thread Shawn Heisey
On 1/24/2017 8:29 AM, Stanislav Sandalnikov wrote: > With indexed but not stored IndexDate field: > > 1) With this query everything works fine, I’m getting the results back: > /select?fl=taskid,docid,score&q=*:*&fq=category:"Security")))+AND+(datasource:(sites)))&fq={!frange+l%3D0}query($q)&sor

Problems with stored/not-stored field filter queries

2017-01-24 Thread Stanislav Sandalnikov
Hi everyone, I’m facing strange Solr behavior, which could be better described in examples: With indexed but not stored IndexDate field: 1) With this query everything works fine, I’m getting the results back: /select?fl=taskid,docid,score&q=*:*&fq=category:"Security")))+AND+(datasource:(sit

Re: no dataimport-handler defined!

2017-01-24 Thread Alexandre Rafalovitch
Strange. If you run a pre-built DIH example, do any of the cores work? (not the RSS one, that is broken anyway). Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 24 January 2017 at 08:32, Chris Rogers wrote: > Hi Alex, > > I’m editing the sol

Get Handler Returning Null

2017-01-24 Thread Chris Ulicny
Recently started using the get handler on a solr cloud collection and it seems that it does not return any documents even when I can find those documents by filtering for their unique ids. I explicitly enabled the get handler, reindexed one of the documents, and it seems to work fine for that sing

Re: no dataimport-handler defined!

2017-01-24 Thread Chris Rogers
A quick update. I rolled back to Solr 6.2, and the data import handler is recognized there. So there has either been a change in the config required between 6.2 and 6.4, or there’s a bug in 6.4. Any thoughts? On 24/01/2017, 13:32, "Chris Rogers" wrote: Hi Alex, I’m editing the

Re: no dataimport-handler defined!

2017-01-24 Thread Chris Rogers
Hi Alex, I’m editing the solrconfig.xml file at /solr/server/solr/tei_config (i.e. the one generated from the configset when the node was created). I’m running standalone, not cloud. I’m restarting Solr after every change. Do I need to reload the core instead of restarting? I’ve also tried repl

Re: no dataimport-handler defined!

2017-01-24 Thread Alexandre Rafalovitch
Which solrconfig.xml are you editing, and what kind of Solr install are you running (cloud?)? And did you reload the core? I suspect you are not editing the file that is actually in use. For example, if you are running a cloud setup, the solrconfig.xml on the filesystem is disconnected from the con

RE: Indexing nested documents giving back unrelated parents when asking for children

2017-01-24 Thread Fabien Renaud
I know it works as expected when I set type_s:up as you describe. But I was expecting no children at all in my query. In my real query I have a document with several children and thus can't specify a specific type with childFilter. And I can't give back all children because some of them do not

SQL-like queries (with percent character) - matching an exact substring, with parts of words

2017-01-24 Thread Maciej Ł. PCSS
Dear SOLR users, please point me to the right solution to my problem. I'm using SOLR to implement a Google-like search in my application and this scenario is working fine. However, in specific use-cases I need to filter documents that include a specific substring in a given field. It's about

Re: Indexing nested documents giving back unrelated parents when asking for children

2017-01-24 Thread Mikhail Khludnev
Hello Fabien, I believe parentFilter should be type_s:up, and consequently the type_s:up should go in fq. On Tue, Jan 24, 2017 at 3:30 PM, Fabien Renaud wrote: > Hello, > > I'm wondering if I missed something in my code (which uses solrj 6.3): > > public class Main { > > private SolrClient
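One possible reading of the suggestion above, sketched in Python as a set of request parameters. It assumes the query uses Solr's [child] document transformer (with its parentFilter and childFilter arguments) and a match-all main query; the helper name and the q=*:* choice are illustrative assumptions, not taken from the thread.

```python
def child_transformer_params(parent_filter, child_filter=None):
    """Build request params where the same clause selects parents in fq
    and identifies parents for the [child] transformer in fl."""
    parts = [f"parentFilter={parent_filter}"]
    if child_filter is not None:
        parts.append(f"childFilter={child_filter}")
    return {
        "q": "*:*",                       # assumed main query
        "fq": parent_filter,              # restrict results to parent docs
        "fl": "*,[child " + " ".join(parts) + "]",
    }


params = child_transformer_params("type_s:up")
print(params["fl"])
```

Keeping the fq clause and parentFilter identical is the design point here: parentFilter is meant to match all parent documents, so filtering the result set by the same clause keeps non-parent documents out of the top-level results.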

no dataimport-handler defined!

2017-01-24 Thread Chris Rogers
Hi all, Having frustrating issues with getting SOLR 6.4.0 to recognize the existence of my DIH config. I’m using Oracle Java8 jdk on Ubuntu 14.04. The DIH .jar file appears to be loading correctly. There are no errors in the SOLR logs. It just says “Sorry, no dataimport-handler defined” in the

Indexing nested documents giving back unrelated parents when asking for children

2017-01-24 Thread Fabien Renaud
Hello, I'm wondering if I missed something in my code (which uses solrj 6.3): public class Main { private SolrClient client1; public void run() { client1 = new HttpSolrClient.Builder("http://localhost:8983/solr").build(); SolrInputDocument doc1 = new SolrInputDocument

Re: Upgrade SOLR version - facets perfomance regression

2017-01-24 Thread alessandro.benedetti
Hi Solr, I admit the issue you mentioned has not been transparently solved, and indeed you would need to explicitly use method=uif to get the 4.10.1 behavior. This is valid if you were using the fc/fcs approaches with high-cardinality fields. In case your facet method is enum (Term Enumeration),

Re: A tool to quickly browse Solr documents ?

2017-01-24 Thread Charlie Hull
On 24/01/2017 04:36, Fengtan wrote: Hi All, I am looking for a tool to quickly browse/investigate documents indexed in a Solr core. The default web admin interface already offers this, but you need to know the Solr query syntax if you want to list/filter/sort documents. I have started to build

Re: Information on classifier based key word suggestion

2017-01-24 Thread alessandro.benedetti
Hi Shamik, for classification you can take a look at the Lucene module and the Solr integration (through UpdateRequestProcessor [1]). Unfortunately I didn't have the time to work on the request handler version [2]; anyway, you are free to contribute! Regarding the extraction of interesting terms