Re: How to combine third party search data as top results ?

2017-02-01 Thread Charlie Hull
On 31/01/2017 19:04, Shamik Bandopadhyay wrote: Hi, I'm trying to integrate results from a third party source with our existing search. The idea is to include the top 5 results from this source as the top result of our search. Though the external data is indexed in our system, the use case dict

Re: Upgrade SOLR version - facets perfomance regression

2017-02-01 Thread SOLR4189
I noticed that if I don't write the list of components in the request handler it works fine, but if I add something like query facet Facets don't work... How can you explain it? -- View this message in context: http://lucene.472066.n3.nabble.com/Upgrade-SOLR-version-facets-perfomance-regression-tp431

Re: Upgrade SOLR version - facets perfomance regression

2017-02-01 Thread alessandro.benedetti
The reason must be found in the default list of components : protected List getDefaultComponents() { ArrayList names = new ArrayList<>(8); names.add( QueryComponent.COMPONENT_NAME ); names.add( FacetComponent.COMPONENT_NAME ); *names.add( FacetModule.COMPONENT_NAME );* nam

Re: [Benchmark SOLR] JETTY VS TOMCAT - Jetty 15% slower - need advice to improve Jetty performance

2017-02-01 Thread Rick Leir
There is a profiling tool in Eclipse that can show you a tree of method calls, with timing information. I have found this useful in the past to investigate a performance problem. But it might not help if the problem only occurs at 165 queries per second (is that true?). cheers -- Rick On 201

1:n relation and function queries

2017-02-01 Thread Ewald Moitzi
Hello, I am unsure if solr is the right solution for a problem that we have, or if it is better to stick with a relational database (and if it should be done in solr how to implement it). The explanation is a bit lengthy, but please bear with me. The problem: Sort results of a vendor search for a

Re: [Benchmark SOLR] JETTY VS TOMCAT - Jetty 15% slower - need advice to improve Jetty performance

2017-02-01 Thread Gerald Reinhart
We have done some profiling with Visualvm, but nothing obvious appeared. Thank Rick for the advice. Gérald Reinhart On 02/01/2017 11:17 AM, Rick Leir wrote: There is a profiling tool in Eclipse that can show you a tree of method calls, with timing information. I have found this useful in th

Re: 1:n relation and function queries

2017-02-01 Thread Mikhail Khludnev
Ewald, Function queries combine well with block join as well as query time join, here are examples for the latter one http://blog-archive.griddynamics.com/2015/08/scoring-join-party-in-solr-53.html It must be the same for block join. What doesn't work exactly? On Wed, Feb 1, 2017 at 1:39 PM, Ewald

Re: Upgrade SOLR version - facets perfomance regression

2017-02-01 Thread SOLR4189
Alessandro, it helped! Thank you. But I asked which changes we need to make in the configuration, and I think these things must be documented in the reference guide. About your question, first of all I don't override default components. Second of all, I add my own components and for many reasons (For example, I

Re: Upgrade SOLR version - facets perfomance regression

2017-02-01 Thread SOLR4189
And still I have a question: Is there some convertor from the legacy api to the new API? Or a search component that converts from legacy api to json facet api? I explained why I need it in my first post. Thank you -- View this message in context: http://lucene.472066.n3.nabble.com/Upgrade-SOL

Re: Query structure

2017-02-01 Thread KRIS MUSSHORN
I really need some guidance on this query structure issue. I've got to get this solved today for my employer. "Help me Obiwan. You're my only hope" K - Original Message - From: "KRIS MUSSHORN" To: solr-user@lucene.apache.org Sent: Tuesday, January 31, 2017 12:31:13 PM Subject: Qu

Re: Query structure

2017-02-01 Thread Maciej Ł. PCSS
Why not use a filter query? I mean the 'fq' param. Regards Maciej On 01.02.2017 at 13:43, KRIS MUSSHORN wrote: I really need some guidance on this query structure issue. I've got to get this solved today for my employer. "Help me Obiwan. You're my only hope" K - Original Message

Re: Query structure

2017-02-01 Thread Maciej Ł. PCSS
You should be able to put 'facetMetatagDatePrefix4:2015 OR facetMetatagDatePrefix4:2016' into the filter query. Maciej On 01.02.2017 at 13:43, KRIS MUSSHORN wrote: I really need some guidance on this query structure issue. I've got to get this solved today for my employer. "Help me Ob
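A minimal sketch of the suggestion above, assuming a standard `/select` handler: the OR clause from the message goes into the `fq` parameter while `q` keeps the main query. The host, collection name, and the `*:*` main query are placeholders, not values from the thread.

```python
from urllib.parse import urlencode


def build_select_url(base_url, main_query, filter_query):
    """Return a /select URL with the main query in q and the restriction in fq."""
    params = [("q", main_query), ("fq", filter_query)]
    return base_url + "/select?" + urlencode(params)


# Hypothetical host and collection; only the fq clause comes from the thread.
select_url = build_select_url(
    "http://localhost:8983/solr/mycollection",
    "*:*",
    "facetMetatagDatePrefix4:2015 OR facetMetatagDatePrefix4:2016",
)
print(select_url)
```

Keeping the year restriction in `fq` leaves it out of relevance scoring and lets Solr cache it separately from the main query.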

Re: Upgrade SOLR version - facets perfomance regression

2017-02-01 Thread alessandro.benedetti
What I meant is that : "Components If you define components, the default components (see above) will not be executed, and first-components and last-components are disallowed:" Anyway it is documented in the Confluence page. If you don't override the default components, json facets will be there.

Fw: solr-user-unsubscribe

2017-02-01 Thread Syed Mudasseer
Can someone help me with unsubscription of solr emails? I tried sending "unsubscribe" emails to "solr-user@lucene.apache.org" but no luck. Thanks, Mudasseer From: Syed Mudasseer Sent: Monday, January 30, 2017 12:55 PM To: solr-user@lucene.apache.org Subject:

Re: Fw: solr-user-unsubscribe

2017-02-01 Thread alessandro.benedetti
Gents, have you read the instructions ? Have you sent an email to : solr-user-unsubscr...@lucene.apache.org ? You don't need to send messages to the mailing list with that address as content. Just follow what's in the official Solr documentation page : http://lucene.apache.org/solr/community.html

Re: Query structure

2017-02-01 Thread KRIS MUSSHORN
This was the solution. Thank you! - Original Message - From: "Maciej Ł. PCSS" To: solr-user@lucene.apache.org Sent: Wednesday, February 1, 2017 7:57:05 AM Subject: Re: Query structure You should be able to put 'facetMetatagDatePrefix4:2015 OR facetMetatagDatePrefix4:2016' into th

Collection will not replicate

2017-02-01 Thread tedsolr
I have a collection (1 shard, 2 replicas) that was doing a batch update when one solr host ran out of disk space. The batch job failed at that point, and one replica got corrupted. I deleted the bad replica. I've tried several times since then to add a new replica. The status of the request is "run

Re: 1:n relation and function queries

2017-02-01 Thread Ewald Moitzi
Hello Mikhail, I was using the functions as fl and sort parameters, and this gives no result. When sorting by score, as you did in your blog, I get the values from the child documents. (I missed the score=max parameter) However, i need to combine values from the parent and child, like this: {!fu

Re: Collection will not replicate

2017-02-01 Thread tedsolr
Update! I did find an error: 2017-02-01 09:23:22.673 ERROR org.apache.solr.common.SolrException :org.apache.solr.common.SolrException: Error getting leader from zk for shard shard1 Caused by: org.apache.solr.common.SolrException: Could not get leader props at org.apache.solr.cloud.Zk

Need help in Tika on SolrCloud

2017-02-01 Thread Anatharaman, Srinatha (Contractor)
Hi, I am new to Solr. I have implemented Solr on a single node and my code works well. When I move the same code to SolrCloud it fails (I made a few changes for SolrCloud). I am trying to load data using DataImportHandler, but it throws the error below: 2017-02-01 03:23:07.727 ERROR (Thread-18) [c

Re: 1:n relation and function queries

2017-02-01 Thread Mikhail Khludnev
Why can't you get the score from the child level and combine it on the parent level? On Wed, Feb 1, 2017 at 5:33 PM, Ewald Moitzi wrote: > Hello Mikhail, > > I was using the functions as fl and sort parameters, and this > gives no result. > > When sorting by score, as you did in your blog, I get the val

Upserting doc fields from a SearchComponent

2017-02-01 Thread Ugo Matrangolo
Hi, I'm trying to write a SearchComponent that personalizes on the fly a field on all the docs resulting from the initial search query. What I would like to do is to write a SearchComponent that intercepts the documents belonging to the result sets of a search query and upsert one or more of thei

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Erick Erickson
If the data is from another Solr instance, consider the [subquery] Document Transformer here: https://cwiki.apache.org/confluence/display/solr/Transforming+Result+Documents#TransformingResultDocuments-[subquery] More broadly, you can write a custom DocTransformer plugin to insert anything you want

Re: Need help in Tika on SolrCloud

2017-02-01 Thread Erick Erickson
Not quite sure. Are all the directories you expect the Tika docs to be in available to the Solr node? Sidestepping your question, I would be very reluctant to use Tika in SolrCloud mode because you're then putting all of the heavy-duty processing on nodes that also serve queries. I have the same r

Re: Collection will not replicate

2017-02-01 Thread Erick Erickson
What version of Solr? Since 5.4 there's been a FORCELEADER collections API call that might help. I'd run it with the newly added replicas offline. You only want it to have good replicas to choose from. Best, Erick On Wed, Feb 1, 2017 at 6:48 AM, tedsolr wrote: > Update! I did find an error: > >
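A minimal sketch of what the FORCELEADER request could look like, assuming the usual Collections API endpoint at `/admin/collections`. The host and collection name are placeholders; `shard1` is the shard named in the error quoted earlier in this thread. The snippet only builds the URL and does not send it.

```python
from urllib.parse import urlencode


def force_leader_url(solr_base, collection, shard):
    """Build (but do not send) a Collections API FORCELEADER request URL."""
    params = [
        ("action", "FORCELEADER"),
        ("collection", collection),
        ("shard", shard),
    ]
    return solr_base + "/admin/collections?" + urlencode(params)


# Hypothetical host and collection name.
force_leader = force_leader_url("http://localhost:8983/solr", "mycollection", "shard1")
print(force_leader)
```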

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Ugo Matrangolo
Hi, tx for the speedy response. What if I need to filter on the result matches ?? Example: I have a price I need to personalize per user/request and need then to apply filter ranges on this personalized price (show only stuff in the 10$ - 99$ range). WDYT ? Best Ugo On Wed, Feb 1, 2017 at 3:3

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Erick Erickson
You need to be clear about what to do when. The [subquery], is completely separate from _finding_ the top N docs. Your last question is about finding the top N. There are really two choices that spring to mind, depending on where you keep your data about the user. Solr can't magically know that us

RE: Collection will not replicate

2017-02-01 Thread Anatharaman, Srinatha (Contractor)
Erick, Thank you for your quick response, I appreciate your help I am new to Solr and not from Java background I have developed the code in Dev on single core and it works perfectly good On QA box I have installed Solr 6.3 on 2 nodes(SolrCloud) Made few changes to the code to fit SolrCloud I hav

Re: Collection will not replicate

2017-02-01 Thread tedsolr
I have version 5.2.1. Short of an upgrade, are there any remedies? Erick Erickson wrote > What version of Solr? since 5.4 there's been a FORCELEADER collections > API call that might help. > > I'd run it with the newly added replicas offline. you only want it to > have good replicas to choose fr

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Ugo Matrangolo
Hi Erick, Personalizing a 'price' involves using an external service (luckily we could cache most of the interactions) and it is accessed using a lib that gets dropped in the Solr classpath. What I need to do is this kind of flow: 1. Query (.. personalization params ...) 2. Find the initial sear

Phrase Queries and Punctuation

2017-02-01 Thread alessandro.benedetti
Hi all, I was just thinking about Phrase Queries and punctuation (and, in general, how to manage position increments when such a sentence delimiter occurs). At the moment for multi valued fields we have the "position increment gap", which prevents phrase queries from spanning different values for t

Re: Streaming Expressions result-set fields not in order

2017-02-01 Thread Zheng Lin Edwin Yeo
Hi Joel, Thanks for your reply. I've created the jira about this, with the issue number SOLR-10085 Regards, Edwin On 28 January 2017 at 10:26, Joel Bernstein wrote: > The issue is that fields are held in HashMaps internally so field order is

Solr 6.0.0 Returns Blank Highlights for alpha-numeric combos

2017-02-01 Thread Teague James
Hello everyone! I'm still stuck on this issue and could really use some help. I have a Solr 6.0.0 instance that is storing documents peppered with text like "1a", "2e", "4c", etc. If I search the documents for a word, "ms", "in", "the", etc., I get the correct number of hits and the results are hig

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Charlie Hull
On 01/02/2017 15:55, Ugo Matrangolo wrote: Hi Erick, Personalizing a 'price' involves using an external service (luckily we could cache most of the interactions) and it is accessed using a lib that gets dropped in the Solr classpath. Hi Ugo, We built a Solr plugin 'XJoin', allowing you to use

RE: Return specific field from child documents.

2017-02-01 Thread Mikhail Khludnev
ok. let's add emp.logParamsList=q,fl,rows,*row.id * *And check logs for this request?* On 31 Jan 2017 at 14:21, "Preeti Bhat" wrote: Same result. Thanks, Preeti -Original Message- From: Mikhail Khludnev [mailto:m...@apache.org] Sent: Tuesday, January 31, 201

Re: DIH - Parent-Child-Problems - GrapQuery-Or-BlockJoin - Order with Orderlines

2017-02-01 Thread Mikhail Khludnev
Have you checked https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-BlockJoinQueryParsers ? On 1 Feb 2017 at 10:42, "Kent Iversen" wrote: > I'm a newbie to Solr and can't seem to get this to work, properly. Gonna > use Order with Orderlines as an example. >

RE: Need help in Tika on SolrCloud

2017-02-01 Thread Anatharaman, Srinatha (Contractor)
Hi All, I see below code which is causing my code not to work in SolrCloud @Override public String getConfigDir() { throw new ZooKeeperException( ErrorCode.SERVER_ERROR, "ZkSolrResourceLoader does not support getConfigDir() - likely, what you are trying to do is no

Re: project related configsets need to be deployed in both data and solr install folders ?

2017-02-01 Thread Chris Hostetter
Based on your description of the problem, and the fact that explicitly setting configSetBaseDir in your solr.xml works, I suspect there is some sort of weird bug in how the "default" configSetBaseDir is determined in some diff code paths in Solr. We should definitely file a jira issue tracking

Re: Arabic words search in solr

2017-02-01 Thread mohanmca01
Dear Steve, Thanks for investigating our problem. Our project is basically a business directory search platform, and we have more than 100K business detail records. I’m providing you some examples of Arabic words to reproduce the problem. Please find attached a Word file where I explained everyth

Query.extractTerms dissapeared from 5.1.0 to 5.2.0

2017-02-01 Thread Max Bridgewater
Hi, It seems Query.extractTerms() disappeared from 5.1.0 ( http://lucene.apache.org/core/5_1_0/core/org/apache/lucene/search/Query.html) to 5.2.0 ( http://lucene.apache.org/core/5_2_0/core/org/apache/lucene/search/Query.html ). However, I cannot find any comment on it in 5.2.0 release notes. Any r

Solr 6.3.0 - recovery failed

2017-02-01 Thread Joe Obernberger
Hi All - I had one node in a 45 shard cluster (9 physical machines) run out of memory. I stopped all the nodes in the cluster and removed any lingering write.lock files from the OOM in HDFS. All the nodes recovered except one replica of one shard that happens to be on the node that ran out of

Do long auto commit times interfere with delete?

2017-02-01 Thread Hans Zhou
We have a solr cloud with a pretty long auto commit time (30 seconds for a hard commit, 180 seconds for a soft commit). We’re also doing lots of delete-then-insert operations. i.e. Each document has a product.id, and to update a product, we do an update request with { "delete": { "query": "prod
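A minimal sketch of the delete-then-insert pattern described above, expressed as a single JSON update body. The delete query and the document fields are hypothetical, because the original request is cut off; only the `product.id` field name and the `delete` / `query` keys come from the message. The `add` / `doc` wrapper follows Solr's JSON update command format.

```python
import json


def delete_then_add(delete_query, doc):
    """Serialize one update body: delete by query first, then add the new doc.

    Python dicts keep insertion order, so "delete" is written before "add".
    """
    commands = {
        "delete": {"query": delete_query},
        "add": {"doc": doc},
    }
    return json.dumps(commands)


# Hypothetical values for illustration only.
body = delete_then_add(
    'product.id:"12345"',
    {"id": "sku-1", "product.id": "12345"},
)
print(body)
```

A body like this would typically be sent as a POST to the collection's `/update` handler with a JSON content type.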

Re: Solr Kafka DIH

2017-02-01 Thread Susheel Kumar
Hello Joel, This definitely seems like a good feature to add. In fact I was also looking to push data into Solr from Kafka and this can be good feature to have. I have created JIRA https://issues.apache.org/jira/browse/SOLR-10086 and can contribute as well. Thanks, Susheel On Tue, Jan 31, 2017

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread alessandro.benedetti
Let me try to summarize . How many virtual machines on top of the 9 physical ? How many Solr processes ( replicas ? ) If you had 1 node compromised. I assume you have replicas as well right ? Can you explain a little bit better your replicas configuration ? Why you had to stop all the nodes ? I

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread Joe Obernberger
Thank you for the response. There are no virtual machines in the configuration. The collection has 45 shards with 3 replicas each spread across the 9 physical boxes; each box is running one copy of solr. I've tried to restart just the one node after the other 8 (and all their shards/replicas)

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread Alessandro Benedetti
Ok, it is clearer now. You have 9 solr nodes running, one per physical machine. So each node has a number of cores ( both replicas and leaders). When the node died, you got a lot of indexes corrupted. I still miss why you restarted the other 8 working nodes ( I was expecting you to restart only th

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread Joe Obernberger
In HDFS when a node fails it will leave behind write.lock files in HDFS. These files have to be manually removed; otherwise the shards/replicas that have write.lock files left behind will not start. Since I can't tell which physical node is hosting which shard/replica, I stop all the nodes, d

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread Alessandro Benedetti
I can't debug the code now, but if you access the logs, directly ( not from the ui), is there any " caused by" associated to the recovery failure exception? Cheers On 1 Feb 2017 6:28 p.m., "Joe Obernberger" wrote: > In HDFS when a node fails it will leave behind write.lock files in HDFS. >

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread Joe Obernberger
Thank you. I do not see any caused block in the solr.log. --- 2017-02-01 18:37:57.566 INFO (recoveryExecutor-3-thread-8-processing-n:bilbo:9100_solr x:Worldline2New_shard22_replica2 s:shard22 c:Worldline2New r:core_node34) [c:Worldline2New s:shard22 r:core_node34 x:Worldline2New_sha

Re: Query.extractTerms dissapeared from 5.1.0 to 5.2.0

2017-02-01 Thread Alan Woodward
Hi, extractTerms() is now on Weight rather than on Query. Alan > On 1 Feb 2017, at 17:43, Max Bridgewater wrote: > > Hi, > > It seems Query.extractTerms() disappeared from 5.1.0 ( > http://lucene.apache.org/core/5_1_0/core/org/apache/lucene/search/Query.html) > to 5.2.0 ( > http://lucene.apache

Re: Solr 6.3.0 - recovery failed

2017-02-01 Thread Joe Obernberger
I brought down the whole cluster again, and brought up one server at a time, waiting for it to go green before launching another. Now all replicas are OK, including the one that was in the perma-recovery mode before. I do notice a large amount of network activity (basically pegging the interfa

Re: Query.extractTerms dissapeared from 5.1.0 to 5.2.0

2017-02-01 Thread Max Bridgewater
Perfect. Thanks a lot. On Wed, Feb 1, 2017 at 2:01 PM, Alan Woodward wrote: > Hi, extractTerms() is now on Weight rather than on Query. > > Alan > > > On 1 Feb 2017, at 17:43, Max Bridgewater > wrote: > > > > Hi, > > > > It seems Query.extractTerms() disappeared from 5.1.0 ( > > http://lucene.ap

Re: Do long auto commit times interfere with delete?

2017-02-01 Thread Erick Erickson
This should work fine. There is no requirement that a commit happen between updates and deletes for the same document. That said, this can be tricky so if you have a demonstrable case where this isn't so let us know. Best, Erick On Wed, Feb 1, 2017 at 9:53 AM, Hans Zhou wrote: > We have a solr

Re: Solr 6.0.0 Returns Blank Highlights for alpha-numeric combos

2017-02-01 Thread Erick Erickson
How far into the text field are these tokens? The highlighter defaults to the first 10K characters under control of hl.maxAnalyzedChars. It's vaguely possible that the values happen to be farther along in the text than that. Not likely, mind you but possible. Best, Erick On Wed, Feb 1, 2017 at 8:
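A minimal sketch of raising the limit mentioned above, assuming the standard highlighting parameters `hl`, `hl.fl`, and `hl.maxAnalyzedChars`. The field name `text` and the chosen limit are placeholders, not values from the thread.

```python
from urllib.parse import urlencode


def highlight_params(query, field, max_analyzed_chars):
    """Build a query string that enables highlighting with a custom analysis limit."""
    return urlencode([
        ("q", query),
        ("hl", "true"),
        ("hl.fl", field),
        ("hl.maxAnalyzedChars", str(max_analyzed_chars)),
    ])


# Hypothetical field name and limit; "1a" is one of the terms from the thread.
hl_query = highlight_params("1a", "text", 1000000)
print(hl_query)
```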

Re: How to combine third party search data as top results ?

2017-02-01 Thread shamik
Charlie, thanks for sharing the information. I'm going to take a look and get back to you.

RE: Solr 6.0.0 Returns Blank Highlights for alpha-numeric combos

2017-02-01 Thread Teague James
Hi Erick! Thanks for the reply. The goal is to get two character terms like 1a, 1b, 2a, 2b, 3a, etc. to get highlighted in the documents. Additional testing shows that any alpha-numeric combo returns a blank highlight, regardless of length. Thus, "pr0blem" will not highlight because of the zero

RE: Need help in Tika on SolrCloud

2017-02-01 Thread Anatharaman, Srinatha (Contractor)
Is there anyone to help me with my issue? Your help is much appreciated I figured out the problem but need solution In my below data-config file tikaConfig.xml is not recognized by zookeeper ( processor="TikaEntityProcessor" tikaConfig="tikaConfig.xml" )

Solr querying nested documents with ChildDocTransformerFactory, get “Parent query yields document which is not matched by parents filter”

2017-02-01 Thread David Kramer
Some background: · The data involved is catalog data, with three nested objects: Products, Items, and Skus, in that order. We have a docType field on each record as a differentiator. · The "id" field in our data is unique within datatype, but not across datatypes. We added a "u

Re: Solr 6.0.0 Returns Blank Highlights for alpha-numeric combos

2017-02-01 Thread Erick Erickson
The termvectors and offsets aren't necessary, they can be beneficial for speed reasons so I'd defer them. I ran a quick test on 6.0 with your definitions and it works just fine. I did have to comment out your custom stopwords filter on the indexing but unless you're substituting for pairs like you

Re: How to combine third party search data as top results ?

2017-02-01 Thread Doug Turnbull
I was going to say what Charlie said! I would trust Flax's work in this area :) -Doug On Wed, Feb 1, 2017 at 3:10 PM shamik wrote: > Charlie, thanks for sharing the information. I'm going to take a look and > get > back to you. > > > > -- > View this message in context: > http://lucene.472066.n

Re: Collection will not replicate

2017-02-01 Thread Jeff Wartes
Sounds similar to a thread last year: http://lucene.472066.n3.nabble.com/Node-not-recovering-leader-elections-not-occuring-tp4287819p4287866.html On 2/1/17, 7:49 AM, "tedsolr" wrote: I have version 5.2.1. Short of an upgrade, are there any remedies? Erick Erickson wrote >

Re: project related configsets need to be deployed in both data and solr install folders ?

2017-02-01 Thread Renee Sun
Hi Chris, I have been playing with this install, and I am not certain whether I have unknowingly messed up some other settings. I want to avoid filing a false Jira and wasting your time. I wiped out everything on my solr box and did a fresh install of solr 6.4.0, made sure my config file set are place

Re: project related configsets need to be deployed in both data and solr install folders ?

2017-02-01 Thread Chris Hostetter
Renee: Huh ... so sounds like something must have been wonky in your original install? Glad it worked out for you, and thanks for following up. : Date: Wed, 1 Feb 2017 15:09:54 -0700 (MST) : From: Renee Sun : Reply-To: solr-user@lucene.apache.org : To: solr-user@lucene.apache.org : Subject: R

Re: project related configsets need to be deployed in both data and solr install folders ?

2017-02-01 Thread Renee Sun
thanks for your time!

Re: Collection will not replicate

2017-02-01 Thread Erick Erickson
It's worth a try to take down your entire cluster. Bring one machine back up at a time. There _may_ be something like a 3 minute wait before each of the replicas on that machine come up, the leader election process has a 180 second delay before the replicas on that node take over leadership to wait

Re: How to combine third party search data as top results ?

2017-02-01 Thread Joel Bernstein
This type of ranking behavior is what the RankQuery is designed to do. A RankQuery allows you to inject your own TopDocs collector into the query and take full control of the ranking. It's more complex to implement though. Here is an example RankQuery implementation: https://github.com/apache/luce

Re: How to combine third party search data as top results ?

2017-02-01 Thread Joel Bernstein
Also this presentation discusses the RankQuery (Starting on slide 16) http://www.slideshare.net/lucidworks/managed-search-presented-by-jacob-graves-getty-images Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Feb 1, 2017 at 9:58 PM, Joel Bernstein wrote: > This type of ranking behavior is

How long for autoAddReplica?

2017-02-01 Thread Walter Underwood
I added a new node and shut down a node with a shard replica on it. It has been an hour and I don’t see any activity toward making a new replica. The new node and the one I shut down are both 6.4. The rest of the 16-node cluster is 6.2.1. wunder Walter Underwood wun...@wunderwood.org http://obse

6.4.0 collection leader election and recovery issues

2017-02-01 Thread Ravi Solr
Hello, Yesterday I upgraded from 6.0.1 to 6.4.0, and it's been a straight 12-hour debugging spree!! Can somebody kindly help me out of this misery? I have a set of 8 single-shard collections with 3 replicas. As soon as I updated the configs and started the servers one of my collections got