Re: More references for configuring Solr

2012-11-11 Thread Lance Norskog
LucidFind collects several sources of information in one searchable archive: http://find.searchhub.org/?q=&sort=#%2Fp%3Asolr - Original Message - | From: "Dmitry Kan" | To: solr-user@lucene.apache.org | Sent: Sunday, November 11, 2012 2:24:21 AM | Subject: Re: More references for configu

Re: Integrating Solr with Database

2012-11-11 Thread 122jxgcn
> This might make sense if you were using Solr to search for the > ID of an object in the database with relations to other objects. > However, if all you are doing is retrieving the file path/URL, why > not index that into Solr, and get it directly from Solr? That's what I'm doing right now but si

Re: Integrating Solr with Database

2012-11-11 Thread Gora Mohanty
On 12 November 2012 13:00, 122jxgcn wrote: [...] > What I want to do right now is that whenever the client indexes a document and > uploads the raw file, > the database gets updated with the pair (Document ID in Solr, path of the raw > file inside the server). > So on the search result page, instead of givin
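
The (Document ID in Solr, path of the raw file) mapping described in the quoted message could be sketched roughly as below. This is only an illustration using Python's built-in sqlite3 module; the table name `solr_files`, the column names, and the three helper functions are hypothetical and do not come from the thread.

```python
import sqlite3


def open_mapping(db_path=":memory:"):
    """Open (or create) a hypothetical table mapping Solr IDs to file paths."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS solr_files ("
        " solr_id TEXT PRIMARY KEY,"
        " file_path TEXT NOT NULL)"
    )
    return conn


def record_upload(conn, solr_id, file_path):
    """Store the pair after a document is indexed and its raw file uploaded."""
    # INSERT OR REPLACE keeps a single row per document if a file is re-uploaded.
    conn.execute(
        "INSERT OR REPLACE INTO solr_files (solr_id, file_path) VALUES (?, ?)",
        (solr_id, file_path),
    )
    conn.commit()


def lookup_path(conn, solr_id):
    """Return the stored path for a Solr document ID, or None if unknown."""
    row = conn.execute(
        "SELECT file_path FROM solr_files WHERE solr_id = ?", (solr_id,)
    ).fetchone()
    return row[0] if row else None
```

A search result page would call `lookup_path` with the ID returned by Solr. Whether this extra lookup is preferable to storing the path in Solr itself is the question the thread is debating.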

Integrating Solr with Database

2012-11-11 Thread 122jxgcn
Hello, I'm currently working on a file management system based on Solr. What I have accomplished so far is that I have a Solr server and a Windows client application that run on different computers. When the client indexes a rich document to the Solr server remotely, it also uploads the file itself via FTP. So

Re: zkcli issues

2012-11-11 Thread Jeevanandam Madanagopal
Nick - Sorry, the embedded links were not shown in my previous email. I'm listing them below. > Handy SolrCloud ZkCLI Commands > (http://www.myjeeva.com/2012/10/solrcloud-cluster-single-collection-deployment/#handy-solrcloud-cli-commands) > Uploading Solr Configuration into ZooKeeper ensemble > (http://

Re: zkcli issues

2012-11-11 Thread Jeevanandam Madanagopal
Nick - I believe you're experiencing difficulties with the SolrCloud CLI commands for interacting with ZooKeeper. Please have a look at the links below; they should point you in the right direction. Handy SolrCloud ZkCLI Commands Uploading Solr Configuration into ZooKeeper ensemble Cheers, Jeeva On Nov 12, 2012, at 4:45

Re: customize solr search/scoring for performance

2012-11-11 Thread jchen2000
Yes, we only need term overlap information to choose the top candidates (we may incorporate a boost factor for different terms later, but that's another story). We are quite new to Solr, so we haven't really profiled the process. Is there a rough estimate of the latency to expect in such cases? ou

RE: sort by function error

2012-11-11 Thread Kuai, Ben
More information: the problem only happens when I have both sort by function and grouping in the query. From: Kuai, Ben [ben.k...@sensis.com.au] Sent: Monday, November 12, 2012 2:12 PM To: solr-user@lucene.apache.org Subject: sort by function error Hi I am tr

how to sort the solr suggester's result

2012-11-11 Thread 徐郑
The following is my config; it suggests words well. I want to get a sorted result when it suggests, so I added a transformer that appends a tab (\t) separated float weight string to the end of the Suggestion field, but the suggestion results still aren't sorted correctly. My suggest result (note the

Re: how to sort the solr suggester's result

2012-11-11 Thread eyun
Can anyone help tell me where my mistake is? eyun From: eyun Date: 2012-11-12 11:24 To: solr-user-subscribe Subject: how to sort the solr suggester's result The following is my config; it suggests words well. I want to get a sorted result when it suggests, so I added a transformer, it will

Re: 4.0 query question

2012-11-11 Thread Amit Nithian
Why not group by cid using the grouping component, sort within each group by version descending, and return 1 result per group? http://wiki.apache.org/solr/FieldCollapsing Cheers Amit On Fri, Nov 9, 2012 at 2:56 PM, dm_tim wrote: > I think I may have found my answer but I'd like additional vali
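
As a rough sketch of the grouping suggestion above, the request could be built with Solr's result grouping parameters (`group`, `group.field`, `group.sort`, `group.limit`). The field names `cid` and `version` come from the message. The base URL, the `/select` handler path, the default `q`, and the function name are assumptions made for illustration.

```python
from urllib.parse import urlencode


def latest_version_query(base_url, q="*:*"):
    """Build a select URL returning the newest version of each cid.

    base_url is assumed to point at the Solr webapp (or a core) root.
    """
    params = [
        ("q", q),
        ("group", "true"),               # turn on result grouping
        ("group.field", "cid"),          # one group per cid value
        ("group.sort", "version desc"),  # newest version first inside a group
        ("group.limit", "1"),            # keep only the top document per group
        ("wt", "json"),
    ]
    return base_url.rstrip("/") + "/select?" + urlencode(params)
```

Calling `latest_version_query("http://localhost:8983/solr")` yields a URL that can be fetched with any HTTP client. The grouped response nests documents under each group rather than returning a flat list.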

Re: Preventing accepting queries while custom QueryComponent starts up?

2012-11-11 Thread Amit Nithian
Jack, I think the issue is that the ping which is used to determine whether or not the server is live returns a seemingly false positive back to the load balancer (and indirectly the client) that this server is ready to go when in fact it's not. Reading this page ( http://wiki.apache.org/solr/Solr

Re: custom request handler

2012-11-11 Thread Amit Nithian
Hi Lee, So the query component would be a subclass of SearchComponent and you can define the list of components executed during a search handler. http://wiki.apache.org/solr/SearchComponent I *think* you can have a custom component do what you want as long as it's the first component in the list

Re: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-11 Thread Iwan Hanjoyo
Hi Steiner, I found a video tutorial on Nutch 1.4 + Solr 3.4.0 (on Windows). It did solve my error. I hope it does for yours too. Here is the link: Running Nutch and Solr on Windows Tutorial: Part 1 http://www.youtube.com/watch?v=baxhI6Wkov8 Running Nutch and Solr on Windows Tut

Solr 4.0 - distributed updates without zookeeper?

2012-11-11 Thread Peter Wolanin
Looking at how we could upgrade some of our infrastructure to Solr 4.0 - I would really like to take advantage of distributed updates to get NRT, but we want to keep our fixed master and slave server roles since we use different hardware appropriate to the different roles. Looking at the solr 4.0

Re: zkcli issues

2012-11-11 Thread Mark Miller
On 11/11/2012 04:47 PM, Yonik Seeley wrote: On Sun, Nov 11, 2012 at 10:39 PM, Nick Chase wrote: So I'm trying to use ZkCLI without success. I DID start and stop Solr in non-cloud mode, so everything is extracted and it IS finding zookeeper*.jar. However, now it's NOT finding SolrJ. Not sure a

Re: Internal Vs. External ZooKeeper

2012-11-11 Thread Mark Miller
When SolrCloud is in a steady state (eg the number of nodes in the cluster is not changing and config is not changing), Solr does not really talk to ZooKeeper other than really light stuff like a heartbeat and maintaining a connection. So performance is not likely a large concern here. Mostly

Re: Internal Vs. External ZooKeeper

2012-11-11 Thread Anirudha Jadhav
Let me see if I get this correctly: the greater the number of ZooKeeper nodes, the more time it takes to come to a consensus. During an indexing operation, how many times does a Solr client need to contact ZooKeeper for consensus? Per doc? Per commit? Thanks, Ani On Sun, Nov 11, 2012 at 11:1
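
For context on the consensus point, ZooKeeper requires a majority of the ensemble to acknowledge each write, so the arithmetic can be sketched as below. The function names are hypothetical, and the sketch says nothing about how often Solr itself contacts ZooKeeper, which is the open question in this message.

```python
def quorum_size(ensemble_size):
    """Smallest number of ZooKeeper servers that forms a majority."""
    if ensemble_size < 1:
        raise ValueError("ensemble_size must be at least 1")
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size):
    """How many servers can be down while a majority is still reachable."""
    return ensemble_size - quorum_size(ensemble_size)
```

Under this arithmetic a 4-node ensemble tolerates no more failures than a 3-node one, which is one reason odd ensemble sizes are commonly recommended.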

Re: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-11 Thread Dave Meikle
Hi, On 8 Nov 2012, at 15:00, Markus Jelsma wrote: > Hm, i copied the schema from Nutch' trunk verbatim and only had to change the > stemmer. It seems like you have, for some reason, a float with an extra > point dangling around somewhere. Can you check? Just building a Nutch 1.5.1 environmen

Re: zkcli issues

2012-11-11 Thread Yonik Seeley
On Sun, Nov 11, 2012 at 10:39 PM, Nick Chase wrote: > So I'm trying to use ZkCLI without success. I DID start and stop Solr in > non-cloud mode, so everything is extracted and it IS finding zookeeper*.jar. > However, now it's NOT finding SolrJ. Not sure about your specific problem in this case,

zkcli issues

2012-11-11 Thread Nick Chase
OK, so this is my ZooKeeper week, sorry. :) So I'm trying to use ZkCLI without success. I DID start and stop Solr in non-cloud mode, so everything is extracted and it IS finding zookeeper*.jar. However, now it's NOT finding SolrJ. I even tried to run it from the provided script (in cloud-sc

Re: Internal Vs. External ZooKeeper

2012-11-11 Thread Nick Chase
Thanks, Jack, this is a great explanation! And since a greater number of ZK nodes tends to degrade write performance, that would be a factor in making every Solr node a ZK node as well. Much obliged! Nick On 11/11/2012 10:45 AM, Jack Krupansky wrote: "Production" typically implies "hi

Re: Preventing accepting queries while custom QueryComponent starts up?

2012-11-11 Thread Jack Krupansky
Is the issue here that the Solr node is continuously "live" with the load balancer so that the moment during startup that Solr can respond to anything, the load balancer will be sending it traffic and that this can occur while Solr is still warming up? First, shouldn't we be encouraging people

Re: Internal Vs. External ZooKeeper

2012-11-11 Thread Jack Krupansky
"Production" typically implies "high availability" and in a distributed system the goal is that the overall cluster integrity and performance should not be compromised just because a few "worker" nodes go down. Solr nodes do a lot of complex operations and are quite prone to running into "issues

Internal Vs. External ZooKeeper

2012-11-11 Thread Nick Chase
OK, I can't find a definitive answer on this. The wiki says not to use the embedded ZooKeeper servers for production. But my question is: why not? Basically, what are the reasons and circumstances that make you better off using an external ZooKeeper ensemble? Thanks... Nick

Re: custom request handler

2012-11-11 Thread Lee Carroll
Only slaves are public facing and they are read only, with limited query request handlers defined. The above approach is to prevent abusive / inappropriate queries by clients. A query component sounds interesting. Would this be implemented through an interface, so it could be separate from Solr, or woul

Re: How to insert documents into differenet indexes

2012-11-11 Thread Gora Mohanty
On 11 November 2012 15:06, tomw wrote: [...] > Thanks, that seems to work. Life can be so simple. Unfortunately this > case isn't mentioned in any of the sections covering updates in the > wiki. While this could be made clearer, it should not be very difficult to guess at the update URL for a spe

Re: More references for configuring Solr

2012-11-11 Thread Dmitry Kan
Hi, here are some resources: http://wiki.apache.org/solr/ (Solr wiki) http://lucene.apache.org/solr/books.html (books published on Solr) Then comes googling on a specific topic. But reading a book first might not be a bad idea. -- Dmitry On Sat, Nov 10, 2012 at 1:15 PM, FARAHZADI, EMAD wrote:

Re: How to insert documents into differenet indexes

2012-11-11 Thread tomw
> Just use the update handler that is specific to a given core. For > example if you have two cores named core1 and core2, you should use > the following addresses (if you didn't change the default > configuration): > > /solr/core1/update/ > > and > > /solr/core2/update/ > Thanks, that seems t

Re: How to insert documents into differenet indexes

2012-11-11 Thread Rafał Kuć
Hello! Just use the update handler that is specific to a given core. For example if you have two cores named core1 and core2, you should use the following addresses (if you didn't change the default configuration): /solr/core1/update/ and /solr/core2/update/ -- Regards, Rafał Kuć Sematext :
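
A minimal sketch of the per-core update addresses described above, using only the Python standard library. The host and port, the JSON payload format, and the function name are assumptions. Depending on the Solr version and configuration, JSON may need a different handler path (for example `/update/json`).

```python
import json
from urllib.request import Request


def build_update_request(core, docs, base_url="http://localhost:8983/solr"):
    """Prepare (but do not send) a POST to the update handler of one core."""
    # The core name selects the index: /solr/<core>/update/
    url = "{}/{}/update/".format(base_url.rstrip("/"), core)
    body = json.dumps(docs).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send the documents to that core only. Using `"core2"` instead of `"core1"` targets the other index, and a commit is still needed before the documents become searchable.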

How to insert documents into differenet indexes

2012-11-11 Thread tomw
Hi, I've set up a Solr instance with multiple cores to be able to use different indexes for different applications. The point I'm struggling with is how do I insert documents into the index running on a specific core? Any clue appreciated. best -- tomw