MLT in SolrJ vs. URL?

2013-05-21 Thread Darren Govoni
Hi, I compose a mlt query in a URL and get the queried result back and a list of documents in the "moreLikeThis" section in my browser. When I try to execute the same query in SolrJ setting the same params, I only get the queried result document back and no MLT docs. What's the trick here? th

Re: Geo spatial search with multi-valued locations (SOLR-2155 / lucene-spatial-playground)

2011-08-29 Thread Darren Govoni
It doesn't. On 08/29/2011 01:37 PM, Mike Austin wrote: I've been trying to follow the progress of this and I'm not sure what the current status is. Can someone update me on what is currently in Solr4 and does it support multi-valued location in a single document? I saw that SOLR-2155 was not i

Re: basic solr cloud questions

2011-09-27 Thread Darren Govoni
On 09/27/2011 05:05 PM, Yury Kats wrote: You need to either submit the docs to both nodes, or have a replication setup between the two. Otherwise they are not in sync. I hope that's not the case. :/ My understanding (or hope maybe) is that the new Solr Cloud implementation will support auto-shar

Re: basic solr cloud questions

2011-09-29 Thread Darren Govoni
tand the pieces involved for replication rather than sharding ... perhaps this link would be of help in taking you through it: http://pulkitsinghal.blogspot.com/2011/09/setup-solr-master-slave-replication.html - Pulkit 2011/9/27 Yury Kats: On 9/27/2011 5:16 PM, Darren Govoni wrote: On 09/

Re: basic solr cloud questions

2011-09-29 Thread Darren Govoni
Agree. Thanks also for clarifying. It helps. On 09/29/2011 08:50 AM, Yury Kats wrote: On 9/29/2011 7:22 AM, Darren Govoni wrote: That was kinda my point. The "new" cloud implementation is not about replication, nor should it be. But rather about horizontal scalability where &quo

Sorting on multivalued fields?

2011-10-07 Thread Darren Govoni
Hi, I could be wrong here but my app worked fine with sorting on multivalued date fields before 3.4. But now it is giving an error. *HTTP Status 400 - can not sort on multivalued field: creation_date_dm *Is this a regression/bug? thanks, Darren

Re: Sorting on multivalued fields?

2011-10-07 Thread Darren Govoni
Thanks Markus. I should have caught that, but I skipped a couple releases. ;) On 10/07/2011 06:31 PM, Markus Jelsma wrote: Here's the snippet from the changelog of 3.1.0. 571 * In previous releases, sorting or evaluating function queries on 572 fields that were "multiValued" (either by

EmbeddedSolr + Http?

2011-10-08 Thread Darren Govoni
Hi, Can and EmbeddedSolrServer in one app and an HttpSolrServer (deployed as war) in another share the same solr home, data, config etc? I want to use the Http server for indexing against and the embedded one for searching. Is there a reason why this is bad? thanks, Darren

Re: More new solr cloud questions

2011-10-13 Thread Darren Govoni
Great! Thank you. I'm eager to test it on EC2 whenever its near beta ready. On 10/13/2011 11:51 AM, Ted Dunning wrote: On Thu, Oct 13, 2011 at 1:37 PM, wrote: Hi, I have some questions about the 4.0 solr cloud implementation. 1. I want to have a large cloud of machines on a network. each m

Re: Merging Remote Solr Indexes?

2011-10-19 Thread Darren Govoni
Hi Otis, Yeah, I saw page, but it says for merging cores, which I presume must reside locally to the solr instance doing the merging? What I'm interested in doing is merging across solr instances running on different machines into a single solr running on another machine (programmatically). I

Re: Merging Remote Solr Indexes?

2011-10-19 Thread Darren Govoni
uster? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ From: Darren Govoni To: solr-user@lucene.apache.org Sent: Wednesday, October 19, 2011 5:15 PM Subject: Re: Merging Remote Solr Indexes

Re: Merging Remote Solr Indexes?

2011-10-20 Thread Darren Govoni
Interesting Yury. Thanks. On 10/20/2011 11:00 AM, Yury Kats wrote: On 10/19/2011 5:15 PM, Darren Govoni wrote: Hi Otis, Yeah, I saw page, but it says for merging cores, which I presume must reside locally to the solr instance doing the merging? What I'm interested in doing is me

Re: inconsistent results when faceting on multivalued field

2011-10-21 Thread Darren Govoni
My interpretation of your results are that your FQ found 1281 documents with 1213206 value in sou_codeMetier field. Of those results, 476 also had 1212104 as a value...and so on. Since ALL the results will have the field value in your FQ, then I would expect the "other" values to be equal or less

Query time help

2011-10-30 Thread Darren Govoni
Hi, I am running Solr 3.4 in a glassfish domain for itself. I have about 12,500 documents with a 100 or so fields with the works (stored, termv's, etc). In my webtier code, I use SolrJ and execute a query as such: long querystart = new Date().getTime(); System.out.pr

Re: Solr 3.5 very slow (performance)

2011-11-29 Thread Darren Govoni
Any suspicous activity in the logs? what about disk activity? On 11/29/2011 05:22 PM, Pawel Rog wrote: On Tue, Nov 29, 2011 at 9:13 PM, Chris Hostetter wrote: Let's back up a minute and cover some basics... 1) You said that you built a brand new index on a brand new master server, using Solr

Re: Solr 3.5 very slow (performance)

2011-11-30 Thread Darren Govoni
Monitoring this thread make me ask the question of whether there are standardized performance benchmarks for Solr. Such that they are run and published with each new release. This would affirm its performance under known circumstances, with which people can try in their own environments and compa

Re: Grouping or Facet ?

2011-12-07 Thread Darren Govoni
Yes. That's what I would expect. I guess I didn't understand when you said "The facet counts are the counts of the *values* in that field" Because it seems its the count of the number of matching documents irrespective if one document has 20 values for that field and another 10, the facet coun

Re: Poor performance on distributed search

2011-12-19 Thread Darren Govoni
I see what you are asking. This is an interesting question. It seems inefficient for Solr to apply the requested rows to all shards only to discard most of the results on merge. That would consume lots of resources not used in the final result set. On 12/19/2011 04:32 PM, ku3ia wrote: Uhm, eith

Highlighting in 3.5?

2012-01-02 Thread Darren Govoni
Hi, Can someone tell me if this is correct behavior from Solr. I search on a dynamic field: field_t:[* TO *] I set highlight fields to "field_t,text_t" but I am not searching specifically inside text_t field. The highlights for text_t come back with EVERY WORD. Maybe because of the [* TO

Re: Highlighting in 3.5?

2012-01-02 Thread Darren Govoni
:54 AM, Darren Govoni wrote: Hi, Can someone tell me if this is correct behavior from Solr. I search on a dynamic field: field_t:[* TO *] I set highlight fields to "field_t,text_t" but I am not searching specifically inside text_t field. The highlights for text_t come back with

Re: Highlighting in 3.5?

2012-01-02 Thread Darren Govoni
r it. If there are no query term matches for the df, then getting ALL the field terms highlighted (as it does now) is rather perplexing feature. Darren On 01/02/2012 06:28 PM, Darren Govoni wrote: Hi Juan, Setting that parameter produces the same extraneous results. Here is my query: {!l

Re: How can a distributed Solr setup scale to TB-data, if URL limitations are 4000 for distributed shard search?

2012-01-18 Thread Darren Govoni
Try changing the URI/HTTP/GET size limitation on your app server. On 01/18/2012 05:59 PM, Daniel Bruegge wrote: Hi, I am just wondering how I can 'grow' a distributed Solr setup to an index size of a couple of terabytes, when one of the distributed Solr limitations is max. 4000 characters in UR

Re: How to accelerate your Solr-Lucene appication by 4x

2012-01-18 Thread Darren Govoni
And to be honest, many people on this list are professionals who not only build their own solutions, but also buy tools and tech. I don't see what the big deal is if some clever company has something of imminent value here to share it. Considering that its a rare event. On 01/18/2012 08:28 PM

Re: How to accelerate your Solr-Lucene appication by 4x

2012-01-19 Thread Darren Govoni
h, I agree. On 01/18/2012 11:08 PM, Steven A Rowe wrote: Hi Darren, I think it's rare because it's rare: if this were found to be a useful advertising space, rare would cease to be descriptive of it. But I could be wrong. Steve -Original Message- From: Darren Govoni [mailto

Re: How to accelerate your Solr-Lucene appication by 4x

2012-01-19 Thread Darren Govoni
Agree. There's probably some unwritten etiquette there. On 01/19/2012 05:52 AM, Patrick Plaatje wrote: Partially agree. If just the facts are given, and not a complete sales talk instead, it'll be fine. Don't overdo it like this though. Cheers, Patrick 2012/1/19 Darren Govon

Re: Hierarchical faceting in UI

2012-01-24 Thread Darren Govoni
Yuhao, Ok, let me think about this. A term can have multiple parents. Each of those parents would be 'different', yes? In this case, use a multivalued field for the parent and add all the parent names or id's to it. The relations should be unique. Your UI will associate the correct parent i

Bizarre TFV output

2010-06-18 Thread Darren Govoni
Hi, I am using a recent nightly build of Solr with no significant schema mods. I index a couple documents and view the TFV's in this query.

Re: Bizarre TFV output

2010-06-18 Thread Darren Govoni
5 And some improper stemming (e.g.requir? require.) This seems buggy to me. Are these correct? If so, how can I sort out the legit terms from these messy ones? thanks for any tips! Darren On Fri, 2010-06-18 at 15:33 -0400, Darren Govoni wrote: > Hi, > I am using a recen

Re: Bizarre TFV output

2010-06-18 Thread Darren Govoni
Thanks for the explanation Chris. I'll try it but the term " " strikes me as not very legitimate and the source text is just space bounded words so even if its doing what it is supposed to, I'm not sure this term is helpful in the index. I'm kinda new to TFV's though, so much to learn. On Fri,

Re: Bizarre TFV output

2010-06-18 Thread Darren Govoni
Well stated. You are correct. Here is the field It uses the text field type as its defined in Solr schema. I didn't change it. The input text is a 6 page UTF-8 text document, the relevant line the term seems to be related to. Just a sentence with no specific boundaries. "...perform more quer

Specifiying multiple mlt.fl fields

2010-06-19 Thread Darren Govoni
Hi, I read the wiki and tried about a dozen variations such as: ...&mlt.fl=field1&mlt.fl=field2 and ...&mlt.fl=field1,field2&... to specify more than one MLT field and it won't take. What's the trick? Also, how to do it with SolrJ? Nothing I try works. Solr 4.0 nightly build. Any tips, very

Re: Specifiying multiple mlt.fl fields

2010-06-19 Thread Darren Govoni
It works! Thanks Sascha. I swear I tried that combination. Hehe. On Sat, 2010-06-19 at 21:19 +0200, Sascha Szott wrote: > Hi Darren, > > try mlt.fl=field1 field2 > > Best, > Sascha > > Darren Govoni wrote: > > Hi, > >I read the wiki and tr

preside != president

2010-06-28 Thread Darren Govoni
Hi, It seems to me that because the stemming does not produce grammatically correct stems in many of the cases, search anomalies can occur like the one I am seeing where I have a document with "president" in it and it is returned when I search for "preside", a different word entirely. Is this co

Re: preside != president

2010-06-29 Thread Darren Govoni
ially has a 'dictionary' > >>> of exception words where stemming stops if found, so in your case > >>> president won't be stemmed any further than president (but presidents will > >>> be stemmed to president). You will have to integrate it into solr > >>

Bizarre Terms revisited

2010-06-30 Thread Darren Govoni
Hi, I really think there is something "not quite right" going on here after much study. Here is my findings. Using MLT, I get terms that appear to be long concatenations of words that are space delimited in the original text. I can't think of any reason for these sentence-like terms to exist (s

Facet count != Query count

2010-07-11 Thread Darren Govoni
Hi, What could cause a facet query on a field (say 'name') differ in count from a basic query on the field using the same value? e.g name:'Darren' If there are 10 documents that match this, the facet count should be 10 for 'Darren', and I should get 10 results if I query on the field. But aft

Clustering results limit?

2010-07-21 Thread Darren Govoni
Hi, I am attempting to cluster a query. It kinda works, but where my (regular) query returns 500 results the cluster only shows 1-10 hits for each cluster (5 clusters). Never more than 10 docs and I know its not right. What could be happening here? It should be showing dozens of documents per clus

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
Staszek, Thank you. The cluster response has a maximum of 10 documents in each cluster. I didn't set this limit and the query by itself returns 500+ documents. There should be many more than 10 in each cluster. Does it default to 10 maybe? Or is there a way to say, cluster every result in the que

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
I set the rows=50 on my clustering URL in a browser and it returns more. In my SolrJ, I used ModifiableSolrParams and I set ("rows",50) but it still returns less than 10 for each cluster. Is there a way to set rows wanted with ModifiableSolrParams? thanks and sorry for the double post. Darren

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
Yeah, my results count is 151 and only 21 documents appear in 6 clusters. This is true whether I use URL or SolrJ. When I use carrot workbench and point to my Solr using local clustering, the workbench has numerous clusters and all documents are placed On Thu, 2010-07-22 at 18:06 +0200, Stan

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
lename_s"); params.set("rows","100"); params.set("clustering", "true"); params.set("carrot.snippet", "excerpt_t"); The rows param needs to be a string I think. thanks. On Thu, 2010-07-22 at 19:10 -0400, Darren Govoni wro

Re: MoreLikeThis explanation?

2010-10-21 Thread Darren Govoni
Thank you! On Thu, 2010-10-21 at 23:03 +0900, Koji Sekiguchi wrote: > (10/10/21 20:33), dar...@ontrenet.com wrote: > > Hi, > >Does the latest Solr provide an explanation for results returned by MLT? > > No, but there is an open issue: > > https://issues.apache.org/jira/browse/SOLR-860 > >

Re: MoreLikeThis explanation?

2010-10-22 Thread Darren Govoni
Hi Koji, I tried to apply your patch to the 1.4.0 tagged branch, but it didn't take completely. What branch does it work for? Darren On Thu, 2010-10-21 at 23:03 +0900, Koji Sekiguchi wrote: > (10/10/21 20:33), dar...@ontrenet.com wrote: > > Hi, > >Does the latest Solr provide an explanat

Deploying WAR from trunk, exception

2010-11-05 Thread Darren Govoni
Hi, I built the trunk and deploy the war, but cannot access the admin URL anymore. Error loading class 'org.apache.solr.highlight.MultiColoredScoreOrderFragmentsBuilder This class seems to be missing? thanks, Darren

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
Good write up. And what about "node"? I think there needs to be an official glossary of terms that is sanctioned by the solr team and some terms still ni use may need to be labeled "deprecated". After so many years, its still confusing. --- Original Message --- On 1/3/2013 08:07 AM

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
ferent shards on the same node/box for a partial improvement of performance and fault tolerance. A Solr "cloud' is really a cluster. -- Jack Krupansky -Original Message- From: Darren Govoni Sent: Thursday, January 03, 2013 8:16 AM To: solr-user@lucene.apache.org Subject: RE: Re:

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
of the terminology. So, we're not "sharding shards", but we are "replicating shards". -- Jack Krupansky -Original Message- From: Darren Govoni Sent: Thursday, January 03, 2013 8:51 AM To: solr-user@lucene.apache.org Subject: RE: Re: Terminology question: Core

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
he terminology. So, we're not "sharding shards", but we are "replicating shards". -- Jack Krupansky -Original Message- From: Darren Govoni Sent: Thursday, January 03, 2013 8:51 AM To: solr-user@lucene.apache.org Subject: RE: Re: Terminology question: Core vs. Coll

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
ctual Solr cores. Replicas are instances of the data for a shard. Sometimes people may loosely speak of a replica as being "a shard", but that's just loose use of the terminology. So, we're not "sharding shards", but we are "replicating shards". -- Jack Kru

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
--- On 1/3/2013 09:19 AM Jack Krupansky wrote:A single shard MAY exist on a single core, but only if it is not replicated. Generally, a single shard will exist on multiple cores, each a replica of the source data as it comes into the update handler. -- Jack Krupansky -Original Message--

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
Great point. --- Original Message --- On 1/3/2013 10:42 AM Per Steffensen wrote:On 1/3/13 4:33 PM, Mark Miller wrote: > This has pretty much become the standard across other distributed systems and in the literat…err…books. Hmmm Im not sure you are right about that. Maybe more than one

RE: Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
And based on the previous explanation there is never a "copy of a shard". A shard represents and contains only replicas for itself, replicas being copies of cores within the shard. --- Original Message --- On 1/3/2013 11:58 AM Walter Underwood wrote:A "factor" is multiplied, so multip

Re: Terminology question: Core vs. Collection vs...

2013-01-03 Thread Darren Govoni
, put more of them on a single node and then migrate them to their own nodes as the data outgrows the smaller number of nodes. In other words, add nodes incrementally without having to reindex all the data. -- Jack Krupansky -Original Message- From: Darren Govoni Sent: Thursday, Janua

Re: Terminology question: Core vs. Collection vs...

2013-01-04 Thread Darren Govoni
Yes. In that case, core should best be described as a logical solr entity with various "managed" attributes and qualities above the physical layer (sorry, not trying to perpetuate this thread so much). On 01/04/2013 01:55 PM, Mark Miller wrote: Currently a SolrCore is 1:1 with a low level Luce

RE: RE: Max number of core in Solr multi-core

2013-01-07 Thread Darren Govoni
This should be clarified some. In the "client" API, SolrServer is represents a connection to a single server backend/endpoint and should be re-used where possible. The approach being discussed is to have one "client" connection (represented by SolrServer class) per solr core, all residing in a "

RE: SolrJ and Solr 4.0 | doc.getFieldValue() returns String instead of Date

2013-01-08 Thread Darren Govoni
SimpleDateFormat df= new SimpleDateFormat("-MM-dd'T'hh:mm:ss.S'Z'"); Date dateObj = df.parse("2009-10-29T00:00:009Z"); --- Original Message --- On 1/8/2013 09:34 AM uwe72 wrote:A Lucene 4.0 document returns for a Date field now a string value, instead of a Date object. Solr4.0 --

zk Config URL?

2013-02-24 Thread Darren Govoni
Hi, I'm trying the latest solrcloud 4.1. Is there a button(or url) I can't find that shows me the zookeeper config XML, so I can check what other nodes are connected? Can't seem to find it. I deploy my solrcloud war into glassfish and set jetty.port (among other properties) to the GF domain

Re: zk Config URL?

2013-02-24 Thread Darren Govoni
York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Sun, Feb 24, 2013 at 6:34 PM, Darren Govoni wrote: Hi, I'm trying the latest solrcloud 4.1. Is there a button(or url) I can't find that shows me the zookeeper config XML, so I can check what other nodes are connected? Can&#

Re: zk Config URL?

2013-02-25 Thread Darren Govoni
On 02/24/2013 08:32 PM, Mark Miller wrote: You either have to specifically upload a config set or use one of the bootstrap sys props. Are you doing either? - Mark On Feb 24, 2013, at 8:15 PM, Darren Govoni wrote: Thanks Michael. I went ahead and just started an external zookeeper, but my

SolrCloud war?

2012-01-30 Thread Darren Govoni
Hi, Is there any issue with running the new SolrCloud deployed as a war in another app server? Has anyone tried this yet? thanks.

Re: SolrCloud war?

2012-02-01 Thread Darren Govoni
in an app server? Any tips appreciated! Darren On 01/30/2012 06:58 PM, Darren Govoni wrote: Hi, Is there any issue with running the new SolrCloud deployed as a war in another app server? Has anyone tried this yet? thanks.

Re: Federation in SolrCloud?

2012-02-02 Thread Darren Govoni
Thanks for the reply Mark. I did example A. One of the instances had zookeeper. If I shut down the other instance, all searches on the other (running) instance produced an error in the browser. I don't have the error handy but it was one line. Something like missing shard in collection IIRC.

Re: SolrCloud war?

2012-02-03 Thread Darren Govoni
jetty.port. You probably want to explicitly pass -DhostPort= if you are not going to use jetty.port. - Mark Miller lucidimagination.com On Feb 1, 2012, at 2:44 PM, Darren Govoni wrote: Hi, I'm trying to get the SolrCloud2 examples to work using a war deployed solr into glassfish. The startu

Re: Range facet - Count in facet menu != Count in search results

2012-02-10 Thread Darren Govoni
Double check your default operator for a faceted search vs. regular search. I caught this difference in my work that explained this difference. On Fri, 2012-02-10 at 07:45 -0800, Yuhao wrote: > Jay, > > Was the curly closing bracket "}" intentional? I'm using 3.4, which also > supports "fq=pric

SolrJ + SolrCloud

2012-02-11 Thread Darren Govoni
Hi, Do all the normal facilities of Solr work with SolrCloud from SolrJ? Things like /mlt, /cluster, facets , tvf's, etc. Darren

Re: SolrJ + SolrCloud

2012-02-12 Thread Darren Govoni
Thanks Mark. Is there any plan to make all the Solr search handlers work with SolrCloud, like MLT? That missing feature would prohibit us from using SolrCloud at the moment. :( On Sat, 2012-02-11 at 18:24 -0500, Mark Miller wrote: > On Feb 11, 2012, at 6:02 PM, Darren Govoni wrote: >

maxClauseCount error

2012-02-22 Thread Darren Govoni
Hi, I am suddenly getting a maxclause count error and don't know why. I am using Solr 3.5

maxClauseCount Exception

2012-02-22 Thread Darren Govoni
Hi, I am suddenly getting a maxClauseCount exception for no reason. I am using Solr 3.5. I have only 206 documents in my index. Any ideas? This is wierd. QUERY PARAMS: [hl, hl.snippets, hl.simple.pre, hl.simple.post, fl, hl.mergeContiguous, hl.usePhraseHighlighter, hl.requireFieldMatch, echoPar

Trunk build errors

2012-02-22 Thread Darren Govoni
Hi, I am getting numerous errors preventing a build of solrcloud trunk. [licenses] MISSING LICENSE for the following file: Any tips to get a clean build working? thanks

Re: Building a resilient cluster

2012-03-06 Thread Darren Govoni
What I think was mentioned on this a bit ago is that the index stops working if one of the "nodes" goes down unless its a replica. You have 2 "nodes" running with numShards=2? Thus if one goes down the entire index is inoperable. In the future I'm hoping this changes such that the index cluster co

RE: Solr 4.0 and production environments

2012-03-07 Thread Darren Govoni
As a rule of thumb, many will say not to go to production with a pre-release baseline. So until Solr4 goes "final" and "stable", it's best not to assume too much about it. Second suggestion is to properly stage new technologies in your product such that they go through their own validation. And

Re: maxClauseCount Exception

2012-03-17 Thread Darren Govoni
Thanks for the tip Hoss. I notice that it appears sometimes and was varying because my index runs would sometimes have different amount of docs, etc. So all I want to do is a simple "all docs with something in this field, and to highlight the field". Is the query expansion to "all possible term

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Darren Govoni
I think he's asking if all the nodes (same machine or not) return a response. Presumably you have different ports for each node since they are on the same machine. On Sun, 2012-03-18 at 14:44 -0400, Matthew Parker wrote: > The cluster is running on one machine. > > On Sun, Mar 18, 2012 at 2:07 PM

RE: Re: maxClauseCount Exception

2012-03-19 Thread Darren Govoni
at doesn't really make sense to do at the Solr/Lucene level. All you're saying is that you want that field highlighted. Wouldn't it be much easier to just do this at the app level whenever your field had anything returned in it? Best Erick On Sat, Mar 17, 2012 at 8:07 PM, Darren Govoni

MLT and solrcloud?

2012-03-22 Thread Darren Govoni
Hi, It was mentioned before that SolrCloud has all the capability of regular solr (including handlers) with the exception of the MLT handler. As this is a key capability for Solr, is there work planned to include the MLT in SolrCloud? If so when? Our efforts greatly depend on it. As such, I'm hap

Re: MLT and solrcloud?

2012-03-22 Thread Darren Govoni
Ok, I'll do what I can to help! As always, appreciate the hard work Mark. On Thu, 2012-03-22 at 17:31 -0400, Mark Miller wrote: > On Mar 22, 2012, at 5:22 PM, Darren Govoni wrote: > > > Hi, > > It was mentioned before that SolrCloud has all the capability of >

Custom scoring question

2012-03-29 Thread Darren Govoni
Hi, I have a situation I want to re-score document relevance. Let's say I have two fields: text: The quick brown fox jumped over the white fence. terms: fox fence Now my queries come in as: terms:[* TO *] and Solr scores them on that field. What I want is to rank them according to the distr

Re: Custom scoring question

2012-03-29 Thread Darren Govoni
I'm going to try index time per-field boosting and do the boost computation at index time and see if that helps. On Thu, 2012-03-29 at 10:08 -0400, Darren Govoni wrote: > Hi, > I have a situation I want to re-score document relevance. > > Let's say I have two fields: >

Re: Custom scoring question

2012-03-29 Thread Darren Govoni
nd what to do with it. On Thu, 2012-03-29 at 16:29 -0300, Tomás Fernández Löbbe wrote: > Can't you simply calculate that at index time and assign the result to a > field, then sort by that field. > > On Thu, Mar 29, 2012 at 12:07 PM, Darren Govoni wrote: > > > I

Re: Does any one know when Solr 4.0 will be released.

2012-04-04 Thread Darren Govoni
No one knows. But if you ask the devs, they will say 'when its done'. One clue might be to monitor the bugs/issues scheduled for 4.0. When they are all resolved, then its ready. On Wed, 2012-04-04 at 09:41 -0700, srinivas konchada wrote: > Hello every one > Does any one know when Solr 4.0 will be

Re: Duplicates in Facets

2012-04-04 Thread Darren Govoni
Try using Luke to look at your index and see if there are multiple similar TFV's. You can browse them easily in Luke. On Wed, 2012-04-04 at 23:35 -0400, Jamie Johnson wrote: > I am currently indexing some information and am wondering why I am > getting duplicates in facets. From what I can tell t

No webadmin for trunk?

2012-04-07 Thread Darren Govoni
Hi, Just updated solr trunk and tried the java -jar start.jar and localhost:8983/solr/admin.not found. Where did it go? thanks.

Re: No webadmin for trunk?

2012-04-07 Thread Darren Govoni
; > On Sat, Apr 7, 2012 at 8:23 AM, Darren Govoni wrote: > > Hi, > > Just updated solr trunk and tried the java -jar start.jar and > > localhost:8983/solr/admin.not found. > > > > Where did it go? > > > > thanks. > > >

Re: No webadmin for trunk?

2012-04-07 Thread Darren Govoni
start.jar has no apps in it at all. On Sat, 2012-04-07 at 09:47 -0400, Darren Govoni wrote: > HTTP ERROR: 404 > Problem accessing /solr. Reason: > > Not Found > > > > Powered by Jetty:// > &g

Re: No webadmin for trunk?

2012-04-07 Thread Darren Govoni
Yep. I did all kinds of ant clean, ant dist, ant example, etc. My trunk rev. At revision 1310773. Example start.jar is broken. No webapp inside. :( On Sat, 2012-04-07 at 16:11 +0200, Rafał Kuć wrote: > Hello! > > Did you run 'ant example' ? >

Re: No webadmin for trunk?

2012-04-07 Thread Darren Govoni
K. There is a solr.war in the webapps directory. But still get the 404. On Sat, 2012-04-07 at 16:19 +0200, Rafał Kuć wrote: > Hello! > > start.jar shouldn't contain any webapp. If you look at the 'example' > directory, you'll notice that there is a 'webapps' directory which > should contain solr.

Re: No webadmin for trunk?

2012-04-07 Thread Darren Govoni
Now, it comes up. Not sure why its acting weird. Will continue to look at it. On Sat, 2012-04-07 at 10:23 -0400, Darren Govoni wrote: > K. There is a solr.war in the webapps directory. But still get the 404. > > On Sat, 2012-04-07 at 16:19 +0200, Rafał Kuć wrote: > > Hello! >

Re: upgrade 3.5 to 4.0

2012-04-07 Thread Darren Govoni
In my opinion, its never a good idea to overwrite files of a previous version with a new version. The easiest thing would be to just deploy the solr war file into tomcat and let tomcat manage the webapp, files, etc. On Sat, 2012-04-07 at 22:39 -0400, Dan Foley wrote: > I have download the nightl

RE: Re: Cloud-aware request processing?

2012-04-09 Thread Darren Govoni
"...it is a distributed real-time query scheme..." SolrCloud does this already. It treats all the shards like one-big-index, and you can query it normally to get "subset" results from each shard. Why do you have to re-write the query for each shard? Seems unnecessary. --- Original Message

Re: How to facet data from a multivalued field?

2012-04-09 Thread Darren Govoni
Your handler for that field should be looked at. Try not using a handler that tokenizes or stems the field. You want to leave the text as is. I forget the handler setting for that, but its documented in there somewhere. On Mon, 2012-04-09 at 13:02 -0700, Thiago wrote: > Hello everybody, > > I've

RE: SOLR issue - too many search queries

2012-04-10 Thread Darren Govoni
My first reaction to your question is why are you running thousands of queries in a loop? Immediately, I think this will not scale well and the design probably needs to be re-visited. Second, if you need that many requests, then you need to seriously consider an architecture that supports it.

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-11 Thread Darren Govoni
Hard to say why its not working for you. Start with a fresh Solr and work forward from there or back out your configs and plugins until it works again. On Tue, 2012-04-10 at 17:15 -0400, Benson Margulies wrote: > In my cloud configuration, if I push > > > *:* > > > followed by: > > > > I

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-12 Thread Darren Govoni
You could use SolrCloud (for the automatic scaling) and just mount a fuse[1] HDFS directory and configure solr to use that directory for its data. [1] https://ccp.cloudera.com/display/CDHDOC/Mountable+HDFS On Thu, 2012-04-12 at 16:04 +0300, Ali S Kureishy wrote: > Hi, > > I'm trying to setup a

RE: Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-12 Thread Darren Govoni
el that using SolrCloud would be redundant, and a step in the opposite direction, which is what I'm trying to avoid in the first place. Or am I mistaken? Thanks, Safdar On Thu, Apr 12, 2012 at 4:27 PM, Darren Govoni wrote: > You could use SolrCloud (for the automatic scaling) and just m

RE: Realtime /get versus SearchHandler

2012-04-13 Thread Darren Govoni
Yes --- Original Message --- On 4/13/2012 06:25 AM Benson Margulies wrote:A discussion over on the dev list led me to expect that the by-if field retrievals in a SolrCloud query would come through the get handler. In fact, I've seen them turn up in my search component in the search hand

Re: Monitoring SolrCloud health

2012-04-14 Thread Darren Govoni
Can you be more specific about "health"? On Sat, 2012-04-14 at 00:03 -0400, Jamie Johnson wrote: > How do people currently monitor the health of a solr cluster? Are > there any good tools which can show the health across the entire > cluster? Is this something which is planned for the new admin

Re: hierarchical faceting?

2012-04-18 Thread Darren Govoni
Put the parent term in all the child documents at index time and the re-issue the facet query when you expand the parent using the parent's term. works perfect. On Wed, 2012-04-18 at 10:56 -0400, sam ” wrote: > I have hierarchical colors: > stored="true" multiValued="true"/> > text_path is TextFi

Re: hierarchical faceting?

2012-04-18 Thread Darren Govoni
exactly what PathHierarchyTokenizer does. > > > positionIncrementGap="100"> > > > > > > > > > > > > I think I have a query time tokenizer that tokenizes at / > > > > ?q=colors:red > > ==&

SolrCloud indexing question

2012-04-20 Thread Darren Govoni
Hi, I just wanted to make sure I understand how distributed indexing works in solrcloud. Can I index locally at each shard to avoid throttling a central port? Or all the indexing has to go through a single shard leader? thanks

Re: SolrCloud indexing question

2012-04-20 Thread Darren Govoni
he other replicas. > > On Fri, Apr 20, 2012 at 7:54 AM, Darren Govoni wrote: > > Hi, > > I just wanted to make sure I understand how distributed indexing works > > in solrcloud. > > > > Can I index locally at each shard to avoid throttling a central port? Or

  1   2   >