Re: "TokenStream contract violation: close() call missing" error in 4.9.0

2015-06-09 Thread Benson Margulies
What tokenizer are you using? I think, but I'm not entirely sure, that this would require a bug in a tokenizer. On Tue, Jun 9, 2015 at 10:21 AM, Ryan, Michael F. (LNG-DAY) wrote: > I'm using Solr 4.9.0. I'm trying to figure out what would cause an error like > this to occur a rare, non-determin

Re: Korean script conversion

2015-03-30 Thread Benson Margulies
Why do you think that this is a good idea? Hanja are used for special purposes; they are not trivally convertable to Hanjul due to ambiguity, and it's not at all clear that a typical search user wants to treat them as equivalent. On Sun, Mar 29, 2015 at 1:52 AM, Eyal Naamati < eyal.naam...@exlibri

Re: qt.shards in solrconfig.xml

2015-02-26 Thread Benson Margulies
> mkhlud...@griddynamics.com> wrote: > >> Hello, >> >> Giving >> >> http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201301.mbox/%3c711daae5-c366-4349-b644-8e29e80e2...@gmail.com%3E >> you can add qt.shards into handler defaults/invariants.

qt.shards in solrconfig.xml

2015-02-26 Thread Benson Margulies
A query I posted yesterday amounted to me forgetting that I have to set qt.shards when I use a URL other than plain old '/select' with SolrCloud. Is there any way to configure a query handler to automate this, so that all queries addressed to '/RNI' get that added in?

Customized search handler components and cloud

2015-02-25 Thread Benson Margulies
We have a pair of customized search components which we used successfully with SolrCloud some releases back (4.x). In 4.10.3, I am trying to find the point of departure in debugging why we get no results back when querying to them with a sharded index. If I query the regular /select, all is swell.

Re: 8 Shards of Cloud with 4.10.3.

2015-02-25 Thread Benson Margulies
.10.3 -Duser.timezone=UTC -Djava.net.preferIPv4Stack=true -DzkRun -jar start.jar & and then: curl 'http://localhost:8983/solr/admin/collections?action=CREATE&name=rni&numShards=8&replicationFactor=1&collection.configName=rni&maxSh\ ardsPerNode=8' On Wed, Feb

Re: 8 Shards of Cloud with 4.10.3.

2015-02-25 Thread Benson Margulies
erverCnxn.java:228) at org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:208) at java.lang.Thread.run(Thread.java:745) On Wed, Feb 25, 2015 at 10:52 AM, Shawn Heisey wrote: > On 2/25/2015 8:35 AM, Benson Margulies wrote: >> Do I need a zkcli bootstrap or do I start with upconfig? W

Re: 8 Shards of Cloud with 4.10.3.

2015-02-25 Thread Benson Margulies
Do I need a zkcli bootstrap or do I start with upconfig? What port does zkRun put zookeeper on? On Feb 25, 2015 10:15 AM, "Shawn Heisey" wrote: > On 2/25/2015 7:44 AM, Benson Margulies wrote: > > Shawn, I _am_ starting from clean. However, I didn't find a recipe for &

Re: 8 Shards of Cloud with 4.10.3.

2015-02-25 Thread Benson Margulies
org.apache.solr.update.processor.UpdateRequestProcessor.processDelete(UpdateRequestProcessor.java:55) On Wed, Feb 25, 2015 at 9:44 AM, Benson Margulies wrote: > On Wed, Feb 25, 2015 at 8:04 AM, Shawn Heisey wrote: >> On 2/25/2015 5:50 AM, Benson Margulies wrote: >>> So, found the following line in the guid

Re: 8 Shards of Cloud with 4.10.3.

2015-02-25 Thread Benson Margulies
On Wed, Feb 25, 2015 at 8:04 AM, Shawn Heisey wrote: > On 2/25/2015 5:50 AM, Benson Margulies wrote: >> So, found the following line in the guide: >> >>java -DzkRun -DnumShards=2 >> -Dbootstrap_confdir=./solr/collection1/conf >> -Dcollection.configName=my

Re: 8 Shards of Cloud with 4.10.3.

2015-02-25 Thread Benson Margulies
com.basistech.rni.index.internal.SolrCloudEvaluationNameIndex.(SolrCloudEvaluationNameIndex.java:53) with corresponding log traffic in the solr log. The cloud page in the Solr admin app shows the IP address in green. It's a bit hard to read in general, it's all squished up to the top. On Tue, Feb 24, 2015 at 4:33 PM, Benson Margul

Re: 8 Shards of Cloud with 4.10.3.

2015-02-24 Thread Benson Margulies
On Tue, Feb 24, 2015 at 4:27 PM, Chris Hostetter wrote: > > : Unfortunately, this is all 5.1 and instructs me to run the 'start from > : scratch' process. > > a) checkout the left nav of any ref guide page webpage which has a link to > "Older Versions of this Guide (PDF)" > > b) i'm not entirely s

Re: 8 Shards of Cloud with 4.10.3.

2015-02-24 Thread Benson Margulies
On Tue, Feb 24, 2015 at 3:32 PM, Michael Della Bitta wrote: > https://cwiki.apache.org/confluence/display/solr/SolrCloud Unfortunately, this is all 5.1 and instructs me to run the 'start from scratch' process. I wish that I could take my existing one-core no-cloud config and convert it into a cl

Re: 8 Shards of Cloud with 4.10.3.

2015-02-24 Thread Benson Margulies
le.com/u/0/b/112002776285509593336/112002776285509593336/posts> > w: appinions.com <http://www.appinions.com/> > > On Tue, Feb 24, 2015 at 1:27 PM, Benson Margulies > wrote: > >> With so much of the site shifted to 5.0, I'm having a bit of trouble >> finding wh

8 Shards of Cloud with 4.10.3.

2015-02-24 Thread Benson Margulies
With so much of the site shifted to 5.0, I'm having a bit of trouble finding what I need, and so I'm hoping that someone can give me a push in the right direction. On a big multi-core machine, I want to set up a configuration with 8 (or perhaps more) nodes treated as shards. I have some very parti

Having a spot of trouble setting up /browse

2015-02-16 Thread Benson Margulies
So, I had set up a solr core modelled on the 'multicore' example in 4.10.3, which has no /browse. Upon request, I went to set up /browse. I copied in a minimal version. When I go there, I just get some XML back: 0 4 What else does /browse depend upon?

codec factory versus posting format versus documentation

2015-02-10 Thread Benson Margulies
I think perhaps there is a minor doc drought, or perhaps just I'm having an SEO bad hair day. I'm trying to understand the relationship of codecFactory and postingFormat. Experiment 1: I just want to use my own codec. So, I make a CodecFactory, declare it in solrconfig.xml, and stand back? If so,

Re: Complaint of multiple /updates but solrconfig.xml has one

2015-02-09 Thread Benson Margulies
OK, I see, I forgot to include the core name in the URL. On Mon, Feb 9, 2015 at 8:27 PM, Benson Margulies wrote: > I see https://issues.apache.org/jira/browse/SOLR-6302 but I don't see > what I am supposed to do about it. > > On Mon, Feb 9, 2015 at 8:19 PM, Benson Margulies

log location when using bin/start

2015-02-09 Thread Benson Margulies
Running bin/start with a command like: /data/solr-4.10.3/bin/solr start -s $PWD/solr_home -a "-Djava.library.path=$libdir -Dbt.root=$bt_root\ " "$@" I note that the logs are ending up in the solr install dir/examples/logs. Can I move them?

Complaint of multiple /updates but solrconfig.xml has one

2015-02-09 Thread Benson Margulies
4.10.3: Customized solrconfig.xml. My log shows: 2/9/2015, 8:14:44 PMWARNRequestHandlersMultiple requestHandler registered to the same name: /update ignoring: org.apache.solr.handler.UpdateRequestHandler But there is only one: RNI And all attempts to post with the simple

Re: Complaint of multiple /updates but solrconfig.xml has one

2015-02-09 Thread Benson Margulies
I see https://issues.apache.org/jira/browse/SOLR-6302 but I don't see what I am supposed to do about it. On Mon, Feb 9, 2015 at 8:19 PM, Benson Margulies wrote: > 4.10.3: Customized solrconfig.xml. > > My log shows: > > 2/9/2015, 8:14:44 PMWARNRequestHandlersMultiple request

Re: Is there any sentence tokenizers in sold 4.9.0?

2014-09-12 Thread Benson Margulies
Basis Technology's toolset includes sentence boundary detectors. Please contact me for more details. On Fri, Sep 12, 2014 at 1:15 AM, Sandeep B A wrote: > Hi All, > Sorry for the delayed response. > I was out of office for last few days and was not able to reply. > Thanks for the information. >

Re: Business Name spell check

2014-08-31 Thread Benson Margulies
Trying to shoehorn business name resolution or correction purely into Solr tokenization and spell checking is not, in my opinion, a viable approach. It seems to me that you need a query parser that does something very different from pure tokenization, and you might also need a more complex approach

Re: Solr Japanese support

2014-03-16 Thread Benson Margulies
Your problem has nothing to do with Japanese. Perhaps a content-type for CSV would work better? On Sat, Mar 15, 2014 at 12:50 PM, Bala Iyer wrote: > Hi, > > I am new to Solr japanese. > I added the support for japanese on schema.xml > How can i insert Japanese text into that field either by solr

Mixing lucene scoring and other scoring

2014-03-06 Thread Benson Margulies
Some months ago, I talked to some people at LR about this, but I can't find my notes. Imagine a function of some fields that produces a score between 0 and 1. Imagine that you want to combine this score with relevance over some more or less complex ordinary query. What are the options, given the

(lack) of error for missing library?

2014-02-08 Thread Benson Margulies
is the comment, but when I put a completely missing path in there -- no error. Should I file a JIRA?

A bit lost in the land of schemaless Solr

2014-02-08 Thread Benson Margulies
Say that I have 10 fieldTypes for 10 languages. Is there a way to associate a naming convention from field names to field types so that I can avoid bothering with all those dynamic fields?

Re: Multi Lingual Analyzer

2014-01-20 Thread Benson Margulies
MT is not nearly good enough to allow approach 1 to work. On Mon, Jan 20, 2014 at 9:25 AM, Erick Erickson wrote: > It Depends (tm). Approach (2) will give you better, more specific > search results. (1) is simpler to implement and might be "good > enough"... > > > > On Mon, Jan 20, 2014 at 5:21 A

Re: Tracking down the input that hits an analysis chain bug

2014-01-16 Thread Benson Margulies
I think that https://issues.apache.org/jira/browse/SOLR-5623 should be ready to go. Would someone please commit from the PR? If there's a preference, I can attach a patch as well. On Fri, Jan 10, 2014 at 1:37 PM, Benson Margulies wrote: > Thanks, that's the recipe that I need. >

Re: Analyzers versus Tokenizers/TokenFilters

2014-01-15 Thread Benson Margulies
benson On Wed, Jan 15, 2014 at 5:09 PM, Ahmet Arslan wrote: > Hi Benson, > > Using lucene analyzer in schema.xlm should be last resort. For very specific > reasons : if you have an existing analyzer, etc. > > Ahmet > > > On Wednesday, January 15, 2014 11:52 PM, Benso

Analyzers versus Tokenizers/TokenFilters

2014-01-15 Thread Benson Margulies
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters never mentions an Analyzer class. http://wiki.apache.org/solr/SolrPlugins talks about subclasses of SolrAnalyzer as ways of delivering an entire analysis chain and still 'minding the gap'. Anyone care to offer a comparison of the viewpoi

Re: Tracking down the input that hits an analysis chain bug

2014-01-10 Thread Benson Margulies
Thanks, that's the recipe that I need. On Fri, Jan 10, 2014 at 11:40 AM, Chris Hostetter wrote: > > : Is there a neighborhood of existing tests I should be visiting here? > > You'll need a custom schema that refers to your new > MockFailOnCertainTokensFilterFactory, so i would create a completley

Re: Tracking down the input that hits an analysis chain bug

2014-01-10 Thread Benson Margulies
Is there a neighborhood of existing tests I should be visiting here? On Fri, Jan 10, 2014 at 11:27 AM, Benson Margulies wrote: > OK, patch forthcoming. > > On Fri, Jan 10, 2014 at 11:23 AM, Chris Hostetter > wrote: >> >> : The problem manifests as this sort of thing: &g

Re: Tracking down the input that hits an analysis chain bug

2014-01-10 Thread Benson Margulies
OK, patch forthcoming. On Fri, Jan 10, 2014 at 11:23 AM, Chris Hostetter wrote: > > : The problem manifests as this sort of thing: > : > : Jan 3, 2014 6:05:33 PM org.apache.solr.common.SolrException log > : SEVERE: java.lang.IllegalArgumentException: startOffset must be > : non-negative, and endO

Re: Tracking down the input that hits an analysis chain bug

2014-01-04 Thread Benson Margulies
text in the logs? > > -Mike > > > On 01/03/2014 03:33 PM, Benson Margulies wrote: >> >> Robert, >> >> Yes, if the problem was not data-dependent, indeed I wouldn't need to >> index anything. However, I've run a small mountain of data throug

Re: Tracking down the input that hits an analysis chain bug

2014-01-03 Thread Benson Margulies
be able to detect > this as well... > > On Fri, Jan 3, 2014 at 1:56 PM, Benson Margulies wrote: >> Using Solr Cloud with 4.3.1. >> >> We've got a problem with a tokenizer that manifests as calling >> OffsetAtt.setOffsets() with invalid inputs. OK, so, we want

Tracking down the input that hits an analysis chain bug

2014-01-03 Thread Benson Margulies
Using Solr Cloud with 4.3.1. We've got a problem with a tokenizer that manifests as calling OffsetAtt.setOffsets() with invalid inputs. OK, so, we want to figure out what input provokes our code into getting into this pickle. The problem happens on SolrCloud nodes. The problem manifests as this

TokenizerFactory from 4.2.0 to 4.3.0

2013-09-16 Thread Benson Margulies
TokenizerFactory changed, incompatibly with subclasses, from 4.2.0 to 4.3.0. Subclasses must now implement a different overload of create, and may not implement the old one. Has anyone got any devious strategies other than multiple copies of code to deal with this when supporting multiple versions

Re: Solr Patent

2013-09-15 Thread Benson Margulies
I am not a lawyer. The Apache Software Foundation cannot 'protect Solr developers.' Patent infringement is a claim made against someone who derived economic benefit from an invention, not someone who writes code. The patent clause in the Apache License requires people who contribute code to gran

SOLR-4872 and LUCENE-2145 (or, how to clean up a Tokenizer)

2013-06-12 Thread Benson Margulies
Could I have some help on the combination of these two? Right now, it appears that I'm stuck with a finalizer to chase after native resources in a Tokenizer. Am I missing something?

Re: Seeming bug in ConcurrentUpdateSolrServer

2013-05-29 Thread Benson Margulies
s 1/2-full. If the idea is that we want to pile up 'a lot' (1/2-of-a-q) of work before sending any of it, why start that first runner? On Wed, May 29, 2013 at 2:45 PM, Benson Margulies wrote: > Ah. So now I have to find some other explanation of why it never > creates more than on

Re: Seeming bug in ConcurrentUpdateSolrServer

2013-05-29 Thread Benson Margulies
Ah. So now I have to find some other explanation of why it never creates more than one thread, even when I make a very deep queue and specify 6 threads. On Wed, May 29, 2013 at 2:25 PM, Shalin Shekhar Mangar wrote: > On Wed, May 29, 2013 at 11:29 PM, Benson Margulies > wrote: > >&

Seeming bug in ConcurrentUpdateSolrServer

2013-05-29 Thread Benson Margulies
The comment here is clearly wrong, since there is no division by two. I think that the code is wrong, because this results in not starting runners when it should start runners. Am I misanalyzing? if (runners.isEmpty() || (queue.remainingCapacity() < queue.size() // queue // is // ha

How can a Tokenizer be CoreAware?

2013-05-29 Thread Benson Margulies
I am currently testing some things with Solr 4.0.0. I tried to make a tokenizer CoreAware, and was rewarded with: Caused by: org.apache.solr.common.SolrException: Invalid 'Aware' object: com.basistech.rlp.solr.RLPTokenizerFactory@19336006 -- org.apache.solr.util.plugin.SolrCoreAware must be an ins

Not so concurrent concurrency

2013-05-28 Thread Benson Margulies
I can't quite apply SolrMeter to my problem, so I did something of my own. The brains of the operation are the function here. This feeds a ConcurrentUpdateSolrServer about 95 documents, each about 10mb, and 'threads' is six. Yet Solr just barely uses more than one core. private long doIterati

Benchmarking Solr

2013-05-26 Thread Benson Margulies
I'd like to run a repeatable test of having Solr ingest a corpus of docs on disk, to measure the speed of some alternative things plugged in. Anyone have some advice to share? One approach would be a quick SolrJ program that pushed the entire stack as one giant collection with a commit at the end.

Re: solr.xml or its successor in the wiki

2013-05-20 Thread Benson Margulies
mpletely correct, > mind you) is that the presence of a tag defines which checks > are performed. Errors are thrown on old-style constructs when no > tag is present and vice-versa. > > Best > Erick > > > On Sun, May 19, 2013 at 7:20 PM, Benson Margulies > wrote:

Re: solr.xml or its successor in the wiki

2013-05-19 Thread Benson Margulies
One point of confusion: Is the compatibility code I hit trying to prohibit the 'str' form when it sees old-fangled cores? Or when the current running version pre-5.0? I hope it's the former. On Sun, May 19, 2013 at 6:47 PM, Shawn Heisey wrote: > On 5/19/2013 4:38 PM, Bens

Re: solr.xml or its successor in the wiki

2013-05-19 Thread Benson Margulies
Shawn, thanks. need any more jiras on this? On May 19, 2013, at 6:37 PM, Shawn Heisey wrote: > On 5/19/2013 11:27 AM, Benson Margulies wrote: >> Starting with the shipped solr.xml, I added a new-style child to >> configure a shared lib, and i was rewarded with:

Re: solr.xml or its successor in the wiki

2013-05-19 Thread Benson Margulies
on a fork between 4791 and this. On Sun, May 19, 2013 at 12:52 PM, Benson Margulies wrote: > OK, I found the successor. > > On Sun, May 19, 2013 at 12:40 PM, Benson Margulies > wrote: >> I found http://wiki.apache.org/solr/Solr.xml%204.3%20and%20beyond, but >> it doesn&#x

Re: solr.xml or its successor in the wiki

2013-05-19 Thread Benson Margulies
OK, I found the successor. On Sun, May 19, 2013 at 12:40 PM, Benson Margulies wrote: > I found http://wiki.apache.org/solr/Solr.xml%204.3%20and%20beyond, but > it doesn't mention the successor to sharedLib. > > On Sun, May 19, 2013 at 12:02 PM, Benson Margulies

Re: solr.xml or its successor in the wiki

2013-05-19 Thread Benson Margulies
I found http://wiki.apache.org/solr/Solr.xml%204.3%20and%20beyond, but it doesn't mention the successor to sharedLib. On Sun, May 19, 2013 at 12:02 PM, Benson Margulies wrote: > http://wiki.apache.org/solr/ConfiguringSolr > > does not point to any information on solr.xml. &

solr.xml or its successor in the wiki

2013-05-19 Thread Benson Margulies
http://wiki.apache.org/solr/ConfiguringSolr does not point to any information on solr.xml. Given https://issues.apache.org/jira/browse/SOLR-4791, I'm a bit confused, and I need to set up a sharedLib directory for 4.3.0. I would do some writing or linking if I had some raw material ...

Re: wiki versus downloads versus archives

2013-05-16 Thread Benson Margulies
tanks. On Thu, May 16, 2013 at 4:28 PM, Shawn Heisey wrote: > On 5/16/2013 2:21 PM, Benson Margulies wrote: > >> http://wiki.apache.org/solr/**Solr3.1<http://wiki.apache.org/solr/Solr3.1>claims >> that Solr3.1 is available in a >> place where it is not, and I

wiki versus downloads versus archives

2013-05-16 Thread Benson Margulies
http://wiki.apache.org/solr/Solr3.1 claims that Solr3.1 is available in a place where it is not, and I can't find a link on the front page to the archive for old releases.

A request handler that manipulated the index

2013-04-02 Thread Benson Margulies
I am thinking about trying to structure a problem as a Solr plugin. The nature of the plugin is that it would need to read and write the lucene index to do its work. It could not be cleanly split into URP 'over here' and a Search Component 'over there'. Are there invariants of Solr that would prec

Solr1.4 and threads ....

2012-06-13 Thread Benson Margulies
We've got a tokenizer which is quite explicitly coded on the assumption that it will only be called from one thread at a time. After all, what would it mean for two threads to make interleaved calls to the hasNext() function()? Yet, a customer of ours with a gigantic instance of Solr 1.4 reports i

Re: Why would solr norms come up different from Lucene norms?

2012-05-05 Thread Benson Margulies
On Fri, May 4, 2012 at 6:30 AM, Benson Margulies > wrote: >> So, I've got some code that stores the same documents in a Lucene >> 3.5.0 index and a Solr 3.5.0 instance. It's only five documents. >> >> For a particular field, the Solr norm is always 0.625, while th

Why would solr norms come up different from Lucene norms?

2012-05-04 Thread Benson Margulies
So, I've got some code that stores the same documents in a Lucene 3.5.0 index and a Solr 3.5.0 instance. It's only five documents. For a particular field, the Solr norm is always 0.625, while the Lucene norm is .5. I've watched the code in NormsWriterPerField in both cases. In Solr we've got .57

Re: Latest solr4 snapshot seems to be giving me a lot of unhappy logging about 'Log4j', should I be concerned?

2012-05-01 Thread Benson Margulies
395) >> at org.apache.solr.core.CoreContainer.load(CoreContainer.java:355) >> at >> org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:304) >> at >> org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:101) >> >> >> >> On Tue

Re: Latest solr4 snapshot seems to be giving me a lot of unhappy logging about 'Log4j', should I be concerned?

2012-05-01 Thread Benson Margulies
logging as a by product. > > Don't remember the issue # offhand. I think there was a dispute about what > should be done with it. > > On May 1, 2012, at 11:14 AM, Benson Margulies wrote: > >> CoreContainer.java, in the method 'load', finds itself calling >&

Latest solr4 snapshot seems to be giving me a lot of unhappy logging about 'Log4j', should I be concerned?

2012-05-01 Thread Benson Margulies
CoreContainer.java, in the method 'load', finds itself calling loader.NewInstance with an 'fname' of Log4j of the slf4j backend is 'Log4j'. e.g.: 2012-05-01 10:40:32,367 org.apache.solr.core.CoreContainer - Unable to load LogWatcher org.apache.solr.common.SolrException: Error loading class 'Log4

Re: Unsubscribe does not appear to be working

2012-04-27 Thread Benson Margulies
There is no such thing as a 'solr forum' or a 'solr forum account.' If you are subscribed to this list, an email to the unsubscribe address will unsubscribe you. If some intermediary or third party is forwarding email from this list to you, no one here can help you. On Fri, Apr 27, 2012 at 12:09

Re: Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Benson Margulies
2012/4/24 Mindaugas Žakšauskas : > Hi, > > I maintain a distributed system which Solr is part of. The data which > is kept is Solr is "permissioned" and permissions are currently > implemented by taking the original user query, adding certain bits to > it which would make it return less data in the

Is there such as thing as FQ on a subquery?

2012-04-16 Thread Benson Margulies
I found myself wanting to write ... OR _query_:{!lucene fq=\"a:b\"}c:d And then I started looking at query trees in the debugger, and found myself thinking that there's no possible representation for this -- a subquery with a filter, since the filters are part of the RequestBuilder, no

Re: It's hard to google on _val_

2012-04-15 Thread Benson Margulies
On Sun, Apr 15, 2012 at 12:14 PM, Yonik Seeley wrote: > On Sun, Apr 15, 2012 at 11:34 AM, Benson Margulies > wrote: >> So, I've been experimenting to learn how the _val_ participates in scores. >> >> It seems to me that http://wiki.apache.org/solr/FunctionQuery shou

It's hard to google on _val_

2012-04-15 Thread Benson Margulies
So, I've been experimenting to learn how the _val_ participates in scores. It seems to me that http://wiki.apache.org/solr/FunctionQuery should explain the *effect* of including an _val_ term in an ordinary query, starting with a constant. http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_ch

Re: Questions about the query function

2012-04-15 Thread Benson Margulies
Since I ended up with 'fund' instead of 'func' we're even. I made the edit. I'd make some more if you answered more of my questions :-) On Sun, Apr 15, 2012 at 9:42 AM, Erik Hatcher wrote: > >>> _val_ would work too, or of course using that function as a parameter to >>> (e)dismay's bf, or disma

Re: Questions about the query function

2012-04-15 Thread Benson Margulies
is clear? Yup. > > _val_ would work too, or of course using that function as a parameter to > (e)dismay's bf, or dismay's boost params. > >        Erik > > > > On Apr 15, 2012, at 08:43 , Benson Margulies wrote: > >> I've been pestering you

Questions about the query function

2012-04-15 Thread Benson Margulies
I've been pestering you all with a series of questions about disassembling and partially rescoring queries. Every helpful response (thanks) has led me to further reading, and this leads to more questions. If I haven't before, I'll apologize now for the high level of ignorance at which I'm starting.

Re: Can I discover what part of a score is attributable to a subquery?

2012-04-14 Thread Benson Margulies
just our query, to retrieve the scores, should be pretty speedy for a mere 200 items. Maybe I'm missing some even easier way, given a DocList and a query, to obtain scores for those docs for that query? > > paul > > Le 14 avr. 2012 à 15:34, Benson Margulies a écrit : >

Re: Can I discover what part of a score is attributable to a subquery?

2012-04-14 Thread Benson Margulies
can dig. > Paul > -- > Envoyé de mon téléphone Android avec K-9 Mail. Excusez la brièveté. > > > Benson Margulies a écrit : > > Given a query including a subquery, is there any way for me to learn > that subquery's contribution to the overall document score? >

Re: Can I discover what part of a score is attributable to a subquery?

2012-04-13 Thread Benson Margulies
On Fri, Apr 13, 2012 at 7:07 PM, Chris Hostetter wrote: > > : Given a query including a subquery, is there any way for me to learn > : that subquery's contribution to the overall document score? > > You have to just execute the subquery itself ... doc collection > and score calculation doesn't kee

Re: Can I discover what part of a score is attributable to a subquery?

2012-04-13 Thread Benson Margulies
On Fri, Apr 13, 2012 at 6:43 PM, John Chee wrote: > On Fri, Apr 13, 2012 at 2:40 PM, Benson Margulies > wrote: >> Given a query including a subquery, is there any way for me to learn >> that subquery's contribution to the overall document score? I need this numb

Realtime /get versus SearchHandler

2012-04-13 Thread Benson Margulies
A discussion over on the dev list led me to expect that the by-if field retrievals in a SolrCloud query would come through the get handler. In fact, I've seen them turn up in my search component in the search handler that is configured with my custom QT. (I have a 'prepare' method that sets ShardPa

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-12 Thread Benson Margulies
On Thu, Apr 12, 2012 at 2:14 PM, Mark Miller wrote: > google must not have found it - i put that in a month or so ago I believe - > at least weeks. As you can see, there is still a bit to fill in, but it > covers the high level. I'd like to add example snippets for the rest soon. Mark, is it all

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-12 Thread Benson Margulies
I'm probably confused, but it seems to me that the case I hit does not meet any of Yonik's criteria. I have no replicas. I'm running SolrCloud in the simple mode where each doc ends up in exactly one place. I think that it's just a bug that the code refuses to do the local deletion when there's n

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-12 Thread Benson Margulies
urable just like the uniqueKey in the schema. > > schema.xml > > You must have a _version_ field defined: > > > > On Apr 11, 2012, at 9:10 AM, Benson Margulies wrote: > >> I didn't have a _version_ field, since nothing in the schema says that >> it&#x

Re: Default qt on SolrCloud

2012-04-11 Thread Benson Margulies
t;query" which presumably you've defined in solrconfig.xml... > > What does &debugQuery=on show? It turned out that I had left an extra(eous) declaration for /query with my custom RT, and when I removed it all was well. thanks,benson > > Best > Erick > > On Tue, A

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-11 Thread Benson Margulies
it > works again. > > On Tue, 2012-04-10 at 17:15 -0400, Benson Margulies wrote: >> In my cloud configuration, if I push >> >> >>   *:* >> >> >> followed by: >> >> >> >> I get no errors, the log looks happy enough, but t

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-11 Thread Benson Margulies
ere or back out your configs and plugins until it > works again. > > On Tue, 2012-04-10 at 17:15 -0400, Benson Margulies wrote: >> In my cloud configuration, if I push >> >> >>   *:* >> >> >> followed by: >> >> >> >> I ge

I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-10 Thread Benson Margulies
In my cloud configuration, if I push *:* followed by: I get no errors, the log looks happy enough, but the documents remain in the index, visible to /query. Here's what seems my relevant bit of solrconfig.xml. My URP only implements processAdd.

Default qt on SolrCloud

2012-04-10 Thread Benson Margulies
After I load documents into my cloud instance, a URL like: http://localhost:PORT/solr/query?q=*:* finds nothing. http://localhost:PORT/solr/query?q=*:*&qt=standard finds everything. My custom request handlers have 'default="false"'. What have I done?

Re: URP's versus Cloud

2012-04-10 Thread Benson Margulies
hat seems to imply that 'before' processors run both on the leader and on the shards. Where do the afters run? Just on the leader or just on the shards? > > > On Tue, 10 Apr 2012 12:43:36 -0400, Benson Margulies > wrote: >> >> How are URP's managed with respec

URP's versus Cloud

2012-04-10 Thread Benson Margulies
How are URP's managed with respect to cloud deployment? Given some solrconfig.xml like the below, do I expect it to be in the chain on the leader, the shards, or both? RNI

Re: SolrCloud versus a SearchComponent that rescores

2012-04-10 Thread Benson Margulies
I've updated the doc with my findings. Thanks for the pointer.

Re: SolrCloud versus a SearchComponent that rescores

2012-04-10 Thread Benson Margulies
Another thought: currently I'm using qt=ME to indicate this process. I could, in theory, use some ME=true and make my components check for it to avoid this process, but it seems kind of peculiar from an end-user standpoint.

Re: SolrCloud versus a SearchComponent that rescores

2012-04-10 Thread Benson Margulies
as well. Thanks. > > Sent from my iPhone > > On Apr 9, 2012, at 9:26 PM, Benson Margulies wrote: > >> Um, maybe I've hit a quirk? >> >> In my solrconfig.xml, my special SearchComponents are installed only >> for a specific QT. So, it looks to me as if

Re: SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Benson Margulies
Um, maybe I've hit a quirk? In my solrconfig.xml, my special SearchComponents are installed only for a specific QT. So, it looks to me as if that QT is not propagated into the request out to the shards, and so they run the ordinary request handler without my components in it. Is this intended beh

Re: SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Benson Margulies
That page seems to be saying that the 'distributed' APIs take place on the leader, and the ordinary prepare/process APIs out at the leaves. I'll set out to prove or disprove that tomorrow. On Mon, Apr 9, 2012 at 8:17 PM, Mark Miller wrote: > > On Apr 9, 2012, at 7:34 PM, B

SolrCloud versus a SearchComponent that rescores

2012-04-09 Thread Benson Margulies
Those of you insomniacs who have read my messages here over the last few weeks might recall that I've been working on a request handler that wraps the SearchHandler to rewrite queries and then reorder results. (I haven't quite worked out how to apply Grant's alternative suggestions without losing

Stumped on using a custom update request processor with SolrCloud

2012-04-09 Thread Benson Margulies
If you would be so kind as to look at https://issues.apache.org/jira/browse/SOLR-3342, you will see that I tried to use a working configuration for a URP of mine with SolrCloud, and received in return an NPE. Somehow or another, by default, the XmlUpdateRequestHandler ends up using (I think) the P

Re: Re: Cloud-aware request processing?

2012-04-09 Thread Benson Margulies
; do you have to re-write the query for each shard? Seems unnecessary. For reasons described in previous email that I won't repeat here. > > --- Original Message --- > On 4/9/2012  08:45 AM Benson Margulies wrote: Jan Høydahl, > > My problem is intimately connect

Is http://wiki.apache.org/solr/SolrCloud#Example_A:_Simple_two_shard_cluster up to date?

2012-04-09 Thread Benson Margulies
I specify -Dcollection.configName=rnicloud, but the admin gui tells me that I have a collection named 'collection1'. And, as reported in a prior email, the admin UI URL in there seems wrong.

Re: Cloud-aware request processing?

2012-04-09 Thread Benson Margulies
ook at Hadoop or another > framework for distributed computation, see e.g. > http://java.dzone.com/articles/comparison-gridcloud-computing > > -- > Jan Høydahl, search solution architect > Cominvent AS - www.cominvent.com > Solr Training - www.solrtraining.com > > On 9. apr. 2012,

'No JSP support' error in embedded Jetty for solrCloud as of apache-solr-4.0-2012-04-02_11-54-55

2012-04-09 Thread Benson Margulies
Starting the leader with: java -Dbootstrap_confdir=./solr/conf -Dcollection.configName=rnicloud -DzkRun -DnumShards=3 -Djetty.port=9167 -jar start.jar and browsing to http://localhost:9167/solr/rnicloud/admin/zookeeper.jsp I get: HTTP ERROR 500 Problem accessing /solr/rnicloud/admin/zookeep

Cloud-aware request processing?

2012-04-09 Thread Benson Margulies
I'm working on a prototype of a scheme that uses SolrCloud to, in effect, distribute a computation by running it inside of a request processor. If there are N shards and M operations, I want each node to perform M/N operations. That, of course, implies that I know N. Is that fact available anypla

Re: A curious request about a curious request handler

2012-04-03 Thread Benson Margulies
Grant, let me see if I can expand this, as it were: {!benson f1:v1 f2:v2 f3:v3} (or do I mean {!query defType='benson' ...}?) I see how that could expand to be anything else I like. However, the Function side has me a little more puzzled. The information from the fields inside my {! ... } gets

Re: A curious request about a curious request handler

2012-04-03 Thread Benson Margulies
On Tue, Apr 3, 2012 at 12:27 PM, Grant Ingersoll wrote: > > On Apr 3, 2012, at 9:43 AM, Benson Margulies wrote: > >> I've made a RequestHandler class that acts as follows: >> >> 1. At its initialization, it creates a StandardRequestHandler and hangs onto >>

A curious request about a curious request handler

2012-04-03 Thread Benson Margulies
I've made a RequestHandler class that acts as follows: 1. At its initialization, it creates a StandardRequestHandler and hangs onto it. 2. When a query comes to it (I configure it to a custom qt value), it: a. creates a new query based on the query that arrived b. creates a LocalSolrQueryReque

  1   2   >