Re: new user: some questions about parameters and query syntax

2008-05-30 Thread Chris Hostetter
: For date faceting, count missing the order doesn't matter either, and : there it's given as a comma-separated list. Either you are mistaken, or i don't understand your statement. date faceting works just like field faceting -- no comma seperated lists. : I think it makes sense doing everythi

RE: Issuing queries during analysis?

2008-05-30 Thread Dallan Quass
Hi Grant, > Can you describe your indexing process a bit more? Do you > just have one or two tokens that you have "translate" or is > it that you are going to query on every token in your text? > I just don't see how that will perform at all to look up > every token in some index, so maybe i

RE: Issuing queries during analysis?

2008-05-30 Thread Dallan Quass
> Dallas, got money to spend on solving this problem? I > believe this is something that tools like LingPipe can solve > through language model training and named entity extraction. Hi Otis, Thank-you for your reply. I'm familiar with tools like LingPipe, but this problem is actually *much* s

RE: Issuing queries during analysis?

2008-05-30 Thread Dallan Quass
> Can you describe your indexing process a bit more? Do you > just have one or two tokens that you have "translate" or is > it that you are going to query on every token in your text? > I just don't see how that will perform at all to look up > every token in some index, so maybe if we have s

Re: Re[2]: "null" in admin page

2008-05-30 Thread Chris Hostetter
: It surely comes on the example, as I got this problem all times I get the : example, and I have to remove the file multicore.xml or I get the error. something is wrong then. if yo uare runing "java -jar start.jar" in the "example" directory then "example/solr" will be used as your solr home

Re: Delete by multiple query doesn't seem to work

2008-05-30 Thread Chris Hostetter
: Thanks for the suggestion. It didn't do anything. FWIW: If you have a query (any query) that returns result when you use the sstandard request handler, but does not delete any docs when sent in a ... update command, then that is a bug. A reproducable test case filed in Jira would be apprecia

Re: highlighting and hyperlink

2008-05-30 Thread Mike Klaas
On 30-May-08, at 2:25 PM, Kevin Xiao wrote: Hi I am not sure if there are any discussions about this, I could not find the search function in mailing list archives. :) Anyway, here is my problem: In my document, I have a hyperlink, say, breast cancer, but when I applied solr highlighti

highlighting and hyperlink

2008-05-30 Thread Kevin Xiao
Hi I am not sure if there are any discussions about this, I could not find the search function in mailing list archives. :) Anyway, here is my problem: In my document, I have a hyperlink, say, breast cancer, but when I applied solr highlighting on search term 'cancer', that hyperlink becomes: c

Re: new user: some questions about parameters and query syntax

2008-05-30 Thread Mike Klaas
On 29-May-08, at 11:22 AM, Bram de Jong wrote: On Thu, May 29, 2008 at 6:40 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: I haven't been paying close attention to the uniformity of URL parameters, but if there is room for making them more uniform (e.g. always use singular, always use comma

Re: Solr indexing configuration help

2008-05-30 Thread Yonik Seeley
Some things to try: - turn off autowarming on the master - turn off autocommit, unless you really need it, or change it to be less agressive: autocommitting every 50 docs is bad if you are rapidly adding documents. - set maxWarmingSearchers to 1 to prevent the buildup of searchers -Yonik On Fri,

Re: Solr indexing configuration help

2008-05-30 Thread Gaku Mak
I started running the test on 2 other machines with similar specs but more RAM (4G). One of them now has about 60k docs and still running fine. On the other machine, solr died at about 43k docs. A short while before solr died, I saw that there were 5 searchers at the same time. Do any of you know

RE: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-30 Thread Julio Castillo
I'm sorry Shalin, but I still get the same Null Pointer exception. This is my complete dataconfig.xml (I remove the parallel entity to narrow down the scope of the problem). Thanks again. ** julio -Original Message- From: Shalin Shekhar Mangar

Re: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-30 Thread Shalin Shekhar Mangar
The surname is used just as an example of a field. The NullPointerException is because the same field "id" tries to use it's own value in a template. The template cannot contain the same field on which it is being applied. I'd suggest that you get the id aliased to another name, for example using

Re: wildcard highlighting

2008-05-30 Thread Mike Klaas
On 30-May-08, at 6:45 AM, Stefan Oestreicher wrote: Hi, I've started to play around with Solr and I'm quite impressed with its performance and features. However it seems to me that highlighting of wildcard terms is not supported, which is somewhat disappointing. Are there any plans to suppor

RE: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-30 Thread Lance Norskog
You might try creating your whole transform as an SQL database view rather than with the Solr transformer toolkit. This would also make it easier to directly examine the data to be indexed. Lance -Original Message- From: Julio Castillo [mailto:[EMAIL PROTECTED] Sent: Thursday, May 29, 20

NullPointerException at lucene.analysis.StopFilter with 1.3

2008-05-30 Thread Ronald K. Braun
Hi, I'm just looking into transitioning from solr 1.2 to 1.3 (trunk). I have some legacy handler code (called "AdvancedRequestHandler") that used to work with 1.2 but now throws an exception using 1.3 (latest nightly build). The exception is this: HTTP Status 500 - null java.lang.NullPointerExce

RE: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-30 Thread Julio Castillo
Thanks for all the leads. I did get however a null pointer exception while implementing it: May 30, 2008 9:57:50 AM org.apache.solr.handler.dataimport.EntityProcessorBase applyTransformer WARNING: transformer threw error java.lang.NullPointerException at org.apache.solr.handler.dataimport.Templ

Re: Solr Collection Distribution on AIX

2008-05-30 Thread Bill Au
I know people have made changes to the scripts in order for them to work correctly on the UNIX variant that they are running on. Has anyone done this for AIX? Bill On Fri, May 30, 2008 at 10:38 AM, tglawles <[EMAIL PROTECTED]> wrote: > > I've found that the scripts provided with Solr do not run

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Jonathan Ariel
Ok so with this patch you have something like this. facet=true&facet.field=weight&f.weight.facet.statistical=true This will bring the regular facet info, like: 5 2 1 and then will add the following: 3 1 1.5 0.7559289460 0.5039526307 So you get the max and min value as well as the CV

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Uri Boness
I guess, to generalize the idea, is to have some support for aggregation functions. average anyone ;-) ? It would also be very useful to be able to define the field that is being aggregated. For example, in a flight reservation web site we developed we needed to show facets on different flight

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Jonathan Ariel
yeap. forgot to mention that. well I didn't add support for multivalued fields yet, it might be interesting. On Fri, May 30, 2008 at 1:05 PM, Francisco Sanmartin <[EMAIL PROTECTED]> wrote: > Don't forget to mention Jonathan that the complexity of the algorithm is > not changed at all. It reads ju

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Francisco Sanmartin
Don't forget to mention Jonathan that the complexity of the algorithm is not changed at all. It reads just once the value of the field for each document selected, so no extra loops either in or outside the main loop ;). Pako Jonathan Ariel wrote: Well, this is the first version of the patch.

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Otis Gospodnetic
Don't know what exactly I mean(t), and can't think of some appropriate function, but my thinking was roughly: but what if we want to function foo, and function foo has not been implemented. Can I write my own foo function and plug it in somewhere? I'm theorizing though... I'll look at the patc

Re: Issuing queries during analysis?

2008-05-30 Thread Otis Gospodnetic
Dallas, got money to spend on solving this problem? I believe this is something that tools like LingPipe can solve through language model training and named entity extraction. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Dallan Quass <

Re: Solr Collection Distribution on AIX

2008-05-30 Thread Otis Gospodnetic
This may be a long-term solution: https://issues.apache.org/jira/browse/SOLR-561 No code yet, though. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: tglawles <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Friday, May 30, 20

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Jonathan Ariel
Well, this is the first version of the patch. https://issues.apache.org/jira/browse/SOLR-588 Give me your feedback so we can make it better. Thanks! On Fri, May 30, 2008 at 10:36 AM, Jonathan Ariel <[EMAIL PROTECTED]> wrote: > What do you mean? Right now you send an extra parameter > f.weight.f

Re: threshold of result rankings

2008-05-30 Thread Francisco Sanmartin
I've done that already. All you need to do is to create your custom request handler. My handler, among other things, what it does is the following: It receives a factor threshold, such as 0.85. This means that the score of the first document returned will be the assumed as the "best" matching

Re: Issuing queries during analysis?

2008-05-30 Thread Grant Ingersoll
On May 30, 2008, at 10:22 AM, Dallan Quass wrote: this may sound a bit too KISS - but another approach could be based on synonyms, i.e. if the number of abbreviation is limited and defined ("All US States"), you can simply define complete state name for each abbreviation, this way a "Chicago, I

Solr Collection Distribution on AIX

2008-05-30 Thread tglawles
I've found that the scripts provided with Solr do not run on AIX. From other posts, it seems is due to the UNIX commands the scripts utilize. Does anyone have any suggestions or experience on the best way to distribute solr collections on UNIX systems where the provided scripts do not run? -- V

RE: Issuing queries during analysis?

2008-05-30 Thread Dallan Quass
> this may sound a bit too KISS - but another approach could be > based on synonyms, i.e. if the number of abbreviation is > limited and defined ("All US States"), you can simply define > complete state name for each abbreviation, this way a > "Chicago, IL" will be "translated" (...) in "Chicag

threshold of result rankings

2008-05-30 Thread Umar Shah
Hi, is there some way of limiting the results above some fixed threshold? thanks in anticipation -umar

wildcard highlighting

2008-05-30 Thread Stefan Oestreicher
Hi, I've started to play around with Solr and I'm quite impressed with its performance and features. However it seems to me that highlighting of wildcard terms is not supported, which is somewhat disappointing. Are there any plans to support that or did I miss something? best regards, Stefan Oe

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Jonathan Ariel
What do you mean? Right now you send an extra parameter f.weight.facet.statistical=true and it will add statistical information to the facet field response. What I don't like is that I made the the changes to the SimpleFacet class itself, it would nicer to have the ability to specify a different s

Re: Getting maximum and minimum values of a field

2008-05-30 Thread Otis Gospodnetic
Was thinking... how about making those functions pluggable? Doable? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Friday, May 30, 2008 3:56:14 AM > Subject:

Re: new user: some questions about parameters and query syntax

2008-05-30 Thread Erik Hatcher
On May 30, 2008, at 2:40 AM, Otis Gospodnetic wrote: Right, the only actively developed Solr client is really Solrj. All other ones are not well maintained - I don't recall seeing any patches for any of them in the recent months Ummm solr-ruby! It doesn't need any patches ;) It's u

Re: Issuing queries during analysis?

2008-05-30 Thread Walter Ferrara
Dallan Quass wrote: I have a situation where it would be beneficial to issue queries in a filter that is called during analysis. In a nutshell, I have an index of places that includes possible abbreviations. And I want to query this index during analysis to convert user-entered places to "stand

Re: simple ui?

2008-05-30 Thread Karl Wettin
28 maj 2008 kl. 14.15 skrev Erik Hatcher: On May 28, 2008, at 2:34 AM, Karl Wettin wrote: It would be perfect if all I had to do was to define a couple of facet fields, a default text query field and some title/body/class type to render the results. Is there such a formula 1A JSP/servlet

Re: Getting maximum and minimum values of a field

2008-05-30 Thread solr
I know that I would definately be interested in that. :) /Jimi Quoting Jonathan Ariel <[EMAIL PROTECTED]>: Ok. So I have a version of solr with a small modification to the SimpleFacet class where you can send a parameter to tell that you want some more info. It'll bring back a list with the ma

Re: Sorting in different languages

2008-05-30 Thread Uwe Klosa
Well, thats what worries me, too. On Fri, May 30, 2008 at 8:46 AM, Otis Gospodnetic < [EMAIL PROTECTED]> wrote: > I see. That patch is not in Lucene yet, and it looks like *nobody* voted > for it. If you like it, please vote for it. > Personally seeing a mention of higher memory usage in that p