Re: [VOTE] Community Logo Preferences

2008-11-27 Thread Andrzej Bialecki

https://issues.apache.org/jira/secure/attachment/12394268/apache_solr_c_red.jpg
https://issues.apache.org/jira/secure/attachment/12394350/solr.s4.jpg
https://issues.apache.org/jira/secure/attachment/12394376/solr_sp.png
https://issues.apache.org/jira/secure/attachment/12394267/apache_solr_c_blue.jpg


--
Best regards,
Andrzej Bialecki <><
 ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



RE: Deleting indices

2008-11-27 Thread Raghunandan Rao
Can anybody please let me know why is this happening? I have also tried
giving indexed="false" for name field in the xml. 

-Original Message-
From: Raghunandan Rao [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, November 26, 2008 6:18 PM
To: solr-user@lucene.apache.org
Subject: RE: Deleting indices

I have restarted and re-indexed all the docs after the change in the
schema.xml. I was able to search even after that. 

I hit browser with this url

http://localhost:7001/solr/select?q=name:2124&fl=*&debugQuery=true


-Original Message-
From: Shalin Shekhar Mangar [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, November 26, 2008 6:14 PM
To: solr-user@lucene.apache.org
Subject: Re: Deleting indices

Did you change your schema.xml after you indexed the documents?

On Wed, Nov 26, 2008 at 6:08 PM, Raghunandan Rao <
[EMAIL PROTECTED]> wrote:

> I am not sure. It is showing in the debug logs as below.
>
> 0.5967079 = (MATCH) fieldWeight(name:2124 in 0), product of: 1.4142135
=
> tf(termFreq(name:2124)=2) 1.1251632 = idf
>
> This is how I have done.
> I have a class NameDTO which contains all these variables. I have
@Field
> annotation to these fields. My schema.xml file contains as I have said
> below. Is that because I have @Field annotation it is indexing?
>
> -Original Message-
> From: Shalin Shekhar Mangar [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, November 26, 2008 5:47 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Deleting indices
>
> On Wed, Nov 26, 2008 at 5:01 PM, Raghunandan Rao <
> [EMAIL PROTECTED]> wrote:
>
> > Ok.
> > I have another query on search.
> >
> > I have given two fields to be indexed = true in xml.
> >
> > 
> >  > required="false"/>
> >  > required="false"/>
> >
> > But when I try to search on non-indexed "name" field it is able to
> retrieve
> > records. How is this possible?
> >
>
> It is not possible. There must be an indexed field which contained
that
> value.
>
> Execute your query from the web interface and add debugQuery=true to
> your
> search request to see details on how it is being searched.
> --
> Regards,
> Shalin Shekhar Mangar.
>



-- 
Regards,
Shalin Shekhar Mangar.


Re: omiting no price documents when sorting on price

2008-11-27 Thread joeMcElroy

was hoping there was an option to omit results with a certain value to stop
certain fields containing 0 skewing the search results.

i will try not sending the field if its 0. cheers!

joe


joeMcElroy wrote:
> 
> im sure this is an easy question but...
> 
> when a product doesn't have a price, I index the price as 0. When sorting
> on price, these values come up first or last. How can you omit these items
> when sorting against price. 
> 
> thanks
> 
> joe 
> 

-- 
View this message in context: 
http://www.nabble.com/omiting-no-price-documents-when-sorting-on-price-tp20703795p20716352.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: omiting no price documents when sorting on price

2008-11-27 Thread Erik Hatcher
You can omit documents.  I recommend doing it with a filter query.   
Append the following to your request to Solr:


&fq=-price:0

That does the trick?  You'll have to have client logic to only add  
that parameter when sorting by price, if that's how you want it to work.


Erik


On Nov 27, 2008, at 4:59 AM, joeMcElroy wrote:



was hoping there was an option to omit results with a certain value  
to stop

certain fields containing 0 skewing the search results.

i will try not sending the field if its 0. cheers!

joe


joeMcElroy wrote:


im sure this is an easy question but...

when a product doesn't have a price, I index the price as 0. When  
sorting
on price, these values come up first or last. How can you omit  
these items

when sorting against price.

thanks

joe



--
View this message in context: 
http://www.nabble.com/omiting-no-price-documents-when-sorting-on-price-tp20703795p20716352.html
Sent from the Solr - User mailing list archive at Nabble.com.




RE: Keyword extraction

2008-11-27 Thread Plaatje, Patrick
Hi Aleksander,

With all the help of you and the other comments, we're now at a point where a 
MoreLikeThis list is returned, and shows 10 related records. However on the 
query executed there are no keywords whatsoever being returned. Is the 
querystring still wrong or is something else required?

The querystring we're currently executing is:

http://suempnr3:8080/solr/select/?q=amsterdam&mlt.fl=text&mlt.displayTerms=list&mlt=true


Best,

Patrick 

-Original Message-
From: Aleksander M. Stensby [mailto:[EMAIL PROTECTED] 
Sent: woensdag 26 november 2008 15:07
To: solr-user@lucene.apache.org
Subject: Re: Keyword extraction

Ah, yes, That is important. In lucene, the MLT will see if the term vector is 
stored, and if it is not it will still be able to perform the querying, but in 
a much much much less efficient way.. Lucene will analyze the document (and the 
variable DEFAULT_MAX_NUM_TOKENS_PARSED will be used to limit the number of 
tokens that will be parsed). (don't want to go into details on this since I 
haven't really dug through the code:p) But when the field isn't stored either, 
it is rather difficult to re-analyze the
document;)

On a general note, if you want to "really" understand how the MLT works, take a 
look at the wiki or read this thorough blog post:  
http://cephas.net/blog/2008/03/30/how-morelikethis-works-in-lucene/

Regards,
  Aleksander

On Wed, 26 Nov 2008 14:41:52 +0100, Plaatje, Patrick <[EMAIL PROTECTED]> wrote:

> Hi Aleksander,
>
> This was a typo on my end, the original query included a semicolon 
> instead of an equal sign. But I think it has to do with my field not 
> being stored and not being identified as termVectors="true". I'm 
> recreating the index now, and see if this fixes the problem.
>
> Best,
>
> patrick
>
> -Original Message-
> From: Aleksander M. Stensby [mailto:[EMAIL PROTECTED]
> Sent: woensdag 26 november 2008 14:37
> To: solr-user@lucene.apache.org
> Subject: Re: Keyword extraction
>
> Hi there!
> Well, first of all i think you have an error in your query, if I'm not 
> mistaken.
> You say http://localhost:8080/solr/select/?q=id=18477975...
> but since you are referring to the field called "id", you must say:
> http://localhost:8080/solr/select/?q=id:18477975...
> (use colon instead of the equals sign).
> I think that will do the trick.
> If not, try adding the &debugQuery=on at the end of your request url, 
> to see debug output on how the query is parsed and if/how any 
> documents are matched against your query.
> Hope this helps.
>
> Cheers,
>   Aleksander
>
>
>
> On Wed, 26 Nov 2008 13:08:30 +0100, Plaatje, Patrick 
> <[EMAIL PROTECTED]> wrote:
>
>> Hi Aleksander,
>>
>> Thanx for clearing this up. I am confident that this is a way to 
>> explore for me as I'm just starting to grasp the matter. Do you know 
>> why I'm not getting any results with the query posted earlier then? 
>> It gives me the folowing only:
>>
>> 
>>   
>>
>> Instead of delivering details of the interestingTerms.
>>
>> Thanks in advance
>>
>> Patrick
>>
>>
>> -Original Message-
>> From: Aleksander M. Stensby [mailto:[EMAIL PROTECTED]
>> Sent: woensdag 26 november 2008 13:03
>> To: solr-user@lucene.apache.org
>> Subject: Re: Keyword extraction
>>
>> I do not agree with you at all. The concept of MoreLikeThis is based 
>> on the fundamental idea of TF-IDF weighting, and not term frequency 
>> alone.
>> Please take a look at:
>> http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/search/simi
>> l ar/MoreLikeThis.html As you can see, it is possible to use cut-off 
>> thresholds to significantly reduce the number of unimportant terms, 
>> and generate highly suitable queries based on the tf-idf frequency of 
>> the term, since as you point out, high frequency terms alone tends to 
>> be useless for querying, but taking the document frequency into 
>> account drastically increases the importance of the term!
>>
>> In solr, use parameters to manipulate your desired results:
>> http://wiki.apache.org/solr/MoreLikeThis#head-6460069f297626f2a982f1e
>> 2
>> 2ec5d1519c456b2c
>> For instance:
>> mlt.mintf - Minimum Term Frequency - the frequency below which terms 
>> will be ignored in the source doc.
>> mlt.mindf - Minimum Document Frequency - the frequency at which words 
>> will be ignored which do not occur in at least this many docs.
>> You can also set thresholds for term length etc.
>>
>> Hope this gives you a better idea of things.
>> - Aleks
>>
>> On Wed, 26 Nov 2008 12:38:38 +0100, Scurtu Vitalie 
>> <[EMAIL PROTECTED]>
>> wrote:
>>
>>> Dear Partick, I had the same problem with MoreLikeThis function.
>>>
>>> After  briefly reading and analyzing the source code of moreLikeThis 
>>> function in solr, I conducted:
>>>
>>> MoreLikeThis uses term vectors to ranks all the terms from a 
>>> document by its frequency. According to its ranking, it will start 
>>> to generate queries, artificially, and search for documents.
>>>
>>> So, moreLikeThis will retrieve

Re: Keyword extraction

2008-11-27 Thread Aleksander M. Stensby

Hi again Patrick.
Glad to hear that we can contribute to help you guys. Thats what this  
mailing list is for:)


First of all, I think you use the wrong parameter to get your terms.
Take a look at  
http://lucene.apache.org/solr/api/org/apache/solr/common/params/MoreLikeThisParams.html  
to see the supported params.
In your string you use mlt.displayTerms=list, which i believe should be  
mlt.interestingTerms=list.


If that doesn't work:
One thing you should know is that from what i can tell, you are using the  
StandardRequestHandler in your querying. The StandardRequestHandler  
supports a simplified handling of more like these queries, namely; "This  
method returns similar documents for each document in the response set."
it supports the common mlt parameters, needs mlt=true (as you have done)  
and supports a mlt.count parameter to specify the number of similar  
documents returned for each matching doc from your query.


If you want to get the "top keywords" etc, (and in essence your  
mlt.interestingTerms=list parameter to have any effect at all, if I'm not  
completely wrong), you will need to configure up a MoreLikeThisHandler in  
your solrconfig.xml and then map that to your query.


From the sample configuration file:
	incoming queries will be dispatched to the correct handler based on the  
path or the qt (query type) param. Names starting with a '/' are accessed  
with the a path equal to the registered name.  Names without a leading '/'  
are accessed with: http://host/app/select?qt=name If no qt is defined, the  
requestHandler that declares default="true" will be used.


You can read about the MoreLikeThisHandler here:  
http://wiki.apache.org/solr/MoreLikeThisHandler


Once you have it configured properly your query would be something like:
http://localhost:8983/solr/mlt?q=amsterdam&mlt.fl=text&mlt.interestingTerms=list&mlt=true  
(don't think you need the mlt=true here tho...)

or
http://localhost:8983/solr/select?qt=mlt&q=amsterdam&mlt.fl=text&mlt.interestingTerms=list&mlt=true
(in the last example I use qt=mlt)

Hope this helps.
Regards,
 Aleksander


On Thu, 27 Nov 2008 11:49:30 +0100, Plaatje, Patrick  
<[EMAIL PROTECTED]> wrote:



Hi Aleksander,

With all the help of you and the other comments, we're now at a point  
where a MoreLikeThis list is returned, and shows 10 related records.  
However on the query executed there are no keywords whatsoever being  
returned. Is the querystring still wrong or is something else required?


The querystring we're currently executing is:

http://suempnr3:8080/solr/select/?q=amsterdam&mlt.fl=text&mlt.displayTerms=list&mlt=true


Best,

Patrick

-Original Message-
From: Aleksander M. Stensby [mailto:[EMAIL PROTECTED]
Sent: woensdag 26 november 2008 15:07
To: solr-user@lucene.apache.org
Subject: Re: Keyword extraction

Ah, yes, That is important. In lucene, the MLT will see if the term  
vector is stored, and if it is not it will still be able to perform the  
querying, but in a much much much less efficient way.. Lucene will  
analyze the document (and the variable DEFAULT_MAX_NUM_TOKENS_PARSED  
will be used to limit the number of tokens that will be parsed). (don't  
want to go into details on this since I haven't really dug through the  
code:p) But when the field isn't stored either, it is rather difficult  
to re-analyze the

document;)

On a general note, if you want to "really" understand how the MLT works,  
take a look at the wiki or read this thorough blog post:

http://cephas.net/blog/2008/03/30/how-morelikethis-works-in-lucene/

Regards,
  Aleksander

On Wed, 26 Nov 2008 14:41:52 +0100, Plaatje, Patrick  
<[EMAIL PROTECTED]> wrote:



Hi Aleksander,

This was a typo on my end, the original query included a semicolon
instead of an equal sign. But I think it has to do with my field not
being stored and not being identified as termVectors="true". I'm
recreating the index now, and see if this fixes the problem.

Best,

patrick

-Original Message-
From: Aleksander M. Stensby [mailto:[EMAIL PROTECTED]
Sent: woensdag 26 november 2008 14:37
To: solr-user@lucene.apache.org
Subject: Re: Keyword extraction

Hi there!
Well, first of all i think you have an error in your query, if I'm not
mistaken.
You say http://localhost:8080/solr/select/?q=id=18477975...
but since you are referring to the field called "id", you must say:
http://localhost:8080/solr/select/?q=id:18477975...
(use colon instead of the equals sign).
I think that will do the trick.
If not, try adding the &debugQuery=on at the end of your request url,
to see debug output on how the query is parsed and if/how any
documents are matched against your query.
Hope this helps.

Cheers,
  Aleksander



On Wed, 26 Nov 2008 13:08:30 +0100, Plaatje, Patrick
<[EMAIL PROTECTED]> wrote:


Hi Aleksander,

Thanx for clearing this up. I am confident that this is a way to
explore for me as I'm just starting to grasp the matter. Do you know
why I'm not getting any results with the quer

SolrException: HTTP code=503

2008-11-27 Thread dudes dudes
















Hello all, 

I'm having this issue bellow .. can some one tell me what I have to change in 
the config file to overcome this issue ?


SolrException: HTTP code=503, reason=Error opening new searcher. exceeded

limit of maxWarmingSearchers=4, try again later.



thanks a lot 
Ak


_
See the most popular videos on the web 
http://clk.atdmt.com/GBL/go/115454061/direct/01/

Re: range queries on string field with millions of values

2008-11-27 Thread Alexander Ramos Jardim
I did not even understand what you are considering to be the order on your
call numbers.

2008/11/26 Naomi Dushay <[EMAIL PROTECTED]>

> I have a performance problem and I haven't thought of a clever way around
> it.
>
> I work at the Stanford University Libraries.  We have a collection of over
> 8 million items.  Each item has a call number.  I have been asked to provide
> a way to browse forward and backward from an arbitrary call number.
>
> I have managed to create a fields that present the call numbers in
> appropriate sorts, both forward and reverse.  (This is necessary because raw
> call numbers don't sort properly:   A123 AZ27 B99 B999 BBB11).
>
> We can ignore the reverse sorted range query problem;  it's the same as the
> forward sorted range query.
>
> So I use a query like this:
>
> sortCallNum["A123 B34 1970" TO *]&rows=10.
>
>
> Call numbers are squirrelly, so we can't predict the string that will
> appropriately grab at least 10 subsequent documents.  They are certainly not
> consecutive!
>
> so from
> A123 B34 1970
>
> we're unable to predict if any of these will return at least 10 results:
>
> A123 B34 1980  or
> A123 B34 V.8  or
> A123 B44 or
> A123 B67 or
> A123 C27 or
> A124* or
> A22* or
> AA* or
>
> You get the idea.
>
> I have not figured out a way to efficiently query for "the next 10 call
> numbers in sort order".  I have also mucked about with the cache
> initialization, but that's not working either:
>
>
>  
>
>
>shelfkey:[0 TO *]
>shelfkey asc
>
>  
>
> Can anyone help me with this?
>
> - Naomi
>
>


-- 
Alexander Ramos Jardim


Re: facet.sort and distributed search

2008-11-27 Thread Grégoire Neuville
Hi,

Thanks a lot for your answer : I've tried the patch and it works fine.

regards,

2008/11/27 Aleksander M. Stensby <[EMAIL PROTECTED]>:
> This is a known issue but take a look at the following jira issue and the
> patch supplied there:
> https://issues.apache.org/jira/browse/SOLR-764
>
> Haven't tried it myself, but i believe it should do the trick for you.
> Hope that helps.
>
> Cheers,
>  Aleksander
>
> On Wed, 26 Nov 2008 22:53:21 +0100, Grégoire Neuville
> <[EMAIL PROTECTED]> wrote:
>
>> Hi,
>>
>> I'm working on an web application one functionality of which consists
>> in presenting to the user a list of terms to seize in a form field,
>> sorted alphabetically. As long as one single index was concerned, I
>> used solr facets to produce the list and it worked fine. But I must
>> now deal with several indices,  and thus use the distributed search
>> capability of solr, which forbid the use of "facet.sort=false".
>>
>> I would like to know if someone plans to, or is even working on, the
>> implementation of the natural facet sorting in case of a distributed
>> search.
>>
>> Thanks a lot,
>
>
>
> --
> Aleksander M. Stensby
> Senior software developer
> Integrasco A/S
> www.integrasco.no
>



-- 
Grégoire Neuville


Re: Deleting indices

2008-11-27 Thread Mike Klaas


On 26-Nov-08, at 4:48 AM, Raghunandan Rao wrote:


I have restarted and re-indexed all the docs after the change in the
schema.xml. I was able to search even after that.

I hit browser with this url

http://localhost:7001/solr/select?q=name:2124&fl=*&debugQuery=true


Are you sure all the old docs are gone?  Try wiping the index  
completely:


deleteByQuery *:*

(it is also more efficient to do this first if you are going to re- 
index everything).


-Mike


Mock solr server

2008-11-27 Thread Robert Young
Hi,

Does anyone know of an easy to use Mock solr server?

Thanks
Rob


Re: Mock solr server

2008-11-27 Thread Jeryl Cook
are you trying to unit test something? I would simply make use  of the
Embedded SOLR component in your unit tests..

On 11/27/08, Robert Young <[EMAIL PROTECTED]> wrote:
> Hi,
>
> Does anyone know of an easy to use Mock solr server?
>
> Thanks
> Rob
>


-- 
Jeryl Cook
/^\ Pharaoh /^\
http://pharaohofkush.blogspot.com/
"Whether we bring our enemies to justice, or bring justice to our
enemies, justice will be done."
--George W. Bush, Address to a Joint Session of Congress and the
American People, September 20, 2001


RE: Deleting indices

2008-11-27 Thread Raghunandan Rao
Yes, I am sure. Everytime I re-index, I do deleteByQuery("*:*") and then
fetch the rows from DB and call server.addBeans(list). 

-Original Message-
From: Mike Klaas [mailto:[EMAIL PROTECTED] 
Sent: Friday, November 28, 2008 4:30 AM
To: solr-user@lucene.apache.org
Subject: Re: Deleting indices


On 26-Nov-08, at 4:48 AM, Raghunandan Rao wrote:

> I have restarted and re-indexed all the docs after the change in the
> schema.xml. I was able to search even after that.
>
> I hit browser with this url
>
> http://localhost:7001/solr/select?q=name:2124&fl=*&debugQuery=true

Are you sure all the old docs are gone?  Try wiping the index  
completely:

deleteByQuery *:*

(it is also more efficient to do this first if you are going to re- 
index everything).

-Mike


Re: [VOTE] Community Logo Preferences

2008-11-27 Thread Shrutipriya
https://issues.apache.org/jira/secure/attachment/12394268/apache_solr_c_red.jpg
https://issues.apache.org/jira/secure/attachment/12394282/solr2_maho_impression.png
https://issues.apache.org/jira/secure/attachment/12394267/apache_solr_c_blue.jpg

regards,
shrutipriya