performance implications on using lots of values in fq

2008-07-23 Thread briand

I have documents in SOLR such that each document contains one or more points
(latitude/longitude pairs). Currently we store the points for a given
document in the db and first query the db to find the ids of all documents
around a given point. Once we have the list of ids, we populate the fq with
those ids, set the q value, and send the request to SOLR to do the search.
In the "longest" query to SOLR so far, we're populating about 450 ids into
the fq parameter. Does anyone know the performance implications of passing
so many ids in the fq, and at what point it could become a problem for
SOLR? Currently the query passing in 450 ids is not a problem at all and
returns in less than a second. Thanks.
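
The request described above can be sketched as follows. This is a minimal
illustration in Python, not the poster's actual code: the base URL, the
function name build_select_url, and the id field name "id" are all
assumptions, and the ids are assumed to be plain numeric or alphanumeric
tokens that need no query-syntax escaping. Each id becomes one OR'd clause in
the fq, so the list length is worth comparing against the maxBooleanClauses
setting in solrconfig.xml (1024 in the stock configuration).

```python
from urllib.parse import urlencode


def build_select_url(base_url, user_query, doc_ids, id_field="id"):
    """Return a Solr select URL that restricts user_query to doc_ids via fq.

    base_url and id_field are hypothetical; adjust them to the real core
    URL and to the unique-key field declared in schema.xml.
    """
    if not doc_ids:
        raise ValueError("doc_ids must not be empty")
    # One OR'd clause per id, e.g. id:(101 OR 102 OR 103)
    id_clause = " OR ".join(str(doc_id) for doc_id in doc_ids)
    params = [
        ("q", user_query),
        ("fq", f"{id_field}:({id_clause})"),
    ]
    # urlencode percent-escapes ':' and parentheses and turns spaces into '+'
    return f"{base_url}/select?{urlencode(params)}"


url = build_select_url("http://localhost:8983/solr", "food", [101, 102, 103])
```

Solr caches filter results separately from the main query, so sending the
same id list again with a different q can reuse the cached filter, while a
different id list produces a new cache entry.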
-- 
View this message in context: 
http://www.nabble.com/performance-implications-on-using-lots-of-values-in-fq-tp18617397p18617397.html
Sent from the Solr - User mailing list archive at Nabble.com.



NPE on auto-warming and out of memory issues

2007-10-19 Thread briand

We are experiencing OOM issues with a SOLR index that has about 12GB of
indexed data and 2GB allocated to the JVM. We first see messages of this
type in the log:

Oct 18, 2007 10:25:00 AM org.apache.solr.core.SolrException log
SEVERE: Error during auto-warming of key:+(search_place_type:citi search_place_type:attract search_place_type:airport search_place_type:univers):java.lang.OutOfMemoryError: Java heap space

Oct 18, 2007 10:25:07 AM org.apache.solr.core.SolrException log
SEVERE: Error during auto-warming of key:+search_place_type:busi:java.lang.OutOfMemoryError: Java heap space

After a long run of messages like the ones above, we see a series of
messages like this:

Oct 18, 2007 10:55:58 AM org.apache.solr.core.SolrException log
SEVERE: Error during auto-warming of key:[EMAIL PROTECTED]:java.lang.NullPointerException
    at org.apache.lucene.index.Term.compareTo(Term.java:91)
    at org.apache.lucene.index.TermInfosReader.getIndexOffset(TermInfosReader.java:112)
    at org.apache.lucene.index.TermInfosReader.get(TermInfosReader.java:147)
    at org.apache.lucene.index.SegmentReader.docFreq(SegmentReader.java:342)
    at org.apache.lucene.index.MultiReader.docFreq(MultiReader.java:220)
    at org.apache.lucene.search.IndexSearcher.docFreq(IndexSearcher.java:87)
    at org.apache.lucene.search.Similarity.idf(Similarity.java:459)
    at org.apache.lucene.search.TermQuery$TermWeight.<init>(TermQuery.java:44)
    at org.apache.lucene.search.TermQuery.createWeight(TermQuery.java:145)
    at org.apache.lucene.search.DisjunctionMaxQuery$DisjunctionMaxWeight.<init>(DisjunctionMaxQuery.java:99)
    at org.apache.lucene.search.DisjunctionMaxQuery.createWeight(DisjunctionMaxQuery.java:161)
    at org.apache.lucene.search.BooleanQuery$BooleanWeight.<init>(BooleanQuery.java:187)
    at org.apache.lucene.search.BooleanQuery$BooleanWeight2.<init>(BooleanQuery.java:342)
    at org.apache.lucene.search.BooleanQuery.createWeight(BooleanQuery.java:384)
    at org.apache.lucene.search.Query.weight(Query.java:95)
    at org.apache.lucene.search.Searcher.createWeight(Searcher.java:171)
    at org.apache.lucene.search.Searcher.search(Searcher.java:118)
    at org.apache.lucene.search.Searcher.search(Searcher.java:97)
    at org.apache.solr.search.SolrIndexSearcher.getDocListNC(SolrIndexSearcher.java:888)
    at org.apache.solr.search.SolrIndexSearcher.getDocListC(SolrIndexSearcher.java:805)
    at org.apache.solr.search.SolrIndexSearcher.access$100(SolrIndexSearcher.java:60)
    at org.apache.solr.search.SolrIndexSearcher$2.regenerateItem(SolrIndexSearcher.java:251)
    at org.apache.solr.search.LRUCache.warm(LRUCache.java:193)
    at org.apache.solr.search.SolrIndexSearcher.warm(SolrIndexSearcher.java:1385)
    at org.apache.solr.core.SolrCore$1.call(SolrCore.java:488)
    at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:269)
    at java.util.concurrent.FutureTask.run(FutureTask.java:123)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:650)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:675)
    at java.lang.Thread.run(Thread.java:595)

We have our configuration set for autowarmCount to:







It appears that autowarming has something to do with our OOM. I've read
that if you set the autowarmCount to 0 or a lower value, the first request
after a commit may take some time. We're definitely willing to try setting
the autowarmCount to a lower value. Does anyone have any other ideas to
help with this autowarming issue? Thanks.



restricting search to a set of documents

2007-11-07 Thread briand

I need to perform a search against a limited set of documents. I have the
set of document ids; what is the best way to formulate the query to SOLR?



querying with two words returns less results when ORing terms

2008-04-28 Thread briand

Searching with the query 'food' returns 572 results. When I search for
'food drink' I expect more results, since I thought it was defaulting to
ORing the search terms, but I now get back only 6 results. I'm sure I'm
doing something stupid, but I don't know what. I am using a
DismaxQueryHandler for this and boosting certain fields. I've included
some of the debug info for the 'food drink' results. Thanks.

rawquerystring: food drink

querystring: food drink

parsedquery:
+((DisjunctionMaxQuery((summary:food^2.0 | places:food^2.5 | tags:food^10.0
| content:food | name:food^15.0 | author:food^10.0 |
locations:food^5.0)~0.01) DisjunctionMaxQuery((summary:drink^2.0 |
places:drink^2.5 | tags:drink^10.0 | content:drink | name:drink^15.0 |
author:drink^10.0 | locations:drink^5.0)~0.01))~2)
DisjunctionMaxQuery((summary:"food drink"~2^2.0 | places:"food drink"~2^2.5
| tags:"food drink"~2^10.0 | content:"food drink"~2 | name:"food
drink"~2^15.0 | author:"food drink"~2^10.0 | locations:"food
drink"~2^5.0)~0.01)





Re: querying with two words returns less results when ORing terms

2008-04-28 Thread briand

It was set to AND as the default, so I changed it to

  <solrQueryParser defaultOperator="OR"/>

and restarted SOLR, but I'm still seeing the same results. The debug output
looks the same as well.


Alok K. Dhir wrote:
> 
> check for
> 
>   
> 
> in schema.xml
> 
> 
> 
> 




Re: querying with two words returns less results when ORing terms

2008-04-28 Thread briand

Yep, setting mm=1 did the trick for us. Thanks!
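
To make the effect of mm concrete, here is a small sketch in Python of
"minimum should match" over a toy inverted index. It is only an
illustration of the idea, not Solr's implementation: the function name
matching_docs and the index contents are made up. In the debug output
posted earlier in this thread, the ~2 after the two per-term clauses appears
to correspond to mm=2 below, meaning both words had to match.

```python
def matching_docs(index, terms, mm):
    """Return ids of documents that contain at least `mm` of `terms`."""
    counts = {}
    for term in set(terms):
        for doc_id in index.get(term, set()):
            counts[doc_id] = counts.get(doc_id, 0) + 1
    return {doc_id for doc_id, hits in counts.items() if hits >= mm}


# Toy inverted index: term -> ids of documents containing it (made-up data).
index = {
    "food": {1, 2, 3, 4},
    "drink": {3, 4, 5},
}

# mm=2: every term must match, so adding a word can only shrink the result.
both = matching_docs(index, ["food", "drink"], mm=2)
# mm=1: any one term is enough, so adding a word can only grow the result.
either = matching_docs(index, ["food", "drink"], mm=1)
```

With this data, requiring both terms keeps only the documents in the
intersection, while mm=1 returns the union, which is the OR-like behaviour
the original question expected.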


Howard Lee-2 wrote:
> 
> I don't think the defaultOperator applies to Dismax in the same way as the
> Standard Request Handler. The Dismax handler uses the mm (Minimum 'Should'
> Match) parameter to specify how many words should match the query. You
> may need to change this to get it working the way you want it.
> 
> http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=%28CategorySolrRequestHandler%29%7C%28%28CategorySolrRequestHandler%29%29#head-6c5fe41d68f3910ed544311435393f5727408e61
> 
> Howard
> 
> 
> 




stemming the synonyms

2008-04-28 Thread briand

I'm using the synonyms text file so that when you enter 'travel' you find
all things related to vacation. However, when I enter 'travelling' it does
not find anything related to vacation. I assume that's because I'm not
explicitly putting 'travelling' in the synonyms file. Is there a way to
activate stemming for all of the synonym terms in the file without having to
manually put 'travel', 'travelling', and 'travelers' in the synonym file?
Thanks.
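
The mismatch described above can be reproduced with a small sketch in
Python. Everything here is hypothetical: toy_stem is a deliberately crude
suffix stripper standing in for a real stemmer, and the synonym table is
made up. The point is only that an exact-match synonym lookup misses
inflected forms unless the synonym keys and the incoming token are reduced
to the same stem first. How that maps onto the order of filters in a Solr
analyzer is an assumption and is not confirmed anywhere in this thread.

```python
SUFFIXES = ("ing", "ers", "er", "s")  # toy rules, NOT a real stemmer


def toy_stem(word):
    """Strip one known suffix, then collapse a doubled final letter."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[: -len(suffix)]
            # "travelling" -> "travell" -> "travel"
            if len(word) >= 2 and word[-1] == word[-2]:
                word = word[:-1]
            break
    return word


def build_stemmed_synonyms(synonyms):
    """Key the synonym table by stem so inflected forms share one entry."""
    return {
        toy_stem(term): [toy_stem(e) for e in expansions]
        for term, expansions in synonyms.items()
    }


def expand(token, stemmed_synonyms):
    """Stem the token, then look the stem up in the stemmed synonym table."""
    stem = toy_stem(token)
    return [stem] + stemmed_synonyms.get(stem, [])


synonyms = {"travel": ["vacation"]}  # only the base form is listed
table = build_stemmed_synonyms(synonyms)

exact_hit = "travelling" in synonyms      # plain lookup misses the variant
expanded = expand("travelling", table)    # stemmed lookup finds the synonym
```

Listing only 'travel' is then enough for 'travelling' and 'travelers' to
pick up the same synonym, provided both sides go through the same stemmer.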