OutOfMemoryError coming from TermVectorsReader

2011-09-18 Thread Anand.Nigam
Hi, I am new to Solr. I am trying to index large text documents. When searching the indexed documents I get the following OutOfMemoryError. Please help me resolve this issue. The field which stores file content is configured in schema.xml as below: and Highlighting is configu

Re: Upgrading solr from 3.3 to 3.4

2011-09-18 Thread Wyhw Whon
If you are already using Apache Lucene 3.1, 3.2 or 3.3, we strongly recommend you upgrade to 3.4.0 because of the index corruption bug on OS or computer crash or power loss (LUCENE-3418), now fixed in 3.4.0. 2011/9/19 Isan Fulia > Hi all, > > Does upgrading solr from 3.3 to 3.4 requires reindexi

Re: indexing data from rich documents - Tika with solr3.1

2011-09-18 Thread scorpking
Hi Erik Hatcher-4, I tried indexing from your URL, but I have a problem. In your case, you knew the files' absolute path (Dir.new("/Users/erikhatcher/apache-solr-3.3.0/docs")), so you could index them. In my case, I don't know the files' absolute path. I only know the HTTP address where the files are (ex: you can se

Upgrading solr from 3.3 to 3.4

2011-09-18 Thread Isan Fulia
Hi all, Does upgrading Solr from 3.3 to 3.4 require reindexing all the cores, or can we directly copy the data folders to the new Solr? -- Thanks & Regards, Isan Fulia.

Re: what is the default value of omitNorms and termVectors in solr schema

2011-09-18 Thread Erik Hatcher
From Solr's example schema.xml: "omitTermFreqAndPositions attribute introduced, true by default except for text fields" (meaning TextField's) And term vectors most definitely are false by default. Also from the example schema.xml: "termVectors: [false] set to true to store the term vector for a

what is the default value of omitNorms and termVectors in solr schema

2011-09-18 Thread roz dev
Hi, As per this document, http://wiki.apache.org/solr/FieldOptionsByUseCase, omitNorms and termVectors have to be "explicitly" specified in some cases. I am wondering what the default value of these settings is if the Solr schema definition does not state them. *Example:* In the above case, will Solr

Re: Lucene->SOLR transition

2011-09-18 Thread Michael Sokolov
On 9/15/2011 8:30 PM, Scott Smith wrote: 2. Assuming that the answer to 1 is "correct", then is there an easy way to take a lucene query (with nested Boolean queries, filter queries, etc.) and generate a SOLR query string with q and fq components? I believe that Query.toString() will p

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
conclusion: both Tomcat 6 and Glassfish 3 work with solrj client by POST. ^=^ From: Xue-Feng Yang To: "solr-user@lucene.apache.org" Sent: Sunday, September 18, 2011 4:53:25 PM Subject: Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client Finally,

Re: JSON and DataImportHandler

2011-09-18 Thread Pulkit Singhal
Ah I see now: http://wiki.apache.org/solr/UpdateJSON#Example Not part of DIH that's all. On Sun, Sep 18, 2011 at 5:42 PM, Pulkit Singhal wrote: > Any updates on this topic? > > On Fri, Jul 16, 2010 at 5:36 PM, P Williams > wrote: >> Hi All, >> >>    Has anyone gotten the DataImportHandler to wor

Re: JSON and DataImportHandler

2011-09-18 Thread Pulkit Singhal
Any updates on this topic? On Fri, Jul 16, 2010 at 5:36 PM, P Williams wrote: > Hi All, > >    Has anyone gotten the DataImportHandler to work with json as input?  Is > there an even easier alternative to DIH?  Could you show me an example? > > Many thanks, > Tricia >

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
Finally, it's a known bug: https://issues.apache.org/jira/browse/SOLR-1959 From: Xue-Feng Yang To: "solr-user@lucene.apache.org" Sent: Sunday, September 18, 2011 1:25:08 PM Subject: Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client Actually,

How to sort results based on matching term position

2011-09-18 Thread Dan Ledgard
We have a requirement to sort/boost documents returned for phrase matches depending on where the match was within the field, the nearer the beginning the better i.e. a search term of "The w" would return results in this order The War of the Worlds The Ward The Whistleblower Gone with the Wind T
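The ordering asked for above can be pictured with a small client-side sketch that ranks titles by the character offset of the first case-insensitive match. This is plain Python, not a Solr feature or the poster's solution; the function name `rank_by_match_position` is hypothetical, and breaking ties alphabetically is an assumption.

```python
def rank_by_match_position(titles, term):
    """Order titles by where `term` first occurs (earlier is better)."""
    needle = term.lower()
    hits = []
    for title in titles:
        pos = title.lower().find(needle)
        if pos >= 0:  # keep only titles that actually contain the term
            hits.append((pos, title))
    # Sort by match offset, then alphabetically to break ties (assumption).
    return [title for _, title in sorted(hits)]


titles = [
    "Gone with the Wind",
    "The Whistleblower",
    "The Ward",
    "The War of the Worlds",
]
ranked = rank_by_match_position(titles, "The w")
```

Inside Solr itself the same effect would need position-aware scoring rather than a post-sort, so treat this only as a statement of the desired result.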

Stability of NRT searching in trunk

2011-09-18 Thread sammaye
Hey everyone, I know this question has probably been asked before, but I cannot seem to find it. Basically, I noticed that the latest trunk of Solr now has a soft commit (http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22) that of course allows for the hard index to lo

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
Actually, it works in some of the cases in my post. So it's not a server problem; it's a client problem. I may have made some mistakes in the solrj client, but the fact that it works in English makes things confusing. From: Kiwi de coder To: solr-user@lucene.apache.org; Xue-Feng

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
I saw this in the doc and set it before I started up Tomcat for the first time. All the results I posted are with this configuration. Thanks anyway. From: Kiwi de coder To: solr-user@lucene.apache.org; Xue-Feng Yang Sent: Sunday, September 18, 2011 1:14:58 PM Subject: Re:

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
I found that the encoding comes from java.net.URLEncoder. However, I still couldn't get any results with the solrj client. Is solrj used any differently for English and for other languages? From: Xue-Feng Yang To: "solr-user@lucene.apache.org" Sent: Sunday, Septe

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Kiwi de coder
maybe set on tomcat connector ? kiwi On Mon, Sep 19, 2011 at 12:56 AM, Xue-Feng Yang wrote: > The first returned link for Firefox 6 is the same as the returned link for > IE 9. I was misread. > > So the encode is this one. Anyone knows what it is. > > > > > F

Re: Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
The first returned link for Firefox 6 is the same as the returned link for IE 9. I misread it. So the encoding is this one. Does anyone know what it is? From: Xue-Feng Yang To: "solr-user@lucene.apache.org" Sent: Sunday, September 18, 2011 12:48:31 PM Subject: Tom

Tomcat unicode problems with IE 9, Firefox 6 and solrj client

2011-09-18 Thread Xue-Feng Yang
Now I have moved to Tomcat 6 for another round of testing. All English cores are OK for IE 9, Firefox 6 and the solrj client. When I tested with Chinese cores, IE 9 is OK whatever I tried on it. Firefox 6 is a little different: 1) If I set Query String to "content:耶和华" without quotation marks in page htt
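For reference, a minimal sketch of what UTF-8 percent-encoding of the query in this report looks like, using only the Python standard library. It illustrates the encoding step alone and is not a diagnosis of the thread's problem; whether the server decodes these bytes as UTF-8 depends on the container configuration (for Tomcat, the connector's `URIEncoding` attribute).

```python
from urllib.parse import quote, unquote

# Query string taken from the report above.
query = "content:耶和华"

# Percent-encode every reserved and non-ASCII character as UTF-8 bytes.
encoded = quote(query, safe="")

# Decoding with the same charset restores the original text.
decoded = unquote(encoded)

print(encoded)
```

If the two sides disagree on the charset, the decoded text no longer matches the indexed terms, which typically shows up as queries that return no results.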

Re: Miscellaneous DIH related questions

2011-09-18 Thread pulkitsinghal
No cron job, I'm just clicking on the full import button in dataimport.jsp page. 1) Can you point me to the code in Solr where such a retry functionality should be added? I might be able to contribute. 2) What is a good place to add the java based scheduling? Again I'll test and share if I succe

Re: Numeric Facets query results

2011-09-18 Thread Erick Erickson
Of course you can hack something like length:[10 TO 19.99], but that's clumsy too. It doesn't help the 3.x code line, but trunk allows mixing inclusive and exclusive end points in ranges, e.g. length:[10 TO 20} (note curly brace at end). See: https://issues.apache.org/jira/browse/LUCENE-996
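The two range forms mentioned above differ only in their closing bracket. A small sketch of building such clauses as strings, assuming a hypothetical helper named `range_clause` (not part of Solr or solrj):

```python
def range_clause(field, low, high, include_low=True, include_high=True):
    """Build a range clause; square brackets are inclusive, curly exclusive."""
    open_bracket = "[" if include_low else "{"
    close_bracket = "]" if include_high else "}"
    return f"{field}:{open_bracket}{low} TO {high}{close_bracket}"


# The 3.x workaround from the message: inclusive at both ends.
workaround = range_clause("length", 10, 19.99)

# The mixed form the message says trunk accepts: upper bound excluded.
half_open = range_clause("length", 10, 20, include_high=False)
```

Here `workaround` is `length:[10 TO 19.99]` and `half_open` is `length:[10 TO 20}`.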

Numeric Facets query results

2011-09-18 Thread Coda Moda
Hello, I am new to SOLR and using the Numeric Facets feature in v3.3 to display a facet list of lengths. I have successfully added the query parameters to display a facet list using: facet.range = length f.length.facet.range.start = 0 f.length.facet.range.end = 100 f.length.facet.range.gap = 10 f
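A minimal sketch of assembling the facet parameters listed above into a request query string with the Python standard library. The `q=*:*` and `facet=true` entries are assumptions added so the request is complete; they do not come from the message, and no host or handler path is implied.

```python
from urllib.parse import urlencode, parse_qsl

params = [
    ("q", "*:*"),         # assumption: match all documents
    ("facet", "true"),    # assumption: faceting has to be enabled
    ("facet.range", "length"),
    ("f.length.facet.range.start", "0"),
    ("f.length.facet.range.end", "100"),
    ("f.length.facet.range.gap", "10"),
]

# urlencode percent-escapes reserved characters such as ':' and '*'.
query_string = urlencode(params)

# Round trip to confirm nothing was lost in encoding.
round_trip = parse_qsl(query_string)

print(query_string)
```

With these values the facet covers 0 to 100 in steps of 10, that is, ten buckets.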