Re: Getting term offsets from Solr

2013-09-20 Thread Nalini Kartha
I'm wondering if storing just the offset as a payload would be cheaper from a storage perspective than enabling termOffsets, termVectors and termPositions? Maybe we could get the offset info to return with results from there then? Thanks, Nalini On Fri, Sep 20, 2013 at 5:02 PM, Nalini Kartha wrote

Re: Getting term offsets from Solr

2013-09-20 Thread Nalini Kartha
Thanks for the reply. We tried enabling these options but that's also causing too much index bloat so I was wondering if there's a way to get at the offset information more cheaply? Thanks, Nalini On Fri, Sep 20, 2013 at 4:41 PM, Jack Krupansky wrote: > Set: > > termVectors=true > termPosition

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Erick Erickson
You're probably exceeding the size that your servlet container allows. This assumes you're using curl or some such. You can change it. How big is the document and how are you sending it to Solr? Best, Erick On Tue, Sep 17, 2013 at 4:28 PM, Otis Gospodnetic wrote: > Hi > > 50m docs across 18 serv

Re: Getting term offsets from Solr

2013-09-20 Thread Jack Krupansky
Set: termVectors=true termPositions=true termOffsets=true And use the fast vector highlighter. -- Jack Krupansky -Original Message- From: Nalini Kartha Sent: Friday, September 20, 2013 7:34 PM To: solr-user@lucene.apache.org Subject: Getting term offsets from Solr Hi, We're loo
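The three settings above are field attributes in the schema; the query side of "use the fast vector highlighter" can be sketched as below. This is only a minimal illustration: the host, core name (`collection1`), field name (`body`) and the helper `build_highlight_query` are hypothetical, and it assumes the Solr 4.x request parameter `hl.useFastVectorHighlighter`.

```python
from urllib.parse import urlencode


def build_highlight_query(base_url, query, field):
    """Build a /select URL that asks Solr to highlight `field`
    using the FastVectorHighlighter (needs term vectors, positions
    and offsets enabled on that field)."""
    params = {
        "q": query,
        "hl": "true",                           # turn highlighting on
        "hl.fl": field,                         # field(s) to highlight
        "hl.useFastVectorHighlighter": "true",  # Solr 4.x switch
        "wt": "json",
    }
    return base_url + "/select?" + urlencode(params)


# Hypothetical host, core and field, for illustration only.
highlight_url = build_highlight_query(
    "http://localhost:8983/solr/collection1", "body:solr", "body"
)
print(highlight_url)
```

Note that this does not reduce index size; it only shows how the highlighter would be requested once the term vector options are enabled.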

Getting term offsets from Solr

2013-09-20 Thread Nalini Kartha
Hi, We're looking at implementing highlighting for some fields which may be too large to store in the index. As an alternative to using the Solr Highlighter (which needs fields to be stored), I was wondering if a) the offsets of terms are stored BY DEFAULT in the index (even if we're not using th

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Shawn Heisey
On 9/20/2013 12:34 PM, Erick Erickson wrote: > You're probably exceeding the size that your servlet container allows. > This assumes you're using curl or some such. You can change it. > How big is the document and how are you sending it to Solr? The maximum form size is configurable in Solr, not s

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Erick Erickson
Ah, good to know, Shawn... Erick On Fri, Sep 20, 2013 at 1:04 PM, Shawn Heisey wrote: > On 9/20/2013 12:34 PM, Erick Erickson wrote: > > You're probably exceeding the size that your servlet container allows. > > This assumes you're using curl or some such. You can change it. > > How big is t

java.lang.LinkageError when using custom filters in multiple cores

2013-09-20 Thread Hayden Muhl
I have two cores "favorite" and "user" running in the same Tomcat instance. In each of these cores I have identical field types "text_en", "text_de", "text_fr", and "text_ja". These fields use some custom token filters I've written. Everything was going smoothly when I only had the "favorite" core.

Re: JVM Crash using solr 4.4 on Centos

2013-09-20 Thread Oak McIlwain
Thanks Michael, I thought I had the latest but it turned out to be from July 2011. Working Fine with the latest build :-) On Thu, Sep 19, 2013 at 7:29 PM, Michael Ryan wrote: > This is a known bug in that JDK version. Upgrade to a newer version of JDK > 7 (any build within the last two years or

Re: SolrCloud setup - any advice?

2013-09-20 Thread Shawn Heisey
On 9/19/2013 9:20 AM, Neil Prosser wrote: > Apologies for the giant email. Hopefully it makes sense. Because of its size, I'm going to reply inline like this and I'm going to trim out portions of your original message. I hope that's not horribly confusing to you! Looking through my archive of th

Cause of NullPointer Exception? (Solr with Spring Data)

2013-09-20 Thread JMill
I am unsure about the cause of the following NullPointer Exception. Any Ideas? Thanks Exception in thread "main" org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'aDocumentService': Injection of autowired dependencies failed; nested exception is org.springfr

Re: Migrating from Endeca

2013-09-20 Thread Shawn Heisey
On 9/19/2013 5:50 AM, Gareth Poulton wrote: > A customer wants us to move their entire enterprise platform - of which one > of the many components is Oracle Endeca - to open source. > However, customers being the way they are, they don't want to have to give > up any of the features they currently

Re: Solr node goes down while trying to index records

2013-09-20 Thread Erick Erickson
What happens if you bump up your ZooKeeper timeout? This has been an issue at times in the past. Best, Erick On Tue, Sep 17, 2013 at 1:48 PM, Furkan KAMACI wrote: > Could you give some information about your jetty.xml and give more info > about your index rate and RAM usage of your machines? > >

Re: check which file/document cause solr to work hard

2013-09-20 Thread Erick Erickson
you can always commit them one at a time to the ExtractingRequestHandler http://wiki.apache.org/solr/ExtractingRequestHandler Best, Erick On Tue, Sep 17, 2013 at 6:47 AM, Yossi Nachum wrote: > Hi, > > I am trying to index my windows pc files with manifoldcf version 1.3 and > solr version 4.4. >

Re: Need help understanding the use cases behind core auto-discovery

2013-09-20 Thread Timothy Potter
Exactly the insight I was looking for! Thanks Yonik ;-) On Fri, Sep 20, 2013 at 10:37 AM, Yonik Seeley wrote: > On Fri, Sep 20, 2013 at 11:56 AM, Timothy Potter > wrote: > > Trying to add some information about core.properties and auto-discovery > in > > Solr in Action and am at a loss for wha

Problem running EmbeddedSolr (spring data)

2013-09-20 Thread JMill
What is the cause of this stack trace? I'm working with the following Solr Maven dependencies: 4.4.0 1.0.0.RC1 Stacktrace SEVERE: Exception sending context initialized event to listener instance of class org.springframework.web.context.ContextLoaderListener org.springframework.beans.factory.BeanCreati

Re: Need help understanding the use cases behind core auto-discovery

2013-09-20 Thread Yonik Seeley
On Fri, Sep 20, 2013 at 11:56 AM, Timothy Potter wrote: > Trying to add some information about core.properties and auto-discovery in > Solr in Action and am at a loss for what to tell the reader is the purpose > of this feature. IMO, it was more a removal of unnecessary central configuration. You

Need help understanding the use cases behind core auto-discovery

2013-09-20 Thread Timothy Potter
Trying to add some information about core.properties and auto-discovery in Solr in Action and am at a loss for what to tell the reader is the purpose of this feature. Can anyone point me to any background information about core auto-discovery? I'm not interested in the technical implementation det

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Erick Erickson
You're probably exceeding the size that your servlet container allows. This assumes you're using curl or some such. You can change it. How big is the document and how are you sending it to Solr? Best, Erick On Tue, Sep 17, 2013 at 2:24 PM, Furkan KAMACI wrote: > Currently I have over 50+ milli

Re: Will Solr work with a mapped drive?

2013-09-20 Thread Aloke Ghoshal
Hi, Try the UNC path instead: http://wiki.apache.org/tomcat/FAQ/Windows#Q6 Regards, Aloke On 9/20/13, johnmu...@aol.com wrote: > Hi, > > > I'm having this same problem as described here: > http://stackoverflow.com/questions/17708163/absolute-paths-in-solr-xml-configuration-using-tomcat6-on-wind

RE: Spellchecking

2013-09-20 Thread Dyer, James
If you're using "spellcheck.collate" you can also set "spellcheck.maxCollationTries" to validate each collation against the index before suggesting it. This validation takes into account any "fq" parameters on your query, so if your original query has "fq=Product:Book", then the collations ret
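As a rough sketch of the request described above, the parameters could be assembled like this. The host, core name, the query term and the helper `build_spellcheck_query` are hypothetical; the sketch also assumes the spellcheck component is already configured on the `/select` handler.

```python
from urllib.parse import urlencode


def build_spellcheck_query(base_url, query, filter_query, max_tries=5):
    """Build a /select URL that requests collations and asks Solr to
    test up to `max_tries` of them against the index; the fq is part
    of the request, so it applies to that validation as well."""
    params = [
        ("q", query),
        ("fq", filter_query),
        ("spellcheck", "true"),
        ("spellcheck.collate", "true"),
        ("spellcheck.maxCollationTries", str(max_tries)),
        ("wt", "json"),
    ]
    return base_url + "/select?" + urlencode(params)


# Hypothetical host, core and misspelled query, for illustration only.
spellcheck_url = build_spellcheck_query(
    "http://localhost:8983/solr/collection1", "harry poter", "Product:Book"
)
print(spellcheck_url)
```

The value of `max_tries` here is arbitrary; higher values check more candidate collations at the cost of extra query time.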

Re: Hash range to shard assignment

2013-09-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
This would require plugging in your own router, which is not yet possible. But you can split that shard repeatedly and keep the number of users in that shard limited. On Fri, Sep 20, 2013 at 3:52 PM, lochri wrote: > Hello folks, > > we would like to have control of where certain hash values or ranges a

Hash range to shard assignment

2013-09-20 Thread lochri
Hello folks, we would like to have control of where certain hash values or ranges are being located. The reason is that we want to shard per user but we know ahead that one or more specific users could grow way faster than others. Therefore we would like to locate them on separate shards (which ma

Spellchecking

2013-09-20 Thread Gastone Penzo
Hi, I'd like to know if it is possible to get suggestions from only part of the index. For example, in an e-commerce site there are many types of products (book, DVD, CD...). If I search within books, I want suggestions only for book products, not CDs, but the spellchecking indexes are all together. is i

Re: SolrCloud setup - any advice?

2013-09-20 Thread Neil Prosser
Sorry, my bad. For SolrCloud soft commits are enabled (every 15 seconds). I do a hard commit from an external cron task via curl every 15 minutes. The version I'm using for the SolrCloud setup is 4.4.0. Document cache warm-up times are 0ms. Filter cache warm-up times are between 3 and 7 seconds.
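For reference, the kind of URL an external cron task might hit for the hard commit can be sketched as follows. The host, core name and the helper `build_commit_url` are hypothetical; the sketch assumes the standard `/update` handler with its `commit` and `softCommit` request parameters.

```python
from urllib.parse import urlencode


def build_commit_url(base_url, soft=False):
    """Build an /update URL that triggers a commit.
    With soft=True the softCommit flag is added as well."""
    params = {"commit": "true"}
    if soft:
        params["softCommit"] = "true"
    return base_url + "/update?" + urlencode(params)


# Hypothetical host and core, for illustration only.
hard_commit_url = build_commit_url("http://localhost:8983/solr/collection1")
soft_commit_url = build_commit_url(
    "http://localhost:8983/solr/collection1", soft=True
)
print(hard_commit_url)
print(soft_commit_url)
```

A cron entry would then fetch the first URL with curl every 15 minutes, matching the schedule described in the message.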