I'm wondering whether storing just the offset as a payload would be cheaper
from a storage perspective than enabling termOffsets, termVectors and
termPositions. Maybe we could then return the offset info with the results
from there?
Thanks,
Nalini
On Fri, Sep 20, 2013 at 5:02 PM, Nalini Kartha wrote:
Thanks for the reply.
We tried enabling these options but that's also causing too much index
bloat so I was wondering if there's a way to get at the offset information
more cheaply?
Thanks,
Nalini
On Fri, Sep 20, 2013 at 4:41 PM, Jack Krupansky wrote:
> Set:
>
> termVectors=true
> termPosition
You're probably exceeding the size that your servlet container allows.
This assumes you're using curl or some such. You can change it.
How big is the document and how are you sending it to Solr?
Best,
Erick
On Tue, Sep 17, 2013 at 4:28 PM, Otis Gospodnetic
wrote:
> Hi
>
> 50m docs across 18 serv
Set:
termVectors=true
termPositions=true
termOffsets=true
And use the fast vector highlighter.
-- Jack Krupansky
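
For illustration, the request side of this suggestion might look roughly like the sketch below. It only builds a select URL that asks Solr to use the fast vector highlighter; the host, core name ("collection1") and field name ("body") are assumptions for the example, not details from this thread, and the field would still need the three term vector attributes above set in the schema.

```python
from urllib.parse import urlencode


def build_highlight_query(base_url, query, field):
    """Build a Solr select URL that requests FastVectorHighlighter output."""
    params = {
        "q": query,
        "hl": "true",                            # turn highlighting on
        "hl.fl": field,                          # field to highlight
        "hl.useFastVectorHighlighter": "true",   # relies on term vectors
        "wt": "json",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical host, core and field names, used only for this example.
url = build_highlight_query(
    "http://localhost:8983/solr/collection1/select", "body:solr", "body"
)
print(url)
```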
-----Original Message-----
From: Nalini Kartha
Sent: Friday, September 20, 2013 7:34 PM
To: solr-user@lucene.apache.org
Subject: Getting term offsets from Solr
Hi,
We're loo
Hi,
We're looking at implementing highlighting for some fields which may be too
large to store in the index.
As an alternative to using the Solr Highlighter (which needs fields to be
stored), I was wondering if a) the offsets of terms are stored BY DEFAULT
in the index (even if we're not using th
On 9/20/2013 12:34 PM, Erick Erickson wrote:
> You're probably exceeding the size that your servlet container allows.
> This assumes you're using curl or some such. You can change it.
> How big is the document and how are you sending it to Solr?
The maximum form size is configurable in Solr, not s
Ah, good to know, Shawn...
Erick
On Fri, Sep 20, 2013 at 1:04 PM, Shawn Heisey wrote:
> On 9/20/2013 12:34 PM, Erick Erickson wrote:
> > You're probably exceeding the size that your servlet container allows.
> > This assumes you're using curl or some such. You can change it.
> > How big is t
I have two cores "favorite" and "user" running in the same Tomcat instance.
In each of these cores I have identical field types "text_en", "text_de",
"text_fr", and "text_ja". These fields use some custom token filters I've
written. Everything was going smoothly when I only had the "favorite" core.
Thanks Michael, I thought I had the latest but it turned out to be from
July 2011. Working fine with the latest build :-)
On Thu, Sep 19, 2013 at 7:29 PM, Michael Ryan wrote:
> This is a known bug in that JDK version. Upgrade to a newer version of JDK
> 7 (any build within the last two years or
On 9/19/2013 9:20 AM, Neil Prosser wrote:
> Apologies for the giant email. Hopefully it makes sense.
Because of its size, I'm going to reply inline like this and I'm going
to trim out portions of your original message. I hope that's not
horribly confusing to you! Looking through my archive of th
I am unsure about the cause of the following NullPointerException. Any
ideas?
Thanks
Exception in thread "main"
org.springframework.beans.factory.BeanCreationException: Error creating
bean with name 'aDocumentService': Injection of autowired dependencies
failed; nested exception is
org.springfr
On 9/19/2013 5:50 AM, Gareth Poulton wrote:
> A customer wants us to move their entire enterprise platform - of which one
> of the many components is Oracle Endeca - to open source.
> However, customers being the way they are, they don't want to have to give
> up any of the features they currently
What happens if you bump up your ZooKeeper timeout? This has been an
issue at times in the past.
Best,
Erick
On Tue, Sep 17, 2013 at 1:48 PM, Furkan KAMACI wrote:
> Could you give some information about your jetty.xml and give more info
> about your index rate and RAM usage of your machines?
>
>
You can always commit them one at a time to the ExtractingRequestHandler
http://wiki.apache.org/solr/ExtractingRequestHandler
Best,
Erick
On Tue, Sep 17, 2013 at 6:47 AM, Yossi Nachum wrote:
> Hi,
>
> I am trying to index my windows pc files with manifoldcf version 1.3 and
> solr version 4.4.
>
Exactly the insight I was looking for! Thanks Yonik ;-)
On Fri, Sep 20, 2013 at 10:37 AM, Yonik Seeley wrote:
> On Fri, Sep 20, 2013 at 11:56 AM, Timothy Potter
> wrote:
> > Trying to add some information about core.properties and auto-discovery
> in
> > Solr in Action and am at a loss for wha
What is the cause of this stack trace?
Working with the following Solr Maven dependencies:
4.4.0
1.0.0.RC1
Stacktrace
SEVERE: Exception sending context initialized event to listener instance of
class org.springframework.web.context.ContextLoaderListener
org.springframework.beans.factory.BeanCreati
On Fri, Sep 20, 2013 at 11:56 AM, Timothy Potter wrote:
> Trying to add some information about core.properties and auto-discovery in
> Solr in Action and am at a loss for what to tell the reader is the purpose
> of this feature.
IMO, it was more a removal of unnecessary central configuration.
You
Trying to add some information about core.properties and auto-discovery in
Solr in Action and am at a loss for what to tell the reader is the purpose
of this feature.
Can anyone point me to any background information about core
auto-discovery? I'm not interested in the technical implementation det
You're probably exceeding the size that your servlet container allows.
This assumes you're using curl or some such. You can change it.
How big is the document and how are you sending it to Solr?
Best,
Erick
On Tue, Sep 17, 2013 at 2:24 PM, Furkan KAMACI wrote:
> Currently I have over 50+ milli
Hi,
Try the UNC path instead: http://wiki.apache.org/tomcat/FAQ/Windows#Q6
Regards,
Aloke
On 9/20/13, johnmu...@aol.com wrote:
> Hi,
>
>
> I'm having this same problem as described here:
> http://stackoverflow.com/questions/17708163/absolute-paths-in-solr-xml-configuration-using-tomcat6-on-wind
If you're using "spellcheck.collate" you can also set
"spellcheck.maxCollationTries" to validate each collation against the index
before suggesting it. This validation takes into account any "fq" parameters
on your query, so if your original query has "fq=Product:Book", then the
collations ret
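
As a rough sketch of the parameters described in this message, the query string below combines "spellcheck.collate" and "spellcheck.maxCollationTries" with an "fq" restriction. The misspelled query text and the value of 5 tries are assumptions chosen for the example, not values from the thread.

```python
from urllib.parse import urlencode


def build_spellcheck_params(query, filter_query, max_tries=5):
    """Build a query string that asks Solr to verify collations against the index."""
    params = [
        ("q", query),
        ("fq", filter_query),                # collations are checked with this filter applied
        ("spellcheck", "true"),
        ("spellcheck.collate", "true"),
        ("spellcheck.maxCollationTries", str(max_tries)),
    ]
    return urlencode(params)


# Hypothetical misspelled query; max_tries=5 is an arbitrary example value.
query_string = build_spellcheck_params("title:hary poter", "Product:Book")
print(query_string)
```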
This would require plugging in your own router, which is not yet possible.
However, you can split that shard repeatedly and keep the number of users in
that shard limited.
On Fri, Sep 20, 2013 at 3:52 PM, lochri wrote:
> Hello folks,
>
> we would like to have control of where certain hash values or ranges a
Hello folks,
we would like to have control of where certain hash values or ranges are
being located.
The reason is that we want to shard per user but we know ahead that one or
more specific users could grow way faster than others. Therefore we would
like to locate them on separate shards (which ma
Hi,
I'd like to know whether it is possible to get suggestions from only part of
the index.
For example, on an e-commerce site
there are many types of products (book, DVD, CD, ...).
If I search within books, I want suggestions only for book products, not CDs,
but the spellchecking indexes are all together.
is i
Sorry, my bad. For SolrCloud soft commits are enabled (every 15 seconds). I
do a hard commit from an external cron task via curl every 15 minutes.
The version I'm using for the SolrCloud setup is 4.4.0.
Document cache warm-up times are 0ms.
Filter cache warm-up times are between 3 and 7 seconds.