Adding DocValues after or in the middle of indexing

2013-11-15 Thread Otis Gospodnetic
Hi, Can one introduce DocValues (by adding them to the schema.xml) post facto? If that is done, do newly added documents end up using DocValues, while the old ones remain without DocValues? Thanks, Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * ht

Very long warmup query vs. frequent soft commit with new searcher

2013-11-15 Thread Otis Gospodnetic
Hi, What happens when one has a *single* very long-running *warming* query that takes, say, 10 minutes, and a soft commit that opens a new searcher happening every 1 minute? Could one run into a situation where each soft commit triggers the same long warming query, thus queueing them one after

Re: Suspicious message with attachment

2013-11-15 Thread N.Emrah ALBULAK
Sent from my iPad > On 15 Kas 2013, at 11:00, h...@ssww.com wrote: > > The following message addressed to you was quarantined because it likely > contains a virus: > > Subject: Solr Core Reload causing JVM Memory Leak through > FieldCache/LRUCache/LFUCache > From: Umesh Prasad > > However,

RE: SolrCloud question

2013-11-15 Thread Beale, Jim (US-KOP)
Hi Mark, Thanks for the reply. I am struggling a bit here. Sorry if these are basic questions! I can't find the answers anywhere. I modified my solr.xml on all boxes to comment out the core definition for 'tp'. Then, I used /admin/collections?action=CREATE&name=tp&numShards=1 against one of t

Re: Solr Grouping

2013-11-15 Thread tamanjit.bin...@yahoo.co.in
My question is for group.format=simple. In normal grouping I know group.offset would work. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Grouping-tp4101313p4101316.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr grouping performance porblem

2013-11-15 Thread shamik
Thanks for the update Shawn, will look forward to the release. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-grouping-performance-porblem-tp4098565p4101314.html Sent from the Solr - User mailing list archive at Nabble.com.

Solr Grouping

2013-11-15 Thread tamanjit.bin...@yahoo.co.in
Hi, In Grouping we can group docs as per a field. Can we also have something like pagination within a group. For eg. G1 has G1D1,G1D2,G1D3 G2 has G2D1, G2D2 G3 has G3D1, G3D2, G3D3, G3D4. Can I fetch the results like (if group.format=simple) Page1: G1D1 G2D1 G3D1 Page 2: G1D2 G2D2 G3D2 Page3:

Re: SolrCoreAware

2013-11-15 Thread Steven Bower
And the close hook will basically only be fired once during shutdown? On Fri, Nov 15, 2013 at 1:07 PM, Chris Hostetter wrote: > > : So for a given instance of a handler it will only be called once during > the > : lifetime of that handler? > > correct (unless there is a bug somewhere) > > : Also

Re: SolrCoreAware

2013-11-15 Thread Chris Hostetter
: So for a given instance of a handler it will only be called once during the : lifetime of that handler? correct (unless there is a bug somewhere) : Also, when the core is passed in as part of inform() is it guaranteed to be : ready to go? (ie I can start feeding content at this point?) Right,

Re: SolrCoreAware

2013-11-15 Thread Steven Bower
>>> it should be called only once during the lifetime of a given plugin, >>> usually not long after construction -- but it could be called many, many >>> times in the lifetime of the solr process. So for a given instance of a handler it will only be called once during the lifetime of that handler?

Re: SolrCoreAware

2013-11-15 Thread Chris Hostetter
: So it's something that can happen multiple times during the lifetime of : process, but I'm guessing something not occurring very often? it should be called only once during the lifetime of a given plugin, usually not long after construction -- but it could be called many, many times in the life

Re: SolrCoreAware

2013-11-15 Thread Shalin Shekhar Mangar
On Fri, Nov 15, 2013 at 11:19 PM, Steven Bower wrote: > So it's something that can happen multiple times during the lifetime of > process, but I'm guessing something not occurring very often? That's right. > > Also is there a way to hook the shutdown of the core? You can use SolrCore.addCloseHook

Re: SolrCoreAware

2013-11-15 Thread Steven Bower
So it's something that can happen multiple times during the lifetime of process, but I'm guessing something not occurring very often? Also is there a way to hook the shutdown of the core? steve On Fri, Nov 15, 2013 at 12:08 PM, Alan Woodward wrote: > Hi Steven, > > It's called when the handler

Re: PDF indexing issues

2013-11-15 Thread Furkan KAMACI
You should check the Apache PDFBox project. A similar question: https://issues.apache.org/jira/browse/PDFBOX-940 2013/11/15 Marcello Lorenzi > Hi, > during our testing of Apache SOLR 4.3, we noticed some errors > occurring during PDF indexing: > > ERROR - 2013-11-15 15:14:26.248; org.apache.pd

Re: SolrCoreAware

2013-11-15 Thread Alan Woodward
Hi Steven, It's called when the handler is created, either at SolrCore construction time (solr startup or core reload) or the first time the handler is requested if it's a lazy-loading handler. Alan Woodward www.flax.co.uk On 15 Nov 2013, at 15:40, Steven Bower wrote: > Under what circumst

Re: SolrCloud question

2013-11-15 Thread Mark Miller
We are moving away from predefining SolrCores for SolrCloud. The correct approach would be to use the Collections API - then it is quite simple to change the number of shards for each collection you create. Hopefully our examples will move to doing this before long. - Mark On Nov 15, 2013, a

SolrCloud question

2013-11-15 Thread Beale, Jim (US-KOP)
Hello all, I am trying to set up a SolrCloud deployment consisting of 5 boxes each of which is running Solr under jetty. A zookeeper ensemble is running separately on 3 of the boxes. Each Solr instance has 2 cores, one of which is sharded across the five boxes and the other not sharded at all

PDF indexing issues

2013-11-15 Thread Marcello Lorenzi
Hi, during our testing of Apache SOLR 4.3, we noticed some errors occurring during PDF indexing: ERROR - 2013-11-15 15:14:26.248; org.apache.pdfbox.pdmodel.font.PDCIDFont; Error: Could not parse predefined CMAP file for 'PDFXC30-Indentity0-UCS2' ERROR - 2013-11-15 15:14:36.108; org.apache.p

Re: field collapsing performance in sharded environment

2013-11-15 Thread Paul Masurel
That's not the way grouping is done. On a first round all shards return their 10 best groups (represented as their 10 best grouping values). As a result it's a three-round thing instead of the two rounds for regular search, so observing an increase in latency is normal but not in the realm of what

SolrCoreAware

2013-11-15 Thread Steven Bower
Under what circumstances will a handler that implements SolrCoreAware have its inform() method called? thanks, steve

Re: Document routing question.

2013-11-15 Thread Yago Riveiro
Joel, Thanks for the explanation. -- Yago Riveiro Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Friday, November 15, 2013 at 2:14 PM, Joel Bernstein wrote: > Yago, > > Now that I look back at this blog, I see how this can be confusing. > > This is how to breakdown the composite

Re: Date range faceting with various gap sizes?

2013-11-15 Thread jimi.hullegard
> Chris Hostetter wrote: > > You can see that in the resulting URL you got the params are duplicated -- the > problem is that when expressed this way, Solr doesn't know when the > different values of the start/end/gap params should be applied -- it just > loops over each of the facet.range fields (

Re: An UpdateHandler to run following a MySql DataImport

2013-11-15 Thread Dileepa Jayakody
I found out that you can configure any requestHandler to run a requestProcessor chain. So in my /dataimport requestHandler I just called my custom requestHandler as a chain; eg: data-config.xml *stanbolInterceptor* It works. Thanks, Dileepa On Fri, Nov 15, 2013 at 6:08 PM, Erick Erick

Re: Document routing question.

2013-11-15 Thread Joel Bernstein
Yago, Now that I look back at this blog, I see how this can be confusing. This is how to break down the composite id: tenant1/4!docXXX "tenant1" is the shardkey. "/" is a separator between the shardkey and bits to use from the shardkey. "4" is the number of bits taken from the shardkey to creat
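The breakdown above can be sketched in a few lines of Python. This is only an illustration of how the pieces of a composite id such as tenant1/4!docXXX separate; the names `CompositeId` and `parse_composite_id` are hypothetical, and the sketch does not reproduce how Solr hashes the shard key or uses the bit count for routing.

```python
from typing import NamedTuple, Optional


class CompositeId(NamedTuple):
    """Parts of a composite id (hypothetical helper type, not a Solr class)."""
    shard_key: str
    bits: Optional[int]
    doc_id: str


def parse_composite_id(composite_id: str) -> CompositeId:
    """Split 'shardkey/bits!docid' into its parts. Parsing only, no hashing."""
    prefix, bang, doc_id = composite_id.partition("!")
    if not bang:
        # No "!" present: treat the whole string as a plain document id.
        return CompositeId("", None, composite_id)
    # "/" separates the shard key from the number of bits to take from it.
    shard_key, slash, bits = prefix.partition("/")
    return CompositeId(shard_key, int(bits) if slash else None, doc_id)


if __name__ == "__main__":
    print(parse_composite_id("tenant1/4!docXXX"))
```

An id without the "/4" part, such as tenant1!docXXX, parses with `bits` set to None, which leaves the choice of bit count to whatever the router's default is.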

Re: Solr xml img parsing exception

2013-11-15 Thread Marcello Lorenzi
Hi Jack, we have analyzed the issue and there were duplicated jars in the Tomcat classpath for Tika. After the removal of the duplicated library the search engine now works as expected. Thanks for the support, Marcello On 11/14/2013 05:24 PM, Jack Krupansky wrote: The actual error appears to

Re: exceeded limit of maxWarmingSearchers ERROR

2013-11-15 Thread Erick Erickson
That's a fine place to start. This form: ${solr.autoCommit.maxTime:15000} just allows you to define a sysvar to override the 15 second default, like java -Dsolr.autoCommit.maxTime=3 -jar start.jar On Fri, Nov 15, 2013 at 8:11 AM, Loka wrote: > Hi Erickson, > > I have seen the following al
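The ${name:default} form described above can be illustrated with a small Python sketch. It mimics the substitution rule only (use the property if it is defined, otherwise the default after the colon); the function `resolve` and the `properties` dict are hypothetical stand-ins for Solr's own handling of JVM system properties, not its implementation.

```python
import re

# Matches ${name} or ${name:default}.
_PLACEHOLDER = re.compile(r"\$\{([^:}]+)(?::([^}]*))?\}")


def resolve(text: str, properties: dict) -> str:
    """Replace each ${name:default} with properties[name], else the default."""
    def substitute(match):
        name, default = match.group(1), match.group(2)
        if name in properties:
            return properties[name]
        if default is not None:
            return default
        raise KeyError(f"no value or default for property {name!r}")
    return _PLACEHOLDER.sub(substitute, text)


if __name__ == "__main__":
    template = "${solr.autoCommit.maxTime:15000}"
    # No property defined: the 15000 default applies.
    print(resolve(template, {}))
    # Property defined (as with -Dsolr.autoCommit.maxTime=...): it overrides.
    print(resolve(template, {"solr.autoCommit.maxTime": "30000"}))
```

One consequence of this rule is that the property name has to match exactly: a misspelled name on the command line is simply a different property, so the default stays in effect.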

Re: exceeded limit of maxWarmingSearchers ERROR

2013-11-15 Thread Loka
Hi Erickson, I have seen the following also from google, can I use the same in : false If the above one is correct to add, can I add the below tags also in along with the above tag: 3 1 so finally, it will look like as: 3 1

Re: exceeded limit of maxWarmingSearchers ERROR

2013-11-15 Thread Loka
Erickson, Thanks for your reply. Before your reply, I googled and found the following and added it under the tag of the solrconfig.xml file. 3 1 Is the above fine, or should I go strictly as per your suggestion, meaning as below: ${solr.autoCommit.maxTi

Re: An UpdateHandler to run following a MySql DataImport

2013-11-15 Thread Erick Erickson
Hmmm, don't quite know the answer to that, but when things start getting complex with DIH, you should seriously consider a SolrJ solution unless someone comes up with a quick fix. Here's an example. http://searchhub.org/2012/02/14/indexing-with-solrj/ Best, Erick On Fri, Nov 15, 2013 at 2:48 AM

Re: exceeded limit of maxWarmingSearchers ERROR

2013-11-15 Thread Erick Erickson
Where did you get that syntax? I've never seen that before. What you want to configure is the "maxTime" in your autocommit and autosoftcommit sections of solrconfig.xml, as: ${solr.autoCommit.maxTime:15000} false ${solr.autoSoftCommit.maxTime:1}

RE: facet method=enum and uninvertedfield limitations

2013-11-15 Thread Lemke, Michael SZ/HZA-ZSW
On Thu, November 14, 2013 7:26 PM, Yonik Seeley wrote: >On Thu, Nov 14, 2013 at 12:03 PM, Lemke, Michael SZ/HZA-ZSW > wrote: >> I am running into performance problems with faceted queries. >> If I do a >> >> q=word&facet.field=CONTENT&facet=true&facet.limit=10&facet.mincount=1&facet.method=fc&face

Re: Solr Core Reload causing JVM Memory Leak through FieldCache/LRUCache/LFUCache

2013-11-15 Thread Umesh Prasad
Mailing list by default removes attachments. So uploaded it to google drive .. https://drive.google.com/file/d/0B-RnB4e-vaJhX280NVllMUdHYWs/edit?usp=sharing On Fri, Nov 15, 2013 at 2:28 PM, Umesh Prasad wrote: > Hi All, > We are seeing memory leaks in our Search application whenever core

Suspicious message with attachment

2013-11-15 Thread help
The following message addressed to you was quarantined because it likely contains a virus: Subject: Solr Core Reload causing JVM Memory Leak through FieldCache/LRUCache/LFUCache From: Umesh Prasad However, if you know the sender and are expecting an attachment, please reply to this message, a

Is there a max Size for synony-Definition?

2013-11-15 Thread Michael Bulla
Hi there, yesterday I had a strange problem with using synonyms in Solr 4.3.0 In my schema there is the default-configuration for synonyms defined Everything works fine with that config, except this line Comb