New operator.

2013-06-15 Thread Yanis Kakamaikis
Hi all, I want to add a new operator to my Solr. I need that operator to call my proprietary engine and build an answer vector for Solr, so that this vector becomes part of the boolean query at the next step. How do I do that? Thanks

Solr large boolean filter

2013-06-15 Thread Igor Kustov
I know I'm not the first one with this problem. I'm currently using Solr 4.2.1 with approximately 10 million documents in the index. The index is updated frequently. The filter query is just one big boolean OR query by id: fq=id:(1 2 3 4 ... 50950). The id list is always different and not sequentia
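
A minimal sketch of how a filter like the one described above could be assembled from a list of ids. The helper name build_id_filter and the sample ids are hypothetical, not from the thread, and the sketch assumes the default query operator is OR so that space-separated ids are treated as alternatives.

```python
from urllib.parse import urlencode


def build_id_filter(ids, field="id"):
    """Return a filter query such as 'id:(1 2 3)' for the given ids.

    Assumes the default operator is OR, so space-separated terms
    inside the parentheses match any of the listed ids.
    """
    if not ids:
        raise ValueError("at least one id is required")
    return f"{field}:({' '.join(str(i) for i in ids)})"


# Hypothetical sample ids; the real list in the thread has tens of thousands.
fq = build_id_filter([1, 2, 3, 4])
print(fq)  # id:(1 2 3 4)

# Form-encoded body, suitable for sending as a POST request so that a
# very long id list does not run into URL length limits.
body = urlencode({"q": "*:*", "fq": fq})
```

Sending the parameters in a POST body rather than in the URL is one way to keep a list of this size from being rejected for length; it does not by itself address the cost of evaluating the filter.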

Re: Suggest and Filtering

2013-06-15 Thread Brendan Grainger
Hi Otis and Jorge, I probably wasn't phrasing my question too well, but I think I was looking for FuzzySuggest. Messing around with the configs found here seems to be doing what I want: http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/test-files/solr/collection1/conf/solrconfig-phra

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Jack Krupansky
[My apologies to Roland for "hijacking" his original thread for this rant! Look what you started!!] And I will stand by my statement: "Solr is too much of a beast for average app developers to master." And the key word there, in case a too-casual reader missed it, is "master" - not "use" in t

Managing SolrCloud

2013-06-15 Thread Furkan KAMACI
I want to design a control mechanism for my SolrCloud. I have two choices. The first is to control every Solr node from a single point: when I want to start or stop Jetty remotely, I will connect to my nodes via an SSH library in Java. I will send the backup command and recovery process wit

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Grant Ingersoll
On Jun 15, 2013, at 12:54 PM, Alexandre Rafalovitch wrote: > On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll wrote: >> That being said, it truly amazes me that people were ever able to implement >> Solr, given some of the FUD in this thread. I guess those tens of thousands >> of deployment

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Alexandre Rafalovitch
On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll wrote: > That being said, it truly amazes me that people were ever able to implement > Solr, given some of the FUD in this thread. I guess those tens of thousands > of deployments out there were all done by above average devs... I would not cla

Re: Slow Highlighter Performance Even Using FastVectorHighlighter

2013-06-15 Thread Michael McCandless
You could also try the new[ish] PostingsHighlighter: http://blog.mikemccandless.com/2012/12/a-new-lucene-highlighter-is-born.html Mike McCandless http://blog.mikemccandless.com On Sat, Jun 15, 2013 at 8:50 AM, Michael Sokolov wrote: > If you have very large documents (many MB) that can lead to

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Grant Ingersoll
On Jun 13, 2013, at 11:24 AM, Walter Underwood wrote: > That was my thought exactly. Contribute a REST request handler. --wunder > +1. The bits are already in place for a lot of it now that RESTlet is in. That being said, it truly amazes me that people were ever able to implement Solr, gi

Re: yet another optimize question

2013-06-15 Thread Otis Gospodnetic
Hi Robi, I'm going to guess you are also seeing a smaller heap simply because you restarted the JVM recently (hm, you don't say you restarted, maybe I'm making this up). If you are indeed indexing continuously then you shouldn't optimize. Lucene will merge segments itself. Lower mergeFactor will for

Re: Slow Highlighter Performance Even Using FastVectorHighlighter

2013-06-15 Thread Michael Sokolov
If you have very large documents (many MB) that can lead to slow highlighting, even with FVH. See https://issues.apache.org/jira/browse/LUCENE-3234 and try setting phraseLimit=1 (or some bigger number, but not infinite, which is the default) -Mike On 6/14/13 4:52 PM, Andy Brown wrote: Bry
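
A minimal sketch of request parameters that apply the phraseLimit suggestion above when highlighting with the FastVectorHighlighter. The helper name highlight_params, the query, and the field name content are hypothetical; the parameter names are the standard Solr highlighting parameters.

```python
from urllib.parse import urlencode


def highlight_params(query, field, phrase_limit=1):
    """Build Solr request parameters for FVH highlighting with a phrase limit."""
    return {
        "q": query,
        "hl": "true",
        "hl.fl": field,  # field to highlight (hypothetical name below)
        "hl.useFastVectorHighlighter": "true",
        "hl.phraseLimit": str(phrase_limit),  # cap on phrases analyzed per document
    }


params = highlight_params("content:solr", "content")
query_string = urlencode(params)
print(query_string)
```

A larger phrase_limit trades highlighting speed for more complete snippets on very large documents.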