Re: Solr replication, HAproxy and data management

2010-12-13 Thread Paolo Castagna
Paolo Castagna wrote: Hi, we are using Solr v1.4.x with multi-cores and a master/slaves configuration. We also use HAProxy [1] to load balance search requests amongst slaves. Finally, we use MapReduce to create new Solr indexes. I'd like to share with you what I am doing when I need to

Solr replication, HAproxy and data management

2010-12-09 Thread Paolo Castagna
Hi, we are using Solr v1.4.x with multi-cores and a master/slaves configuration. We also use HAProxy [1] to load balance search requests amongst slaves. Finally, we use MapReduce to create new Solr indexes. I'd like to share with you what I am doing when I need to: 1. add a new index 2. replac

Re: EmbeddedSolrServer with one core and schema.xml loaded via ClassLoader, is it possible?

2010-10-25 Thread Paolo Castagna
lems with the second solution? Is there a better way? Paolo Paolo Castagna wrote: Hi, I am trying to use EmbeddedSolrServer with just one core and I'd like to load solrconfig.xml, schema.xml and other configuration files from a jar via getResourceAsStream(...). I've tried to use Sol

EmbeddedSolrServer with one core and schema.xml loaded via ClassLoader, is it possible?

2010-10-20 Thread Paolo Castagna
Hi, I am trying to use EmbeddedSolrServer with just one core and I'd like to load solrconfig.xml, schema.xml and other configuration files from a jar via getResourceAsStream(...). I've tried to use SolrResourceLoader, but all my attempts failed with a RuntimeException: Can't find resource [...].

Re: Faceting and omitNorms=true

2010-10-12 Thread Paolo Castagna
Paolo Castagna wrote: Hi, I am not completely sure what the recommended settings are for fields used for faceting, regarding omitNorms and positionIncrementGap. Should I use omitNorms="true"? What about positionIncrementGap? At the moment I have this in my schema.xml:

Faceting and omitNorms=true

2010-10-12 Thread Paolo Castagna
Hi, I am not completely sure what the recommended settings are for fields used for faceting, regarding omitNorms and positionIncrementGap. Should I use omitNorms="true"? What about positionIncrementGap? At the moment I have this in my schema.xml: stored="false" multiValued="true" />

Re: facet.method: enum vs. fc

2010-10-11 Thread Paolo Castagna
terms is or 2> you start to need to finely tune your speed. HTH Erick On Mon, Oct 11, 2010 at 11:30 AM, Paolo Castagna < castagna.li...@googlemail.com> wrote: Hi, I am using Solr v1.4 and I am not sure which facet.method I should use. What should I use if I do not know in advance if t

facet.method: enum vs. fc

2010-10-11 Thread Paolo Castagna
Hi, I am using Solr v1.4 and I am not sure which facet.method I should use. What should I use if I do not know in advance if the number of values for a given field will be high or low? What are the pros/cons of using facet.method=enum vs. facet.method=fc? When should I use enum vs. fc? I have

SolrQuery and escaping special characters

2010-06-18 Thread Paolo Castagna
Hi, I am using Solr v1.4 and SolrJ on the client side. I am not sure how SolrJ behaves regarding "escaping" special characters [1] in a query string. SolrJ does URL encoding of the query string it sends to Solr. Do I need to escape special characters [1] when I construct a SolrQuery object or n
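URL encoding and query-syntax escaping are separate concerns: the first only protects the string in transit over HTTP, while the second stops characters such as `:` or `(` in user input from being read as query operators. The following is a minimal, self-contained sketch of the second step, assuming the usual Lucene query-syntax special characters; the class name `QueryEscaper` and its method are hypothetical and are not part of SolrJ. SolrJ provides `ClientUtils.escapeQueryChars` for this purpose, which (unlike this sketch) also escapes whitespace, so check it against the version you run.

```java
public final class QueryEscaper {
    // Characters treated as special by the Lucene/Solr query syntax
    // (assumption: this list mirrors the documented syntax; verify for your version).
    private static final String SPECIAL = "\\+-!():^[]\"{}~*?|&";

    private QueryEscaper() {}

    /** Prefixes every special character in a user-supplied term with a backslash. */
    public static String escape(String term) {
        StringBuilder sb = new StringBuilder(term.length() + 8);
        for (int i = 0; i < term.length(); i++) {
            char c = term.charAt(i);
            if (SPECIAL.indexOf(c) >= 0) {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Escape only the user-supplied value, not the field name or the colon
        // that separates the field from the value.
        System.out.println("title:" + escape("1+1 (draft)"));
    }
}
```

Only the value part of a clause is escaped here, so that the `field:value` separator keeps its meaning; the resulting string would then be passed to the query object as usual, and the client library handles URL encoding on top of it.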

Re: Can I use per field analyzers and dynamic fields?

2010-05-12 Thread Paolo Castagna
Chris Hostetter wrote: : However, I'd like to hear a comment on the approach of doing the parsing : using Lucene and then constructing a SolrQuery from a Lucene Query: I believe you are asking about doing this in the client code? using the Lucene QueryParser to parse a string using an analyzer,

Re: Can I use per field analyzers and dynamic fields?

2010-05-09 Thread Paolo Castagna
Hi, thank you for your reply. What you suggested is a good idea and I am probably going to follow it. However, I'd like to hear a comment on the approach of doing the parsing using Lucene and then constructing a SolrQuery from a Lucene Query: QueryParser parser = new QueryParser("", analyzer)

Re: Can I use per field analyzers and dynamic fields?

2010-05-05 Thread Paolo Castagna
On 5 May 2010 14:19, Erik Hatcher wrote: > But, I think you need to elaborate on what you're doing in your Lucene > application to know more specifically. Hi Erik, perhaps, this is another way to explain and maybe solve my issue... At query time (everything here is just an illustrative example):

Re: Can I use per field analyzers and dynamic fields?

2010-05-05 Thread Paolo Castagna
nk you need to elaborate on what you're doing in your Lucene > application to know more specifically.  A dynamic field specification in > Solr is associated with only a single field type, so you'll want to use > different dynamic field patterns for different types of fields

Can I use per field analyzers and dynamic fields?

2010-05-05 Thread Paolo Castagna
Hi, I have an existing Lucene application which I want to port to Solr. A scenario I need to support requires me to use dynamic fields with Solr, since users can add new fields at runtime. At the same time, the existing Lucene application is using a PerFieldAnalyzerWrapper in order to use differe

Re: LucidWorks Solr

2010-04-18 Thread Paolo Castagna
Thanks for asking, I am interested as well in reading the response to your questions. Paolo Andy wrote: Just wanted to know if anyone has used LucidWorks Solr. - How do you compare it to the standard Apache Solr? - the non-blocking IO of LucidWorks Solr -- is that for networking IO or disk

Re: Benchmarking Solr

2010-04-12 Thread Paolo Castagna
Shawn Heisey wrote: Anyone got a recommendation about where to put it on the wiki? There are already two related pages: - http://wiki.apache.org/solr/SolrPerformanceFactors - http://wiki.apache.org/solr/SolrPerformanceData Why not create a new page? - http://wiki.apache.org/solr/Benchm

Re: Benchmarking Solr

2010-04-12 Thread Paolo Castagna
Paolo Castagna wrote: I do not have an answer to your questions. But, I have the same issue/problem you have. Some related threads: - http://markmail.org/message/pns4dtfvt54mu3vs - http://markmail.org/message/7on6lvabsosvj7bc - http://markmail.org/message/ftz7tkd7ekhnk4bc - http

Re: Benchmarking Solr

2010-04-12 Thread Paolo Castagna
Hi, I do not have an answer to your questions. But, I have the same issue/problem you have. It would be good if the Solr community would agree on and share an approach for benchmarking Solr. Indeed, it would be good to have a benchmark for "information retrieval" systems. AFAIK there isn't one. :-/ T

Re: solr best practice to submit many documents

2010-04-07 Thread Paolo Castagna
Hi Brian, I had similar questions when I began to try and evaluate Solr. If you use Java and SolrJ you might find these useful: - http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update - http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer

Is it possible to have Lucene and Solr (or two Solr instances) pointing at the same index directory?

2010-04-07 Thread Paolo Castagna
Hi, (I know that this is probably not recommended and not a common scenario, but...) Is it possible to have an application using Lucene and a separate (i.e. different JVM) instance of Solr both pointing at the same index and read/write to the index from both applications? I am trying (separately

Re: Searching Lucene Indexes with Solr

2010-04-07 Thread Paolo Castagna
Erick Erickson wrote: It is possible but you have to take care to match Solr's schema with the structure of documents in the Lucene index. The correct field names and query-analyzers should be configured in schema.xml Is it possible to use Solr v1.4 together with a legacy Lucene (v2.1.0 and/or

Backup/restore strategies for Solr cores and "legacy" Lucene applications

2010-04-06 Thread Paolo Castagna
Hi, I have an existing web application which is using Lucene (v2.1.0 and/or v2.4.x) and which I'd like to gradually migrate to Solr. I am already using multiple cores, master/slave replication and SolrJ to re-implement current functionalities. One use case I have is: backup/restore indexes. I a