Re: Multicore Issue - Server Restart

2012-05-30 Thread Sujatha Arun
Yes ,that is correct. Regards Sujatha On Tue, May 29, 2012 at 7:23 PM, lboutros wrote: > Hi Suajtha, > > each webapps has its own solr home ? > > Ludovic. > > - > Jouve > France. > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Multicore-Issue-Server-Restart-tp39865

difference between Katta and SolrCloud (replicator factor)

2012-05-30 Thread Jamel ESSOUSSI
Hi, I would like to know the difference between Katta and SolrCloud in the distributed context ? And can we specify in SolrCloud the minimum number of copies of a document maintained by the cluster (replicator factor). Best Regards -- Jamel ESSOUSSI -- View this message in context: http://luce

Clarification requested on query results.

2012-05-30 Thread Sohail Aboobaker
Hi, We have a schema that contains categories (multi valued) for different levels. Each level of categories are indexed in a different field. Each category entry contains its parent in its value. The level is separated by a "|" symbol. We are seeing following results for a faceted query: http://l

Re: Clarification requested on query results.

2012-05-30 Thread Sohail Aboobaker
Never mind, I figured it out after sending the email: http://localhost:8983/solr/select?q=level2categories:%22Artists\|Direct%20from%20the%20Artist%22&group=true&group.field=groupby&group.main=true&facet=true&group.facet=true&facet.field=level1categories&facet.field=level2categories&rows=0 I need

Re: how to reduce the result size to 2-3 lines and expand based on user interest

2012-05-30 Thread srini
Here is the correct technical term to put. "synopsis". Yes all I wanted to do is show synopsis for the text description field I have. Thanks Srini -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-reduce-the-result-size-to-2-3-lines-and-expand-based-on-user-interest-tp3

How to show Synopsis for textual description field

2012-05-30 Thread srini
HI All, I have text filed which contains description for product. When user search for any word, I would like to show the synopsis for this field( which could be 2-3 lines). Can anyone give some hint? Thanks In Advance!!! Srini -- View this message in context: http://lucene.472066.n3.nabble.co

Re: solr limits

2012-05-30 Thread Jack Krupansky
In theory, yes. A single core/index is limited by the Lucene limit of 2.14 billion documents (document number is non-negative and represented as a signed Java "int"), but you can use shards to get a multiple of that. You can put 50 to 250 million (maybe more, depending on the data) documents in

Re: how to reduce the result size to 2-3 lines and expand based on user interest

2012-05-30 Thread Jack Krupansky
The commonly used term is "snippet" or "snippet highlighting." Read: http://wiki.apache.org/solr/HighlightingParameters Try that approach. To expand, execute a fresh query for the specific document but with highlighting disabled or with highlighted parameters that provide a larger snippet siz

Re: Multicore Issue - Server Restart

2012-05-30 Thread Siva Kommuri
Hi Sujatha, Which version of Solr are you using? Best Wishes, Siva On Wed, May 30, 2012 at 12:22 AM, Sujatha Arun wrote: > Yes ,that is correct. > > Regards > Sujatha > > On Tue, May 29, 2012 at 7:23 PM, lboutros wrote: > > > Hi Suajtha, > > > > each webapps has its own solr home ? > > > > Lu

Re: Multicore Issue - Server Restart

2012-05-30 Thread Sujatha Arun
solr 1.3 Regards Sujatha On Wed, May 30, 2012 at 8:26 PM, Siva Kommuri wrote: > Hi Sujatha, > > Which version of Solr are you using? > > Best Wishes, > Siva > > On Wed, May 30, 2012 at 12:22 AM, Sujatha Arun > wrote: > > > Yes ,that is correct. > > > > Regards > > Sujatha > > > > On Tue, May 2

solr 1.3 Multicores and maxboolean clause

2012-05-30 Thread Sujatha Arun
Hello, The solrcore Wiki says that "Lucene's BooleanQuerymaxClauseCount is a static variable, making it a single value across the entire JVM. Whichever Solr core initializes last will win the setting of the solrconfig.xml's maxBooleanClauses value. Workaro

Re: Tips on creating a custom QueryCache?

2012-05-30 Thread Aaron Daubman
Hoss, : 1) Any recommendations on which best to sub-class? I'm guessing, for this > : scenario with "rare" batch puts and no evictions, I'd be looking for get > : performance. This will also be on a box with many CPUs - so I wonder if > the > : older LRUCache would be preferable? > > i suspect yo

Re: Tips on creating a custom QueryCache?

2012-05-30 Thread Walter Underwood
On May 30, 2012, at 11:44 AM, Aaron Daubman wrote: > The bigger question is: what are the parallel task > execution paths in Solr and under what conditions are they possible? I'd go with the general servlet rules, where everything is assumed to have concurrent access. wunder -- Walter Underwoo

Re: solr 1.3 Multicores and maxboolean clause

2012-05-30 Thread Jack Krupansky
As per the source code, Solr only sets the BooleanQuery clause limit on the very first core load. It ignores any the setting on subsequent core loads, including a reload of the initial core. SolrCore.java: "// only change the BooleanQuery maxClauseCount once for ALL cores..." The cores shoul

Difference between textfield and strfield

2012-05-30 Thread Gau
Hi, Can anyone explain me the basic pros and cons between textfield and strfield. I am trying to use Levenstein distance on textfield, but it seems that it can only be applied on the strfield. So my question is whats the difference between the 2 and what are the radical advantages of one over the

Re: Difference between textfield and strfield

2012-05-30 Thread Jack Krupansky
A text field is a sequence of terms that has been tokenized while a string field is a single term (although it can also be multivalued.) Punctuation and spacing is ignored for text fields. Text tends to be lowercased, stemmed, and even stop words removed. You tend to search text using a handfu

Re: Difference between textfield and strfield

2012-05-30 Thread Gau
Well the I do not have phrases for synonym expansion. So it does work well. The synonym expansion is done at query time. And since i am just searching against the first name field, tf, idf and other ranking parameters do not make sense, hence their weight has been initialized to 1. So after applyin

Re: Difference between textfield and strfield

2012-05-30 Thread Jack Krupansky
Take a look at "strdist" and see whether that does enough of what you want: See: http://wiki.apache.org/solr/FunctionQuery#strdist Some discussion here: http://lucene.472066.n3.nabble.com/can-t-use-strdist-as-functionquery-td1023390.html And these would need to be "string" fields, not "text" si

Re: Difference between textfield and strfield

2012-05-30 Thread Gau
I cannot move from textfield to strfield, since I am using synonym expansion. Is there anything we can do on textfield itself -- View this message in context: http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916p3986938.html Sent from the Solr - User mailing list

Re: Difference between textfield and strfield

2012-05-30 Thread Jack Krupansky
I should have said that you need to have both a text and string copy of the field. Make sure one of them is "stored" and do a copyField to the other field. Then your main query can be against the text field and the sort function query against the string field. -- Jack Krupansky -Original

Re: Highlight the search word in results

2012-05-30 Thread Jack Krupansky
1. Please be specific as to what is not working. Is a highlighting section returned in the results? Is the field missing? Or what? 2. hl.fragsize is in characters. Something like 300 is more realistic. 3. Your "f.DESCRIPTION.hl.snippets=5" is missing its XML tags. I don't think you need this any

Re: How to show Synopsis for textual description field

2012-05-30 Thread Lance Norskog
This is called "Document Summarization" and there is a lot of literature on the topic. None of it is implemented for Solr. All of them require some preprocessing so the summaries need to be stored during indexing. There is some natural-language processing and some math processing in most summarizer

Re: Tips on creating a custom QueryCache?

2012-05-30 Thread Chris Hostetter
: that describes concurrency in Solr. The short question is, for such a : cache, do I need to worry about concurrent access (I'm guessing that the : firstSearcher QuerySenderListener process would be : single-threaded/non-concurrent, and thus writes would never be an issue - : is this correct?) -

Re: how to read fieldValueCacheStatistics

2012-05-30 Thread Chris Hostetter
: When I read fieldValueCache statistics I have something that looks like : : item_ABC_FACET : : {field=ABC_FACET,memSize=4224,tindexSize=32,time=92,phase1=92,nTerms=0,bigTerms=0,termInstances=0,uses=11} : : : is there a doc somewhere that explains what are ...technically that's one stat, sho

Re: Solr Caches

2012-05-30 Thread Chris Hostetter
: FilterCache: ... : So if a query contains two fq params, it will create two separate entries : for each of these fq params. The value of each entry is the list of ids of : all documents across the index that match the corresponding fq param. Each : entry is independent of any other entry

Re: Logging in Embedded SolrServer - What a nightmare.

2012-05-30 Thread solruser489
For anyone else who is still having this issue, the following may help. The embedded SOLR server uses sl4j for logging, which is a facade over other logging frameworks. It achieves this by looking for a 'binding' jar for the implementation framework in the classpath. In my case I had the 'simple' b

Poll: What do you use for Solr performance monitoring?

2012-05-30 Thread Otis Gospodnetic
Hi, Super quick poll:  What do you use for Solr performance monitoring? Vote here: http://blog.sematext.com/2012/05/30/poll-what-do-you-use-for-solr-performance-monitoring/ I'm collecting data for my Berlin Buzzwords talk that will touch on Solr, so your votes will be greatly appreciated! Tha

Re: Accent Characters

2012-05-30 Thread Vicente Couto
Hello, Jack. Yeah, I'm screwed up. Well, the documents are indexed with the accents. I started a new clean solr 3.6 configuration, with as few changes as possible; I'm running two cores, one for English and another one for French. Here is where I am now: If I try to run queries by using solrJ, it

Re: Accent Characters

2012-05-30 Thread Jack Krupansky
This might be related: https://issues.apache.org/jira/browse/SOLR-443 It suggests setting an HTTP header: Content-Type: application/x-www-form-urlencoded; charset=UTF-8 -- Jack Krupansky -Original Message- From: Vicente Couto Sent: Thursday, May 31, 2012 12:52 AM To: solr-user@luce

Re: A few random questions about solr queries.

2012-05-30 Thread santamaria2
A wee bit of clarification on the 2nd question. I meant relative performance, ie. would it be much slower to facet over 20 facet.queries & 10 facet.fields compared to say, 4 facet.queries & facet.fields. I wonder if this makes sense... So... is a bump improper etiquette here? >_> -- View this mes

Re: solr 1.3 Multicores and maxboolean clause

2012-05-30 Thread Sujatha Arun
Thanks Jack . In which case the template cores would be ones that would be initialized first and we need to take care of this on template configs . Also I notices that when we remove the core1 and core0 and try to create a new webapp without any core and empty solr.xml and try to create a new c

Re: Accent Characters

2012-05-30 Thread Sami Siren
Vicente, Are you using CommonsHttpSolrServer or HttpSolrServer? If the latter then you are probably hitting this: https://issues.apache.org/jira/browse/SOLR-3375 The remedy is to use CommonshHttpSolrServer. -- Sami Siren On Thu, May 31, 2012 at 7:52 AM, Vicente Couto wrote: > Hello, Jack. > >