Re: timeAllowed in not honoring

2014-04-30 Thread Aman Tandon
Jeff -> Thanks Jeff, this discussion on JIRA is really quite helpful. Thanks for this. Shawn -> Yes, we have some plans to move to SolrCloud. Our total index size is 40GB with 11M docs, available RAM is 32GB, the allowed heap space for Solr is 14GB, and the GC tuning parameters used on our server are -XX:+U

Re: Which Lucene search syntax is faster

2014-04-30 Thread Shawn Heisey
On 4/30/2014 3:47 PM, johnmu...@aol.com wrote: > Thank you Shawn and Erick for the quick response. > > > A follow up question. > > > Based on > https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefq%28FilterQuery%29Parameter, I > see the "fl" (field lis

What are the best practices on Multiple Language support in Solr Cloud ?

2014-04-30 Thread Shamik Bandopadhyay
Hi, I'm trying to implement multiple language support in Solr Cloud (4.7). Although we have different languages in the index, we were only supporting English in terms of index and query. To provide some context, our current index size is 35 GB with close to 15 million documents. We've two shards with

Re: Which Lucene search syntax is faster

2014-04-30 Thread johnmunir
Thank you Shawn and Erick for the quick response. A follow up question. Based on https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefq%28FilterQuery%29Parameter, I see the "fl" (field list) parameter. Does this mean I can build my Lucene search s

Re: Which Lucene search syntax is faster

2014-04-30 Thread Erick Erickson
I'd add that I think you're worrying about the wrong thing. 10M documents is not very many by modern Solr standards. I rather suspect that you won't notice much difference in performance due to how you construct the query. Shawn's suggestion to use fq clauses is spot on, though. fq clauses are re-
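
To illustrate the fq suggestion, here is a minimal sketch of a request that keeps the user's terms in q and moves the doc_type restriction into a filter query. The host, core name (collection1) and the helper build_select_url are assumptions for illustration only; the field names title, owner and doc_type come from the original question.

```python
from urllib.parse import urlencode

def build_select_url(base_url, user_query, filters):
    """Build a Solr /select URL with the main query in q and each filter in its own fq."""
    params = [("q", user_query)]
    # One fq parameter per filter, so Solr can cache each filter independently.
    params += [("fq", f) for f in filters]
    params.append(("wt", "json"))
    return base_url + "/select?" + urlencode(params)

# Hypothetical host and core name; adjust to your own setup.
url = build_select_url(
    "http://localhost:8983/solr/collection1",
    "title:skyfall OR owner:skyfall",
    ["doc_type:DOC"],
)
print(url)
```

Keeping the doc_type restriction in fq rather than in q means it does not influence scoring and can be reused across different user queries.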

Re: Which Lucene search syntax is faster

2014-04-30 Thread Shawn Heisey
On 4/30/2014 2:29 PM, johnmu...@aol.com wrote: > My question is this: what Lucene search syntax will give me back results the > fastest? If my user is interested in finding data within “title” and “owner” > fields only “doc_type” “DOC”, should I build my Lucene search syntax as: > > 1) skyfall ian

Which Lucene search syntax is faster

2014-04-30 Thread johnmunir
Hi, Given the following Lucene document that I’m adding to my index (and I expect to have over 10 million of them, each with various sizes from 1 KB to 50 KB): PDF Some name Some summary Who owns this 10 1234567890 DOC Some name Some summary Who

Re: Shards don't return documents in same order

2014-04-30 Thread Erick Erickson
Hmmm, take a look at the admin/analysis page for these inputs for alphaOnlySort. If you're using the stock Solr distro, you're probably not considering the effects of patternReplaceFilterFactory, which is removing all non-letters. So these three terms reduce to: mba, mba, mbanew. You can look at the actu

Re: Denormalize or use multivalued field for nested data?

2014-04-30 Thread Anshum Gupta
Block joins could be what you're looking for if you can upgrade to 4.5+ [ https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-BlockJoinQueryParsers ] I'd recommend an upgrade but if that's not possible, replicating the parent information is the way to go. On Wed, Apr 30

Re: Denormalize or use multivalued field for nested data?

2014-04-30 Thread Erick Erickson
I think you are misunderstanding "denormalize" in this context. It still may not be what you want to do for other reasons, but the usual idea is to replicate the parent info in each of the children, so you'd have something like: doc1 = title:"iPad 64gb" description: "iPad 64gb with retina" mercha
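
As a rough sketch of the denormalization idea (parent info replicated into each child), the nested "pricing per store" structure from the original question could be flattened into one document per merchant/location pair. The function name denormalize, the id scheme, and the currency/price field names are assumptions for illustration, and the price is assumed to be a plain "USD|600"-style string.

```python
def denormalize(product_id, product):
    """Flatten nested pricing into one flat document per (merchant, location) pair."""
    docs = []
    for merchant_id, locations in product["pricing"].items():
        for location_id, price in locations.items():
            currency, amount = price.split("|")
            docs.append({
                # Hypothetical id scheme: product, merchant and location joined together.
                "id": f"{product_id}_{merchant_id}_{location_id}",
                # Parent fields are copied into every child document.
                "title": product["title"],
                "description": product["description"],
                "merchantid": merchant_id,
                "locationid": location_id,
                "currency": currency,
                "price": float(amount),
            })
    return docs

product = {
    "title": "iPad 64gb",
    "description": "iPad 64gb with retina",
    "pricing": {"64354": {"643": "USD|600"}},
}
docs = denormalize("10010137332", product)
print(docs)
```

Each resulting document can be indexed on its own, at the cost of repeating the title and description once per price entry.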

Shards don't return documents in same order

2014-04-30 Thread Francois Perron
Hi guys, I have a small SolrCloud setup (3 servers, 1 collection with 1 shard and 3 replicas). In my schema, I have an alphaOnlySort field with a copyfield. This is a part of my managed-schema: with the copyfield The problem is: I query my collection wit

Denormalize or use multivalued field for nested data?

2014-04-30 Thread Utkarsh Sengar
I have to modify a schema where I can attach nested "pricing per store" information for a product. For example: 10010137332:{ title:"iPad 64gb" description: "iPad 64gb with retina" pricing:{ merchantid64354:{ locationid643:{ "USD|600" }

Re: saving user actions on item in solr for later retrieval

2014-04-30 Thread Mikhail Khludnev
is there somebody from LucidWorks who can refer to Click Score Relevance Framework in LucidWorks Search? On Mon, Apr 28, 2014 at 10:48 PM, nolim wrote: > Hi, > We are using solr in production system for around ~500 users and we have > around ~1 queries per day. > Our user's search topics mo

Re: saving user actions on item in solr for later retrieval

2014-04-30 Thread nolim
Thank you, we will check it out. On Apr 29, 2014 9:28 PM, "iorixxx [via Lucene]" < ml-node+s472066n4133796...@n3.nabble.com> wrote: > Hi Nolim, > > Actually EFF is searchable. See my comments at the end of the page > > > https://cwiki.apache.org/confluence/display/solr/Working+with+External+Files

Re: When not to use NRTCachingDirectory and what to use instead.

2014-04-30 Thread Jeff Wartes
On 4/19/14, 6:51 AM, "Ken Krugler" wrote: > >The code I see seems to be using an FSDirectory, or is there another >layer of wrapping going on here? > >return new NRTCachingDirectory(FSDirectory.open(new File(path)), >maxMergeSizeMB, maxCachedMB); I was also curious about this subject. Not

Re: search result not correct in solr

2014-04-30 Thread Erick Erickson
Neha: You _really_ need to get familiar with the admin/analysis page in the Solr admin UI. It shows you, step-by-step, what each tokenizer and filter in your analysis chain does. It'll save you a world of pain :). Best, Erick P.S. unless you care about a bunch of really gory detail, un-check the

Re: Sorting is not correct in autosuggest

2014-04-30 Thread Erick Erickson
Please review: http://wiki.apache.org/solr/UsingMailingLists You've given us virtually no information here. Best, Erick On Wed, Apr 30, 2014 at 12:35 AM, neha sinha wrote: > Hi All > > In my auto suggest page sorting is not correct for the suggestions i am > getting. > However suggestions are

Re: Solr Server Infrastructure Config

2014-04-30 Thread Erick Erickson
Impossible to answer even if you gave much more detailed information, you need to prototype and push one of your machines until it falls over, then extrapolate. See: http://searchhub.org/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/ Best, Erick On Tue, Apr 29, 2

Re: Stemming not working with wildcard search

2014-04-30 Thread Erick Erickson
Did you re-index? And what do you get when adding &debug=query? That should show you the parsed query. Have you looked at the results of the admin/analysis page? That tool is invaluable for seeing what the actual transformations are. Best, Erick On Mon, Apr 28, 2014 at 11:41 AM, Geepalem wrote:

Re: merge shards indexes

2014-04-30 Thread Erick Erickson
Is this SolrCloud? If so you have to be quite careful to get the expected results, in fact I'm not all that sure you can and still have a consistent index. Best, Erick On Mon, Apr 28, 2014 at 5:33 AM, Dmitry Kan wrote: > Yes, according to this documentation: > https://wiki.apache.org/solr/Mergin

Re: timeAllowed in not honoring

2014-04-30 Thread Jeff Wartes
It's not just FacetComponent, here's the original feature ticket for timeAllowed: https://issues.apache.org/jira/browse/SOLR-502 As I read it, timeAllowed only limits the time spent actually getting documents, not the time spent figuring out what data to get or how. I think that means the primar

Re: timeAllowed in not honoring

2014-04-30 Thread Shawn Heisey
On 4/29/2014 11:43 PM, Aman Tandon wrote: > My heap size is 14GB and i am not using solr cloud currently, 40GB index > is replicated from master to two slaves. > > I read somewhere that it return the partial results which is computed by > the query in that specified amount of time which is define

Error initializing QueryElevationComponent

2014-04-30 Thread Geepalem
Hi Team, I am getting error "null:org.apache.solr.common.SolrException: SolrCore 'master' is not available due to init failure: Error initializing QueryElevationComponent." Please check below for configurations elevate.xml -- Schema.xml --- SolrConfig.xml ---

Re: timeAllowed in not honoring

2014-04-30 Thread Mikhail Khludnev
On Wed, Apr 30, 2014 at 2:16 PM, Aman Tandon wrote: > name="time">3337.0 > 6739.0 > Most time is spent in facet counting. FacetComponent doesn't check timeAllowed right now. You can try to experiment with facet.method=enum or even with https://issues.apache.org/jira/b
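
For anyone wanting to try the facet.method=enum experiment, a minimal sketch of the request parameters follows. The helper name faceted_query_params, the facet field name (category) and the 2000 ms budget are assumptions for illustration; timeAllowed, facet, facet.field and facet.method are standard Solr request parameters.

```python
from urllib.parse import urlencode

def faceted_query_params(query, facet_field, time_allowed_ms):
    """Return an encoded query string that sets timeAllowed and forces facet.method=enum."""
    return urlencode([
        ("q", query),
        ("timeAllowed", str(time_allowed_ms)),  # milliseconds
        ("facet", "true"),
        ("facet.field", facet_field),
        ("facet.method", "enum"),
    ])

# Hypothetical facet field and time budget.
params = faceted_query_params("*:*", "category", 2000)
print(params)
```

Comparing the timings reported with debug enabled, with and without facet.method=enum, should show whether facet counting is really where the time goes.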

RE: Problem indexing subentitienties from a multivalued field

2014-04-30 Thread Jordi Martin
Playing a bit with this dataconfig I get two different results. If thk is defined in the schema.xml, I get all the values for it indexed but the subentity thematization_keys is not processed. On the other hand, if I do not define thk in the schema.xml file, only the last value for thk is stored

Re: Problem indexing subentitienties from a multivalued field

2014-04-30 Thread Alexandre Rafalovitch
This is a little complicated. What are you getting now with this setup? Is everything else actually working? I would have thought that even --dataSource="null"-- would cause issues. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Acce

Problem indexing subentitienties from a multivalued field

2014-04-30 Thread Jordi Martin
Hi there I have a problem trying to create subentities during the data import. I have defined the following data-config ... ... ...

Re: timeAllowed in not honoring

2014-04-30 Thread Aman Tandon
Hi Salman, here is my debug query dump, please help! I am unable to find the wildcards in it. true 0 10080 884159 629472 491426 259356 259029 257193 195077 193569 179369 115356 111644 86794 80621 72815 68982 65082

Problem indexing subentitienties from a multivalued field

2014-04-30 Thread Jordi Martin
Hi there I have a problem trying to create subentities during the data import. I have defined the following data-config ... ... ...

Re: search result not correct in solr

2014-04-30 Thread Anshum Gupta
Hi Neha, There are a bunch of filters available and it wouldn't make sense to suggest anything unless we know what's the intention. As they say, if you don't know where you're going, any road will take you there. If you want the most basic cases of being able to search for standard terms in your

Re: search result not correct in solr

2014-04-30 Thread neha sinha
Hello Alex, Yes, I reindexed completely. I am new to Solr so I do not have much idea of all the filters. Can you suggest some filters which I can try? -- View this message in context: http://lucene.472066.n3.nabble.com/search-result-not-correct-in-solr-tp4133841p4133861.html Sent from the Solr - User

Sorting is not correct in autosuggest

2014-04-30 Thread neha sinha
Hi All, In my auto suggest page, sorting is not correct for the suggestions I am getting. However, the suggestions are all correct. Any guidance will be helpful -- View this message in context: http://lucene.472066.n3.nabble.com/Sorting-is-not-correct-in-autosuggest-tp4133859.html Sent from the