Solr Hanging

2012-04-19 Thread Trym R. Møller
Hi I am using Solr trunk and have 7 Solr instances running with 28 leaders and 28 replicas for a single collection. After indexing a while (a couple of days) the solrs start hanging and doing a thread dump on the jvm I see blocked threads like the following: Thread 2369: (state = BLOCKED)
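For readers who want to find BLOCKED threads without reading a full dump by hand, the following is a minimal sketch using the standard `java.lang.management` API. It inspects the JVM it runs in; pointing it at a running Solr process would require a JMX connection, which is not shown here. The class and method names are illustrative only and are not taken from the thread.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.ArrayList;
import java.util.List;

public class BlockedThreads {
    /** Returns one summary line per thread currently in the BLOCKED state. */
    public static List<String> findBlocked() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        List<String> blocked = new ArrayList<>();
        // false, false: skip locked-monitor and synchronizer details,
        // which not every JVM supports.
        for (ThreadInfo info : mx.dumpAllThreads(false, false)) {
            if (info != null && info.getThreadState() == Thread.State.BLOCKED) {
                blocked.add(info.getThreadName()
                        + " waiting on " + info.getLockName()
                        + " held by " + info.getLockOwnerName());
            }
        }
        return blocked;
    }

    public static void main(String[] args) {
        for (String line : findBlocked()) {
            System.out.println(line);
        }
    }
}
```

The lock owner name is often the quickest pointer to the thread that is holding everything else up.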

How to escape “<” character in regex in Solr schema.xml?

2012-04-19 Thread smooth almonds
Using Solr 3.5.0 and in my schema.xml I'm using the following to mark the end of sentences and replace the end punctuation with a symbolic token: I'm not sure if that will even work for what I want, but first I need to solve the problem of escaping the '<' character in the first '?<=' lookbehind

Dismax request handler and Dismax query parser

2012-04-19 Thread mechravi25
Hi, If I give the search string as, "type list", I want my search to match both "type" & "list". The following search query which we are using /select/?qf=name%5e2.3+text+r_name%5e0.3+id%5e0.3+uid%5e0.3&fl=*&qf=name%5e2.3+text+r_name%5e0.3+id%5e0.3+uid%5e0.3&fl=*&qt=dismax&f.typeFacet.facet.mi
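The `qf` value in the request above is URL-encoded, which makes the field boosts hard to read. A small sketch that decodes it with the standard `java.net.URLDecoder` (the class name `DecodeQf` is only illustrative):

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

public class DecodeQf {
    /** Decodes a URL-encoded parameter value: %5e becomes '^', '+' becomes a space. */
    public static String decode(String raw) {
        try {
            return URLDecoder.decode(raw, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a compliant JVM.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // The qf value copied from the request in the post.
        System.out.println(decode("name%5e2.3+text+r_name%5e0.3+id%5e0.3+uid%5e0.3"));
    }
}
```

Decoded, the parameter reads `name^2.3 text r_name^0.3 id^0.3 uid^0.3`, that is, a list of fields with per-field boosts. Note that the request as posted passes `qf` and `fl` twice with identical values.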

# open files with SolrCloud

2012-04-19 Thread Sami Siren
I have a simple solrcloud setup from trunk with default configs; 1 shard with one replica. As few other people have reported there seems to be some kind of leak somewhere that causes the number of open files to grow over time when doing indexing. One thing that correlates with the open file count


Re: Solr Hanging

2012-04-19 Thread Yonik Seeley
On Thu, Apr 19, 2012 at 4:25 AM, "Trym R. Møller" wrote: > Hi > > I am using Solr trunk and have 7 Solr instances running with 28 leaders and > 28 replicas for a single collection. > After indexing a while (a couple of days) the solrs start hanging and doing > a thread dump on the jvm I see blocke

Re: How to escape “<” character in regex in Solr schema.xml?

2012-04-19 Thread Jeevanandam
try this one pattern="(?<=[^.!?\\s][^.!?]*(?:[.!?](?![']?\s|$)[^.!?]*)*)[.!?]+(?=\\s|$)" I tested locally, solr start perfectly. now please test with data. -Jeevanandam On 19-04-2012 9:29 am, smooth almonds wrote: Using Solr 3.5.0 and in my schema.xml I'm using the following to mark the end
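As background to the question: in any XML file, a literal `<` is not allowed inside an attribute value and is normally written as `&lt;`; the XML parser turns it back into `<` before the application sees the value. The sketch below shows this with the JDK's DOM parser. The element name `filter` and the short pattern are made up for illustration and are not the poster's configuration.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.regex.Pattern;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;

public class XmlRegexEscape {
    /** Parses a one-element XML document and returns its "pattern" attribute. */
    public static String readPattern(String xml) {
        try {
            Element root = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)))
                    .getDocumentElement();
            return root.getAttribute("pattern");
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Hypothetical element; the lookbehind's '<' is written as &lt; in the XML.
        String xml = "<filter pattern=\"(?&lt;=[.!?])\\s\"/>";
        String regex = readPattern(xml);
        System.out.println(regex);   // the decoded regex, with a real '<'
        Pattern.compile(regex);      // compiles: this lookbehind has a fixed length
    }
}
```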

Re: Solr Hanging

2012-04-19 Thread Trym R. Møller
Thanks for your answer. I am running an (older) revision of Solr from around 29/2-2012. I suspect that the thread I have included is the leader of the shard? The Solr instance I have the dump from contains more than one leader, so I don't know which shard (slice) the thread is working on.

Re: How to escape “<” character in regex in Solr schema.xml?

2012-04-19 Thread Jeevanandam
previously given pattern will solve the '<' char issue. however you will get following exception in the log Caused by: java.util.regex.PatternSyntaxException: Look-behind group does not have an obvious maximum length near index 48 (?<=[^.!?\\s][^.!?]*(?:[.!?](?![']?\s|$)[^.!?]*)*)[.!?]+(?=\\s|
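The exception quoted above says that the lookbehind needs an obvious maximum length. One possible workaround, shown here only as a sketch and not as the pattern from this thread, is to replace an unbounded quantifier inside the lookbehind with a bounded one. The bound `{1,50}`, the simplified pattern, and the `_EOS_` token are all assumptions chosen for illustration.

```java
import java.util.regex.Pattern;

public class BoundedLookbehind {
    // Lookbehind with a bounded quantifier ({1,50} is an arbitrary bound),
    // followed by end punctuation that precedes whitespace or end of input.
    private static final Pattern SENTENCE_END =
            Pattern.compile("(?<=\\w{1,50})[.!?]+(?=\\s|$)");

    /** Replaces sentence-final punctuation with a symbolic token. */
    public static String mark(String text) {
        return SENTENCE_END.matcher(text).replaceAll(" _EOS_");
    }

    public static void main(String[] args) {
        System.out.println(mark("Hi there. Bye!"));
    }
}
```

With this input the call returns `Hi there _EOS_ Bye _EOS_`. A period inside a token such as `3.5` is left alone because it is not followed by whitespace or the end of the input.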

Re: Solr file size limit?

2012-04-19 Thread Bram Rongen
Hello Shawn, Thanks very much for your answer. Yesterday I've started indexing again but this time on Solr 3.6.. Again Solr is failing around the same time, but not exactly (now the largest fdt file is 4.8G).. It's right after the moment I receive memory-errors at the Drupal side which make me su

Re: Solr file size limit?

2012-04-19 Thread Bram Rongen
I've discovered some documents are 100+MB in size.. Could this be the problem? On Thu, Apr 19, 2012 at 3:49 PM, Bram Rongen wrote: > Hello Shawn, > > Thanks very much for your answer. > > Yesterday I've started indexing again but this time on Solr 3.6.. Again > Solr is failing around the same ti

AW: Wrong categorization with DIH

2012-04-19 Thread Ramo Karahasan
Does anyone have an idea what's going wrong here? Thanks, Ramo -Original Message- From: Gora Mohanty [mailto:g...@mimirtech.com] Sent: Tuesday, 17 April 2012 11:34 To: solr-user@lucene.apache.org Subject: Re: Wrong categorization with DIH On 17 April 2012 14:47, Ramo Karahasa

Re: How sorlcloud distribute data among shards of the same cluster?

2012-04-19 Thread Boon Low
Hi, Is there any mechanism in SolrCloud for controlling how the data is distributed among the shards? For example, I'd like to create logical (standalone) shards ('A', 'B', 'C') to make up a collection ('A-C'), and be able to query either a particular shard (e.g. 'A') or the entire collection. At

Re: Large Index and OutOfMemoryError: Map failed

2012-04-19 Thread Boon Low
Hi, Also came across this error recently, while indexing with > 10 DIH processes in parallel + default index setting. The JVM grinds to a halt and throws this error. Checking the index of a core reveals thousands of files! Tuning the default autocommit from 15000ms to 90ms solved the proble

Re: Solr file size limit?

2012-04-19 Thread Shawn Heisey
On 4/19/2012 7:49 AM, Bram Rongen wrote: Yesterday I've started indexing again but this time on Solr 3.6.. Again Solr is failing around the same time, but not exactly (now the largest fdt file is 4.8G).. It's right after the moment I receive memory-errors at the Drupal side which make me suspicio

Re: Wrong categorization with DIH

2012-04-19 Thread Jeevanandam Madanagopal
Ramo - Please share DIH configuration with us. -Jeevanandam On Apr 19, 2012, at 7:46 PM, Ramo Karahasan wrote: > Does anyone have an idea what's going wrong here? > > Thanks, > Ramo > > -Original Message- > From: Gora Mohanty [mailto:g...@mimirtech.com] > Sent: Tuesday, 17.

Re: PolySearcher in Solr

2012-04-19 Thread Jeevanandam Madanagopal
Please have a look http://wiki.apache.org/solr/DistributedSearch -Jeevanandam On Apr 19, 2012, at 9:14 PM, Ramprakash Ramamoorthy wrote: > Dear all, > > > I came across this while browsing through lucy > > http://lucy.apache.org/docs/perl/Lucy/Search/PolySearcher.html > > Does solr have an

maxMergeDocs in Solr 3.6

2012-04-19 Thread Burton-West, Tom
Hello all, I'm getting ready to upgrade from Solr 3.4 to Solr 3.6 and I noticed that maxMergeDocs is no longer in the example solrconfig.xml. Has maxMergeDocs been deprecated? Or does the tieredMergePolicy ignore it? Since our Docs are about 800K or more and the setting in the old example solrco

Re: Solr with UIMA

2012-04-19 Thread dsy99
Hi Chris, Have you been able to integrate UIMA into Solr successfully? I too tried to integrate UIMA in Solr by following the instructions provided in the README, i.e. the following four steps: Step 1. I set tags in solrconfig.xml appropriately to point to the jar files. Step 2. modified my

Looking at SOLR-3221

2012-04-19 Thread Shawn Heisey
Looking at the CHANGES.txt for 3.6 so I can plan my upgrade, I have some questions about SOLR-3221. This might be a question more appropriate for the dev list, but I don't know, so I am starting here. The wiki entry on this mentions the maxConnectionsPerHost setting, but then talks about the

AW: Wrong categorization with DIH

2012-04-19 Thread Ramo Karahasan
Hi, my config is just the following: I'm doing it as described on: http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport Any ideas? Best regards, Ramo -Original Message- From: Jeevanandam Madanagopal [mailto:je...@myjeeva.com] Sent: Donner

StandardTokenizer and domain names containing digits

2012-04-19 Thread Alex Willmer
TLDR; How should I make Solr treat "ns1.define.logica.com" as a single token in the same way "ns.define.logica.com" would be? We are just starting to use Solr 3.5.0 in production and have run into a slightly surprising behaviour involving the query "ns1.define.logica.com", through an edismax ha

Re: Wrong categorization with DIH

2012-04-19 Thread Jeevanandam Madanagopal
Ramo - Are you using all the selected columns from the query? select p.title as title, p.id, p.category_id, p.pic_thumb, c.name as category, c.id as category_id from product p, category c ... I see that the following attributes 'p.id', 'p.category_id' & 'p.pic_thumb' don't have an alias defined. Pointe

Re: Date granularity

2012-04-19 Thread vybe3142
Thanks. So, I tried out the suggestions. I used the main query though (not a filter). 1. Using a DATE range and DAY does give me the desired results. Specifically, the query that I used was 2. Without a DATE range, the parser seems to reduce the date to the beginning of the day i.e. 00:00:00 and a
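The rounding described in point 2 (a timestamp reduced to 00:00:00 of its day) and the day-wide range from point 1 can be illustrated with `java.time`. This is only a sketch of the arithmetic in UTC, not of what Solr's date parser does internally; the class name and sample timestamp are made up.

```java
import java.time.Instant;
import java.time.temporal.ChronoUnit;

public class DayRange {
    /** Rounds a timestamp down to 00:00:00 UTC of the same day. */
    public static Instant startOfDay(Instant t) {
        return t.truncatedTo(ChronoUnit.DAYS);
    }

    /** Returns 00:00:00 UTC of the following day, usable as a range end. */
    public static Instant startOfNextDay(Instant t) {
        return startOfDay(t).plus(1, ChronoUnit.DAYS);
    }

    public static void main(String[] args) {
        Instant t = Instant.parse("2012-04-19T13:45:10Z");
        System.out.println(startOfDay(t));
        System.out.println(startOfNextDay(t));
    }
}
```

For the sample timestamp the two values are `2012-04-19T00:00:00Z` and `2012-04-20T00:00:00Z`. A query for a single rounded instant matches only documents stamped exactly at midnight, which is why a range between the two bounds is needed to cover the whole day.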

AW: Wrong categorization with DIH

2012-04-19 Thread Ramo Karahasan
Hi, yes, I use every one of them. Thanks for your hint... I'll have a look at this and try to configure it correctly. Thank you, Ramo -Original Message- From: Jeevanandam Madanagopal [mailto:je...@myjeeva.com] Sent: Thursday, 19 April 2012 18:42 To: solr-user@lucene.apache.org

Re: Solr with UIMA

2012-04-19 Thread Rahul Warawdekar
Hi Divakar, Try making your updateRequestProcessorChain the default. Simply add default="true" as follows and check if that works. On Thu, Apr 19, 2012 at 12:01 PM, dsy99 wrote: > Hi Chris, > Have you been able to integrate UIMA into Solr successfully? > > I too tried to integrate Uima in

RE: StandardTokenizer and domain names containing digits

2012-04-19 Thread Steven A Rowe
Hi Alex, TLDR; Try adding WordDelimiterFilter to your analyzer(s). StandardTokenizer in Lucene/Solr v3.1+ implements the Word Boundary rules from Unicode 6.0.0 Standard Annex #29, a.k.a. UAX#29. These rules don't include reco

Re: maxMergeDocs in Solr 3.6

2012-04-19 Thread Robert Muir
On Thu, Apr 19, 2012 at 11:54 AM, Burton-West, Tom wrote: > Hello all, > > I'm getting ready to upgrade from Solr 3.4 to Solr 3.6 and I noticed that > maxMergeDocs is no longer in the example solrconfig.xml. > Has maxMergeDocs been deprecated? Or does the tieredMergePolicy ignore it? its not appl

How can I use a function or fieldvalue as the default for query(subquery, default)?

2012-04-19 Thread jimtronic
Hi, For the solr function query(subquery, default) I'd like to be able to specify the value of another field or even a function as the default. For example, I might have: /solr/select?q=_val_:query("{!dismax qf=text v='solr rocks'}", product(this_field, that_field)) Is this possible? I see tha

String ordering appears different with sort vs range query

2012-04-19 Thread Cat Bieber
I'm trying to use a Solr query to find the next title in alphabetical order after a given string. The issue I'm facing is that the sort param seems to sort non-alphanumeric characters in a different order from the ordering used by a range filter in the q or fq param. I can't filter the non-alph

Re: How sorlcloud distribute data among shards of the same cluster?

2012-04-19 Thread Mark Miller
You can remove the distrib update processor and just distrib the data yourself. Eventually the hash implementation will also be pluggable I think. On Apr 19, 2012, at 10:30 AM, Boon Low wrote: > Hi, > > Is there any mechanism in SolrCloud for controlling how the data is > distributed among the

Re: Date granularity

2012-04-19 Thread vybe3142
Also, what's the performance impact of range queries vs. querying for a particular DAY (as described in my last post) when the index contains , say, 10 million docs ? If the range queries result in a significant performance hit, one option for us would be to define additional DAY fields when inde

Re: SolrCloud: Programmatically create multiple collections?

2012-04-19 Thread ravi
Hi Mark, Thanks for your response. I did manage to get one example running with 2 Solr instances, and I checked that shards are created and replicated properly. The problem that I am now facing is ZooKeeper's clusterstate. If I kill one Solr instance (which may hold one or more cores) by pres

Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-19 Thread pcrao
Hi, Any update? Thanks, PC Rao

Re: Solr UI

2012-04-19 Thread dpt9876
Hi Erik, Re this project, do you have any demos available to check it out? https://github.com/lucidimagination/Prism And will it work on standard Solr installs, or do you need a Lucid Imagination subscription? Thanks

Re: PolySearcher in Solr

2012-04-19 Thread Ramprakash Ramamoorthy
On Thu, Apr 19, 2012 at 9:21 PM, Jeevanandam Madanagopal wrote: > Please have a look > > http://wiki.apache.org/solr/DistributedSearch > > -Jeevanandam > > On Apr 19, 2012, at 9:14 PM, Ramprakash Ramamoorthy wrote: > > > Dear all, > > > > > > I came across this while browsing through lucy > > > >