Re: external zookeeper with SolrCloud

2013-06-11 Thread Mark Miller
On Jun 11, 2013, at 10:15 AM, "Joshi, Shital" wrote: > Thanks Mark. > > Looks like this bug is fixed in Solr 4.4. Do you have any date for official > release of 4.4? Looks like it might come out in a couple of weeks. > Is there any instruction available on how t

Re: index merge question

2013-06-11 Thread Mark Miller
e job. The final step is the golive step, where the indexes will be deployed to the running Solr cluster - this is what uses the core admin merge command, and if you are doing updates or adds outside of map reduce, you will face the issues we have discussed. - Mark On Jun 11, 2013, at 11:57

Re: What is Difference Between Down and Gone At Admin Cloud Page?

2013-06-12 Thread Mark Miller
it has given up or disappeared. It's not likely to make another state change without your intervention. - Mark

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Mark Miller
ldn't have expected it to be so easily hit with only 2 replicas per shard. I should be able to tell from a stack trace though. If it is that, it's on my short list to investigate (been there a long time now though - but I still hope to look at it soon). - Mark On Jun 17, 2013, at 1:4

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Mark Miller
res/methods are being used that likely causes it or makes it easier to cause. But again, the issue I know about involves threads that are not even created in the replicationFactor = 1 case, so that could be a first report afaik. - Mark On Jun 17, 2013, at 5:52 PM, Rishi Easwaran wrote: >

Re: Avoiding OOM fatal crash

2013-06-17 Thread Mark Miller
write a middle layer and put limits on user requests before making Solr requests. - Mark On Jun 17, 2013, at 4:44 PM, Manuel Le Normand wrote: > Hello again, > > After a heavy query on my index (returning 100K docs in a single query) my > JVM heap's floods and I get an JAVA

Re: Shard splitting and document routing

2013-06-18 Thread Mark Miller
No, the hash ranges are split and new docs go to both new shards. - Mark On Jun 18, 2013, at 12:25 PM, Otis Gospodnetic wrote: > Hi, > > Imagine a (common) situation where you use document routing and you > end up with 1 large shards (e.g. 1 large user with lots of docs). >

Re: shard failure, leader transition took 11s (seems high?)

2013-06-24 Thread Mark Miller
ic clean stop api you can call first - by the time jetty (or whatever container) tells Solr it's shutting down, it's too late to pull the node out gracefully. I've danced around it in the past, but have never gotten to making that clean shutdown/stop API. - Mark On Jun 24, 2013, at

Re: Waiting until we see more replicas up message??

2013-06-24 Thread Mark Miller
o see everyone we know about in the shard up to 3 or 5 min by default. Then we know all the shards participate in the leader election and the leader will end up with all updates it should have. You can lower that wait or turn it off with 0. - Mark On Jun 24, 2013, at 2:15 PM, Timothy Potter

New NearRealtime feature in Solr trunk.

2011-09-04 Thread Mark Miller
For those that would like to start diving in, I've started documenting the recent NearRealtime work (available on trunk) on the wiki: http://wiki.apache.org/solr/NearRealtimeSearch - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona

Re: StreamingUpdateSolrServer#handleError

2011-09-05 Thread Mark Miller
n't see how to keep > track of failures, i'd like to discover > which documents failed during the request. > > thanks in advance! > > -- > Leonardo S Souza - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona

Re: SolrCloud and TermsComponent

2011-09-07 Thread Mark Miller
SolrCloud? Are you using trunk? I seem to remember a bug from a long time ago on the SolrCloud branch with this...if that's it, should be fixed though. - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona

solr equivalent of "select distinct"

2011-09-09 Thread Mark juszczec
grouping? Curiously, when I use grouping like this: q=FLD1:A and FLD2:B &rows=500 &defType=edismax &indent=true &fl=FLD1, FLD2 &group=true &group.field=FLD1 &group.field=FLD2 I get 2 records as well. Has anyone dealt with mimicing "select distinct" in Solr? Any advice would be very appreciated. Mark

searching for terms containing embedded spaces

2011-09-09 Thread Mark juszczec
Hi folks I've got a field that contains 2 words separated by a single blank. What's the trick to creating a search string that contains the single blank? Mark

Re: Solr Cloud - is replication really a feature on the trunk?

2011-09-10 Thread Mark Miller
nd how they are organized in shards. >> >> >>> because you have a slightly different config between Node 1 & 2: >>> http://lucene.472066.n3.nabble.com/Replication-setup-with-SolrCloud-Zk-td2952602.html >> >> >> I have two shards, each shard having a master and a slave core. >> Cores are located so that master and slave are on different nodes. >> This protects search (but not indexing) from node failure. >> - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona

Re: solr equivalent of "select distinct"

2011-09-11 Thread Mark juszczec
ldcards to get FLD4 and FLD5. If not, I can use fl= I'm using edismax. We are also creating the query string on the fly. I suspect using SolrJ and plugging the values into a bean would be easier - or do I have that wrong? I hope the tables of example data display properly. Mark On Sun, Sep

Re: searching for terms containing embedded spaces

2011-09-11 Thread Mark juszczec
uery assembler" producing field:a+b We've also tried making it create field:a\ b The first case just does not work and I'm unsure why. The second case ends up url encoding the \ and I'm unsure if that will cause it to be used in the query or not. Mark On Sun, Sep 11, 2

Re: searching for terms containing embedded spaces

2011-09-11 Thread Mark juszczec
a followed by a single embedded space followed by a b? I'm also wondering if this is already handled by the Solr/SolrJ API and if we are making our lives more difficult by assembling the query strings ourselves. Mark > -Yonik > http://www.lucene-eurocon.com - The Lucene/Solr User Conference >

Re: searching for terms containing embedded spaces

2011-09-11 Thread Mark juszczec
That's what I thought. The problem is, its not and I am unsure what is wrong. On Sun, Sep 11, 2011 at 1:35 PM, Yonik Seeley wrote: > On Sun, Sep 11, 2011 at 1:15 PM, Mark juszczec > wrote: > > I am looking for a text string with a single, embedded space. For the > > p

Re: searching for terms containing embedded spaces

2011-09-11 Thread Mark juszczec
+ACTIVE_IND%3A1&defType=edismax&rows=500&sort=ACCOUNT_CUSTOMER_ID+asc&start=0 Mark On Sun, Sep 11, 2011 at 2:05 PM, Yonik Seeley wrote: > On Sun, Sep 11, 2011 at 1:39 PM, Mark juszczec > wrote: > > That's what I thought. The problem is, its not and I am un

Re: SolrCloud Feedback

2011-09-11 Thread Mark Miller
a reference only >>> >>> d) How are admins supposed to update configs in ZK? Install their favourite >>> ZK editor? >>> >>> e) We should perhaps not be so afraid to make ZK a requirement for Solr in >>> v4. Ideally you should interact with a 1-

Re: Solr Cloud Number of Shard Limitation?

2011-09-27 Thread Mark Miller
their own code, but we have not yet done this in trunk. What that places the upper limit at, I don't know - I imagine it takes quite a few shards before it ends up being too much of a problem - they shard by user I believe, so lot's of shards. - Mark Miller lucidimagination.com 201

Re: Solr Cloud Number of Shard Limitation?

2011-09-28 Thread Mark Miller
, read the data for those not in the list, and get your new state efficiently. - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona On Sep 28, 2011, at 10:35 PM, Jamie Johnson wrote: > Thanks Mark found the TODO in ZkStateReader.java > > // TODO: - possib

Re: SolrCloud: is there a programmatic way to create an ensemble

2011-09-29 Thread Mark Miller
gt; (Solr launched with -DzkRun). >>> >>> I would like to create an ensemble out of them. I know about -DzkHost >>> parameter, but can I achieve the same programmatically? Either with >>> SolrJ or REST API? >>> >>> Thanks, >>> Yury >>> >> - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona

Re: basic solr cloud questions

2011-09-30 Thread Mark Miller
Thanks Pulkit! I'd actually been meaning to add the post.jar commands needed to index a doc to each shard to the wiki. Waiting till I streamline a few things though. - Mark On Sep 30, 2011, at 12:35 PM, Pulkit Singhal wrote: > BTW I update the wiki with the following, hope it keeps i

Re: Automate startup/shutdown of SolrCloud Shards

2011-09-30 Thread Mark Miller
, "8983", "solr") zkController.uploadConfigDir(directory, configName); zkController.close(); - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona

Re: Automate startup/shutdown of SolrCloud Shards

2011-10-02 Thread Mark Miller
0.0.1:9983 127.0.0.1 8983 solr /home/mark/workspace/SolrCloud/solr/example/solr/conf conf1 On Fri, Sep 30, 2011 at 3:13 PM, Mark Miller wrote: > > On Sep 29, 2011, at 1:59 PM, Jamie Johnson wrote: > > > I am trying to automate the startup/shutdown of SolrCloud shards and > > have

Debugging misbehaving spellchecker search....

2011-10-03 Thread Mark Swinson
orrectly written, as I want to isolate whether or not it is my query that is at fault or my spellchecker configuration. For reference, below is the key aspects of my solr configration relating to this

Re: Determining master/slave from ZK in SolrCloud

2011-10-04 Thread Mark Miller
the problem - something then needs to read it. - Mark Miller lucidimagination.com 2011.lucene-eurocon.org | Oct 17-20 | Barcelona On Oct 4, 2011, at 10:26 AM, Jamie Johnson wrote: > Ok, so I am pretty sure this information is not available. What is > the most appropriate way to add infor

Maximum deep of nested boolean queries

2011-10-06 Thread Mark Schoy
Is there a maximum deep of nested boolean queries? For example: ((+a b -c) or (c -d) not e) and (+f +g)

Re: Maximum deep of nested boolean queries

2011-10-06 Thread Mark Schoy
Hi Otis, thanks for your answer. My queries aren't very deep. Not more than 3 or 4 times nested. I thought maybe the limit is only 2 or 3 times, but so there will be no problem. Mark 2011/10/6 Otis Gospodnetic : > Mark, > > I don't think there is any (other than just co

indexing individual terms in large database field.....

2011-10-13 Thread Mark Swinson
he closest match from within the field. However I am unsure how to index this column so that I can return a single match. Can solr support this sort of thing out of the box, or will I need to write a custom plugin? Mark http://www.bbc.co.uk/ This e-mail (and any attachments) is confidenti

Using spellcheck component with query ( q and spellcheck.q )

2011-10-20 Thread Mark Swinson
the fuzzy operator as opposed to when I don't. Regards Mark http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your s

Re: SOLR CLOUD IN TWO DIFFERENT HOSTS

2011-10-21 Thread Mark Miller
localHost = "http://"; + InetAddress.getLocalHost().getHostName(); by default. You can override it by settings host= in solr.xml on solr/cores. If you want to use a system property instead, host=${nameOfSysProp} - Mark On Oct 16, 2011, at 8:27 PM, prakash wrote: > start master nod

Re: NRT and replication

2011-10-21 Thread Mark Miller
get replicated to the rest of replicas? >> >> Nope. Index replication isn't really that compatible with NRT. >> But the new distributed indexing features we're working on will be! >> The parent issue for this effort is SOLR-2358. >> >> -Yonik >&g

Re: questions about autocommit & committing documents

2011-10-25 Thread Mark Miller
> > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/questions-about-autocommit-committing-documents-tp1582487p3450739.html > Sent from the Solr - User mailing list archive at Nabble.com. - Mark Miller lucidimagination.com

Solr Replication: relative path in confFiles Element?

2011-10-25 Thread Mark Schoy
Hi, is ist possible to define a relative path in confFile? For example: ../../x.xml If yes, to which location will the file be copied at the slave? Thanks.

case insensitive searches

2011-10-30 Thread Mark Juszczec
Hello all According to the docs, I need to use solr.LowerCaseTokenizerFactory Does anyone have any experience with it? Can anyone comment on pitfalls or things to beware of? Does anyone know of any examples I can look at? Thanks Mark

Re: SolrCloud with large synonym files

2011-11-02 Thread Mark Miller
ntually there are other ways to solve this that we may offer... Optional compression of files Store a file across multiple zk nodes transparently when size is too large - Mark Miller lucidimagination.com

Re: SOLRJ commitWithin inconsistent

2011-11-10 Thread Mark Miller
been fixed. Don't think they matched what you are describing though. - Mark Miller lucidimagination.com On Nov 2, 2011, at 11:40 PM, Vijay Sampath wrote: > Hi, > > I'm using CommitWithin for immediate commit. The response times are > inconsistent. Sometimes it's le

Re: Solr

2011-11-10 Thread Mark Miller
/SOLR-2358 - Mark Miller lucidimagination.com On Nov 4, 2011, at 9:23 AM, KARHU Toni wrote: > Hi, when is the SOLR cloud version planned to be released/stable what are > your thought of using it in a serious production environment? &g

Re: missing core name in path

2011-11-10 Thread Mark Miller
What version of Solr? Try /solr/collection1/admin/index.jsp even if it's single core. What does your solr.xml say? - Mark Miller lucidimagination.com On Oct 26, 2011, at 9:41 AM, Fred Zimmerman wrote: > It is not a multi-core setup. The solr.xml has null value for . ? > HTT

Re: Solr Test Framework

2011-11-10 Thread Mark Miller
What's the version of the source you are using? Can you send a minimum full test class demonstrating the issue instead? Makes it easier to give it a try. - Mark Miller lucidimagination.com On Nov 7, 2011, at 5:35 AM, Ronak Patel wrote: > > > Hi, > > > I am trying t

Re: overwrite=false support with SolrJ client

2011-11-10 Thread Mark Miller
rked expert. Yuck. If you already know your docs are unique, this can be much more efficient in some cases because it uses add rather than update. It's simple to do this with curl - why shouldn't it be simple with the *java* lib! - Mark Miller lucidimagination.com

Re: Out of memory, not during import or updates of the index

2011-11-10 Thread Mark Miller
tLinkedQueue.offer(ConcurrentLinkedQueue.java:273) > ... > > I presume the null pointer is a result of being out of memory. > > Should Solr possibly need more than 2GB? What else can we tune that might > reduce memory usage? - Mark Miller lucidimagination.com

Re: Using solr during optimization

2011-11-11 Thread Mark Miller
index which I would like to get reflected in > the Reader. Whats the best setup to support this. > > Thanks, > Kalika - Mark Miller lucidimagination.com

Re: Using solr during optimization

2011-11-11 Thread Mark Miller
rge factor. It defaults to 10 - lower numbers will lower the number of segments in your index, and essentially amortize the cost of an optimize. Optimize is generally only useful when you will have a mostly static index. - Mark Miller lucidimagination.com On Nov 11, 2011, at 9:12 AM, Kalika Mish

Re: Core reload vs servlet container restart

2011-11-11 Thread Mark Miller
What version of Solr? When you look at the logs, does a new SolrCore look like it comes up right away? It sounds like perhaps the old SolrCore is still serving requests - how long do you wait before trying to restart jetty? Anything interesting in the logs around that time? - Mark Miller

Re: Distributed indexing

2011-11-11 Thread Mark Miller
n other other things. - Mark Miller lucidimagination.com On Nov 11, 2011, at 2:38 PM, Otis Gospodnetic wrote: > Hi, > > I think at least a part of the question here is whether what's in JIRA is the > latest stuff or if Mark or Yonik or ... have some patches that they could pu

Re: Distributed indexing

2011-11-11 Thread Mark Miller
ate handler that works in the simple case attached to that issue, but it's got lots of limitations that we will hopefully address with the in-progress Distributed Indexing work. > > -- > Regards, > Rafał Kuć > - Mark Miller lucidimagination.com

Re: TikaEntityProcesor Exception Handling

2011-11-12 Thread Mark Miller
turday, November 12, 2011 10:08 AM > To: solr-user@lucene.apache.org > Subject: TikaEntityProcesor Exception Handling > > When indexing over 2MM documents with Solr and the TikaEntityProcessor, > the indexing fails if Tika encounters an exception with one of the > documents. How can I tell Solr to keep going and just ignore the failed > documents from the Tika Processor? > > > > Thanks. > > > > -- > > Sincerely, > > David Webb > - Mark Miller lucidimagination.com

Re: Using solr during optimization

2011-11-14 Thread Mark Miller
On Nov 14, 2011, at 8:27 AM, Isan Fulia wrote: > Hi Mark, > > In the above case , what if the index is optimized partly ie. by > specifying the max no of segments we want. > It has been observed that after optimizing(even partly optimization), the > indexing as well as

Re: To push the terms.limit parameter from the master core to all the shard cores.

2011-11-19 Thread Mark Miller
n, regardless of what the request asks for at each shard: n termsComp - Mark Miller lucidimagination.com

Re: Collection Distribution vs Replication in Solr

2011-11-23 Thread Mark Miller
say. Java replication is the path forward and what I would use. - Mark Miller lucidimagination.com

Re: To push the terms.limit parameter from the master core to all the shard cores.

2011-11-23 Thread Mark Miller
t be dynamically overridden, so it's what you want to use on your underlying shards. - Mark Miller lucidimagination.com

Re: Huge Performance: Solr distributed search

2011-11-24 Thread Mark Miller
; On Wed, Nov 23, 2011 at 4:43 PM, Dmitry Kan > wrote: > >>> If the response time from each shard shows decent figures, then > aggregator > >>> seems to be a bottleneck. Do you btw have a lot of concurrent users? > >>> > >>> On Wed, Nov 23, 2011

Re: Seek past EOF

2011-11-29 Thread Mark Miller
(SolrIndexReader.java:309) > at org.apache.lucene.search.TermQuery$TermWeight$1.add(TermQuery.java:56) > at org.apache.lucene.util.ReaderUtil$Gather.run(ReaderUtil.java:77) > at org.apache.lucene.util.ReaderUtil$Gather.run(ReaderUtil.java:82) > > > After a restart the

Re: Seek past EOF

2011-11-29 Thread Mark Miller
Also, on your master, what is the name of the index directory? Just 'index'? And are you replicating config files as well or no? On Nov 29, 2011, at 9:23 AM, Mark Miller wrote: > Does the problem index have any 0 size files in it? > > On Nov 29, 2011, at 2:54 AM, Ruben Chad

Re: Seek past EOF

2011-11-29 Thread Mark Miller
index" and it replicates the schema and a > synonyms file. > > /Ruben Chadien > > On 29 November 2011 15:29, Mark Miller wrote: > > > Also, on your master, what is the name of the index directory? Just > > 'index'? > > > > And are you re

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
distributed indexing work? > Hi Jaime - take a look at solrconfig-distrib-update.xml in solr/core/src/test-files You need to enable the update log, add an empty replication handler def, and an update chain with solr.DistributedUpdateProcessFactory in it. -- - Mark http://www.lucidimagination.com

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
ood number of partitions to start given your expected data ;) Adding more replicas is trivial though. - Mark On Thu, Dec 1, 2011 at 6:35 PM, Jamie Johnson wrote: > Another question, is there any support for repartitioning of the index > if a new shard is added? What is the recommen

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
xes (Solr also has an admin command that can do this). But I'm not sure where this fits in? - Mark > > On Thu, Dec 1, 2011 at 7:18 PM, Mark Miller wrote: >> Not yet - we don't plan on working on this until a lot of other stuff is >> working solid at this point. But so

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
node, for range greater than X/2 goto the new node. - Mark On Dec 1, 2011, at 7:44 PM, Jamie Johnson wrote: > hmmm.This doesn't sound like the hashing algorithm that's on the > branch, right? The algorithm you're mentioning sounds like there is > some logic which is able to

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
dding another partition should be a rare event if you plan carefully, and I think many will be able to handle the cost of splitting (you might even mark the replica you are splitting on so that it's not part of queries while its 'busy' splitting). - Mark On Dec 1, 2011, at 9:17 P

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
replica, and it would probably be fairly hard to juggle. On Dec 1, 2011, at 9:37 PM, Mark Miller wrote: > In this case we are still talking about moving a whole index at a time rather > than lots of little documents. You split the index into two, and then ship > one of them off. > >

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
Not sure if the design has > been thought out that far though. > > On Thu, Dec 1, 2011 at 8:15 PM, Mark Miller wrote: > > Right now lets say you have one shard - everything there hashes to range > X. > > > > Now you want to split that shard with an Index Splitter. >

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
commits tomorrow - been meaning to do it for my own convenience anyhow. Also, you want to pass the sys property numShards=1 on startup. I think it defaults to 3. That will give you one leader and one replica. - Mark On Dec 1, 2011, at 9:56 PM, Jamie Johnson wrote: > So I couldn't r

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
k if things are setup right though. - Mark On Dec 1, 2011, at 10:40 PM, Jamie Johnson wrote: > Thanks for the quick response. With that change (have not done > numShards yet) shard1 got updated. But now when executing the > following queries I get information back from both, wh

Re: Configuring the Distributed

2011-12-01 Thread Mark Miller
go to that same server or the other one depending on how it hashes. You really want to just always use the distrib update chain. I guess I don't yet understand what you are trying to test. Sent from my iPad On Dec 1, 2011, at 10:57 PM, Mark Miller wrote: > Not sure offhand - but thing

Best practise to automatically change a field value for a specific period of time

2011-12-02 Thread Mark Schoy
Hi, I have an solr index for an online shop with a field "price" which contains the standard price of a product. But in the database, the shop owner can specify a period of time with an alternative price. For example: standard price is $20.00, but 12/24/11 08:00am to 12/26/11 11:59pm = $12.59 Of

Re: Best practise to automatically change a field value for a specific period of time

2011-12-02 Thread Mark Schoy
cant be combined with field collapsing. 2011/12/2 Morten Lied Johansen : > On 02. des. 2011 12:21, Mark Schoy wrote: > > This is a problem that can be solved with grouping. > http://wiki.apache.org/solr/FieldCollapsing > > For each possible price on a product, you index a document w

Re: Configuring the Distributed

2011-12-02 Thread Mark Miller
ard leader that the doc would be forwarded to anyway. - Mark On Fri, Dec 2, 2011 at 12:09 AM, Jamie Johnson wrote: > Really just trying to do a simple add and update test, the chain > missing is just proof of my not understanding exactly how this is > supposed to work. I m

Re: Configuring the Distributed

2011-12-02 Thread Mark Miller
I can do a similar set of actions in my tests and it works fine. The only reason I could see things working like this is if it thinks you have one shard - a leader and a replica. - Mark On Dec 2, 2011, at 12:41 PM, Jamie Johnson wrote: > Glad to hear I don't need to set shards/self, but re

Re: Configuring the Distributed

2011-12-02 Thread Mark Miller
Ah, okay - you are setting the shards in solr.xml - thats still an option to force a node to a particular shard - but if you take that out, shards will be auto assigned. By the way, because of the version code, distrib deletes don't work at the moment - will get to that next week. - Mark O

Re: Configuring the Distributed

2011-12-03 Thread Mark Miller
a higher numShards? (understanding > > that index rebalancing does not happen automatically now, but > > presumably it could). > > > > On Fri, Dec 2, 2011 at 9:56 PM, Jamie Johnson wrote: > >> How does it determine the number of shards to create? How many > >

Re: Configuring the Distributed

2011-12-03 Thread Mark Miller
need the replication handler defined for recovery - when a replica goes down and then come back up, it starts buffering updates and replicates from the leader - then it applies the buffered updates and ends up current with the leader. - Mark > > On Sat, Dec 3, 2011 at 9:00 AM, Mark

Stemming - How to add tokens and dont replace the existing tokens?

2011-12-05 Thread Mark Schoy
Hi, I like to use the HunspellStemFilterFactory to improve my search results. Why isn't there an arg "inject" like in solr.PhoneticFilterFactory to add tokens instead of replacing them? I don't want to replace them, because documents with the "unstemmed" word should be more relevant. Thanks.

Re: Stemming - How to add tokens and dont replace the existing tokens?

2011-12-05 Thread Mark Schoy
Hi Marian, thanks for your answer. Using a copyField is a good idea. Mark 2011/12/5 Marian Steinbach : > Hi Mark! > > You could help yourself with creating an additional field. One field would > hold the stemmed version and the other one would hold the unstemmed > version. >

Re: Facet on a field with rows=n

2011-12-05 Thread Mark Schoy
all fields and facets, adding a filter to restrict result to those IDs collected in step 2. Mark 2011/12/5 Kashif Khan : > Hi all, > > i am looking for a solution where i want the facets to obtain based on the > paging of solr documents. > For ex:- > > say i hv a query *:* and

Re: SolrCloud and replication

2011-12-05 Thread Mark Miller
mply be forwarded to all replicas so you can also use Near Realtime and to provide better consistency. -- - Mark http://www.lucidimagination.com

Re: Continuous update on progress of "New SolrCloud Design" work

2011-12-05 Thread Mark Miller
On Mon, Dec 5, 2011 at 6:23 AM, Per Steffensen wrote: and add features What's the list of features you are looking for? -- - Mark http://www.lucidimagination.com

Testing a custom implementation of CommonsHttpSolrServer

2011-12-06 Thread Mark Swinson
n the localhost? Mark http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the informa

Re: Solr Version Upgrade issue

2011-12-06 Thread Mark Miller
s.mx.server.MBeanServerImpl.invoke(MBeanServerImpl.java:659) >at > org.jboss.web.tomcat.service.TomcatDeployer.performDeployInternal(TomcatDeployer.java:301) >at > org.jboss.web.tomcat.service.TomcatDeployer.performDeploy(TomcatDeployer.java:104) >at > org.jboss.web.AbstractWebDeployer.start(AbstractWebDeployer.java:375) >at org.jboss.web.WebModule.startModule(WebModule.java:83) > -- - Mark http://www.lucidimagination.com

Re: Lucene 4.0 Index Format

2011-12-06 Thread Mark Miller
On Tue, Dec 6, 2011 at 12:51 PM, Jamie Johnson wrote: > Does anyone know if this has been finalized yet? > It's subject to change up till release. -- - Mark http://www.lucidimagination.com

Re: Invoking an updateRequestProcessorChain from updateHandler

2011-12-06 Thread Mark Miller
here . > > Thank you, > Jan > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Invoking-an-updateRequestProcessorChain-from-updateHandler-tp3566159p3566159.html > Sent from the Solr - User mailing list archive at Nabble.com. - Mark Miller lucidimagination.com

Reducing heap space consumption for large dictionaries?

2011-12-07 Thread Mark Schoy
consumption when parsing the dictionary? I need to create several indexes and 128m for each index is too much. mark

Re: Solr 4 near real time commit

2011-12-07 Thread Mark Miller
nfiguration mean to do the commitment within 1 second, > why solr still takes 10 seconds. > > Spark > -- - Mark http://www.lucidimagination.com

Re: Solr Lucene Index Version

2011-12-07 Thread Mark Miller
e is bringing up is > going from an *unreleased* snapshot of Lucene to a later *unreleased* > snapshot of Lucene - and those types of guarantees aren't made across > snapshots like this. > > - Mark Miller lucidimagination.com

Re: Solr Lucene Index Version

2011-12-07 Thread Mark Miller
iously I'm not sure that > would work either. > > Something like this would be great too > https://issues.apache.org/jira/browse/LUCENE-3491 > > On Wed, Dec 7, 2011 at 7:48 PM, Mark Miller wrote: >> Unfortunately, I think the the only silver bullet here, for pure S

Re: Solr 4 near real time commit

2011-12-07 Thread Mark Miller
Hmmm...that sounds pretty odd... How are you measuring the commit time? You likely want to turn off any caches, as they will be expired every second, but that should not cause this... I can try and duplicate your setup tomorrow and see what i can spot. - Mark On Dec 7, 2011, at 8:13 PM, yu

Re: Solr 4 near real time commit

2011-12-08 Thread Mark Miller
Lets please keep the conversation on list. If you are using auto commit and soft auto commit, it makes no sense to be committing with solrj. The commits happen automatically. You couldn't measure them with solrj. - mark On Dec 8, 2011, at 12:44 AM, yu shen wrote: > Thanks for

Re: Solr Lucene Index Version

2011-12-08 Thread Mark Miller
On Dec 8, 2011, at 8:50 AM, Jamie Johnson wrote: > Isn't the codec stuff merged with trunk now? Robert merged this recently AFAIK. - Mark Miller lucidimagination.com

Re: Solr 4 near real time commit

2011-12-08 Thread Mark Miller
to commit from solrj and make it soft, you have to add the param softCommit = true on the update request On Thu, Dec 8, 2011 at 7:23 PM, yu shen wrote: > That sounds like a good idea. I will check my schema configuration, and see > what I can do. > > Thanks Mark and Siva for all the

Re: Solr 4 near real time commit

2011-12-08 Thread Mark Miller
ecated, but internal SolrServer code still uses it, so meh. On Dec 8, 2011, at 7:49 PM, yu shen wrote: > Hi Mark, > > My question inline. > > 2011/12/9 Mark Miller > I don't know about DataImportHandler. > > But the whole point of AutoCommit and AutoSoftCommit is

Re: performance of json vs xml?

2011-12-12 Thread Mark Miller
rst class - it just came after the XML support. -- - Mark http://www.lucidimagination.com

Using LocalParams in StatsComponent to create a price slider?

2011-12-14 Thread Mark Schoy
possible to use LocalParams (like for facets) to ignore a specific filter? Thanks. Mark

Re: SolrCloud Cores

2011-12-16 Thread Mark Miller
ld each solr instance be a separate core (i.e. > master1, master1-replica are 2 separate cores)? > At this point, its probably best/easiest to name them after the collection. -- - Mark http://www.lucidimagination.com

automatic delta imports?

2011-12-19 Thread Mark Juszczec
7;ve got multiple 3.2 instances running and this is the only one who's logs show this message. Have I turned something on accidentally? If so, what config files contain these settings? I want to turn this on for the other solr instances. Mark

Re: Release build or code for SolrCloud

2011-12-20 Thread Mark Miller
. Still some work to finish up first. - Mark On Dec 20, 2011, at 7:17 PM, Dipti Srivastava wrote: > Thanks for all responses. I got the code from the trunk. Now I will work > through rest of the steps. > Dipti > > On 12/20/11 1:58 PM, "Chris Hostetter" wrote: > >

Re: Release build or code for SolrCloud

2011-12-21 Thread Mark Miller
Y=key -jar > start.jar > ec2-user 23998 22962 0 19:25 pts/000:00:00 grep zk > --CLOUD--[ec2-user@ cloud-dev]$ > > > > Thanks! > Dipti > > On 12/21/11 10:18 AM, "Dipti Srivastava" > wrote: > > >Hi Mark, > >I built the example and dist a

<    3   4   5   6   7   8   9   10   11   12   >