Custom search component executed several times when using Zookeeper

2013-02-11 Thread jens.fosh...@evita.no
We have implemented a custom search component for SOLR which handles security. It simply adds a filter query in the prepare method. This search component is added to our search handler as the last component. The custom function retrieves from a database a list of ACLs attached to the user. When w

Searching with min and max range in solr

2013-02-11 Thread ballusethuraman
Hi, I have craeted new attribute(Year) in attribute dictionary and associated with different catentries with different values say 2000,2001,2002,2003,...2012. Now I want to search with the Year attribute with min and max range. when 2000 to 2005 is given as search condition it should fetch the cate

Re: Reverse range query

2013-02-11 Thread ballusethuraman
Hi, I have craeted new attribute(Year) in attribute dictionary and associated with different catentries with different values say 2000,2001,2002,2003,...2012. Now I want to search with the Year attribute with min and max range. when 2000 to 2005 is given as search condition it should fetch the cate

Re: SolrCloud and hardcoded 'id' field

2013-02-11 Thread Mark Miller
Doesn't sound right to me. I'd guess you heard wrong. - mark Sent from my iPhone On Feb 11, 2013, at 7:15 PM, Shawn Heisey wrote: > I have heard that SolrCloud may require the presence of a uniqeKey field > specifically named 'id' for sharding. > > Is this true? Is it still true as of Solr

Re: memory leak - multiple cores

2013-02-11 Thread Marcos Mendez
Hi Michael, Yes, we do intend to reload Solr when deploying new cores. So we deploy it, update solr.xml and then restart Solr only. So this will happen sometimes in production, but mostly testing. Which means it will be a real pain. Any way to fix this? Also, I'm running geronimo with -Xmx1024

Re: addSortField throws field not found

2013-02-11 Thread Chris Hostetter
: Subject: addSortField throws field not found : : same field name is accepted for addFacetField but throws a field not found ex : for the addSortField method. As a general rule, if you are going to ask a question about an error that you got -- you need to cut/paste the exception (verbatim) into

Re: Term Frequencies for Query Result

2013-02-11 Thread Chris Hostetter
: I am looking for a way to get the top terms for a query result. you have to elaborate on exactly what you mean ... how are you defining "top terms for a query result" ? Are you talking about hte most common terms in the entire result set of documents that match your query? or the terms fro

Re: Fwd: advice about develop AbstractSolrEventListener.

2013-02-11 Thread Chris Hostetter
: I found a solution. I am going to Configured Update Request Processors, : that I have seen in: http://wiki.apache.org/solr/UpdateRequestProcessor Sorry for the late reply, but yes -- an UpdateProcessor seems like the best place to hook in custom functionality if you need to know about ind

SolrCloud and hardcoded 'id' field

2013-02-11 Thread Shawn Heisey
I have heard that SolrCloud may require the presence of a uniqeKey field specifically named 'id' for sharding. Is this true? Is it still true as of Solr 4.2-SNAPSHOT? If not, what svn commit fixed it? If so, should I file a jira? I am not actually using SolrCloud for one index, but my worr

Re: Solr Cloud: Duplicate records while retrieving documents

2013-02-11 Thread Shawn Heisey
On 2/11/2013 12:09 PM, devb wrote: We are running a six node SOLR cloud which 3 shards and 3 replications. The version of solr cloud is 4.0.0.2012.08.06.22.50.47. We use Python PySolr client to interact with Solr. Documents that we add to solr have a unique id and it can never have duplicates. Ou

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread SUJIT PAL
Cool! Thanks for the update, this will help if I ever go all the way with UIMA and Solr. -sujit On Feb 11, 2013, at 12:13 PM, jazz wrote: > Hi Sujit, > > Thanks for your help! I moved the RoomNumberAnnotator.xml to the top level of > the jar and used the same solrconfig.xml (with the /). Now

Solr Cloud: Duplicate records while retrieving documents

2013-02-11 Thread devb
We are running a six node SOLR cloud which 3 shards and 3 replications. The version of solr cloud is 4.0.0.2012.08.06.22.50.47. We use Python PySolr client to interact with Solr. Documents that we add to solr have a unique id and it can never have duplicates. Our use case is to query the index for

Solr 4.0 is stripping XML format from RSS content field

2013-02-11 Thread eShard
Hi, I'm running solr 4.0 final with manifoldcf 1.1 and I verified via fiddler that Manifold is indeed sending the content field from a RSS feed that contains xml data However, when I query the index the content field is there with just the data; the XML structure is gone. Does anyone know how to st

Re: SolrCloud new zookeper node on different ip/ replicate between two clasters

2013-02-11 Thread mizayah
Thx Mark "The replication handler can be setup to replicate to another dc. " Erm, i dont get it. I can setup replication between two solr cloud this way or just solrcloud->solr? "You can also put nodes in both dcs" Indexing will slow rly much if I understad well solrcluoud replica and leader (re

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread jazz
Hi Sujit, Thanks for your help! I moved the RoomNumberAnnotator.xml to the top level of the jar and used the same solrconfig.xml (with the /). Now it works perfect. Best regards, Bart On 11 Feb 2013, at 20:13, SUJIT PAL wrote: > Hi Bart, > > Like I said, I didn't actually hook my UIMA stuff

Re: Problems using distributed More Like This

2013-02-11 Thread Mark Miller
Eventually, I'll get around to trying some more real world testing. Up till now, no dev seems to have a real interest in this. I have 0 need for it currently, so it's fairly low on my itch scale, but it's on my list anyhow. - Mark On Feb 11, 2013, at 12:26 PM, Shawn Heisey wrote: > SOLR-788 a

Re: SolrCloud new zookeper node on different ip/ replicate between two clasters

2013-02-11 Thread Mark Miller
The replication handler can be setup to replicate to another dc. You can also put nodes in both dcs. Both have plus and minuses vs just sending the same data to both dc's with separate clusters. Where it immediately gets difficult is that you need a quorum of zk nodes to survive if you want to c

Re: SolrCloud new zookeper node on different ip/ replicate between two clasters

2013-02-11 Thread mizayah
This is good sollution. One thing here is rly unyoing. The double indexing. Is there a way to replicate to another dc? Seams solrcloud cant use his ealier replication. Would be nice if i can replicate somehow between two soulrcloud. -- View this message in context: http://lucene.472066.n3.na

How to limit queries to specific IDs

2013-02-11 Thread Isaac Hebsh
Hi everyone. I have queries that should be bounded to a set of IDs (the uniqueKey field of my schema). My client front-end sends two Solr request: In the first one, it wants to get the top X IDs. This result should return very fast. No time to "waste" on highlighting. this is a very standard query

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread SUJIT PAL
Hi Bart, Like I said, I didn't actually hook my UIMA stuff into Solr, content and queries are annotated before they reach Solr. What you describe sounds like a classpath problem (but of course you already knew that :-)). Since I haven't actually done what you are trying to do, here are some sug

Re: SolrCloud upgrade from 4.0 to 4.1

2013-02-11 Thread o.mares
Hey does there exist a upgrade guide? Or do you simply copy all files over? If yes, how to verify if everything is in place. -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-upgrade-from-4-0-to-4-1-tp4039757p4039775.html Sent from the Solr - User mailing list archiv

Re: SolrCloud upgrade from 4.0 to 4.1

2013-02-11 Thread Mark Miller
Yonik looked into it and said the process was actually fine in his testing. After the release, we did find one issue - if you don't explicitly set the host, the host 'guess' feature has changed and may guess a different address. - Mark On Feb 11, 2013, at 1:16 PM, Shawn Heisey wrote: > I'm tr

SolrCloud upgrade from 4.0 to 4.1

2013-02-11 Thread Shawn Heisey
I'm trying to help someone in #solr on IRC. Early in the 4.1 release vote process over on the dev@l.a.o mailing list, Mark Miller mentioned that ugprading SolrCloud from 4.0 to 4.1 may not be as straightforward as the usual Solr upgrade process. Providing some detailed instructions was mentio

Re: Maximum Number of Records In Index

2013-02-11 Thread Otis Gospodnetic
We don't run one ourselves at Sematext, but know of people who do have large ES clusters, one with > 10B docs. Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Feb 11, 2013 at 8:41 AM, Mikhail Khludnev < mkhlud...@griddynamics.com> wrote: > Otis, > Do you run 4bn docs SolrCl

Re: Do I have to reindex when upgrading from solr 4.0 to 4.1?

2013-02-11 Thread Michael Della Bitta
Arkadi, That's the answer I received at Solr Bootcamp, yes. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Mon, Feb 11, 2013 at 2:23 AM, Arkadi Colson wro

Re: Can Solr analyze content and find dates and places

2013-02-11 Thread jazz
Hi Sujit and others who answered my question, I have been working on the UIMA path which seems great with the available Eclipse tooling and this: http://sujitpal.blogspot.nl/2011/03/smart-query-parsing-with-uima.html Now I worked through the UIMA tutorial of the RoomNumberAnnotator: http://uim

RE: Solr query parser, needs to call setAutoGeneratePhraseQueries(true)

2013-02-11 Thread Zhang, Lisheng
Thanks very much, it worked perfectly !! Best regards, Lisheng -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Friday, February 08, 2013 1:04 PM To: solr-user@lucene.apache.org Subject: Re: Solr query parser, needs to call setAutoGeneratePhraseQueries(true)

Problems using distributed More Like This

2013-02-11 Thread Shawn Heisey
SOLR-788 added Distributed MLT to Solr 4.1, but I have not been able to get it to work. I don't know if it's user error, which of course is very possible. If it is user error, I'd like to know what I'm doing wrong so I can fix it. I am actually using a recent checkout of Solr 4.2, not the re

Re: Maximum Number of Records In Index

2013-02-11 Thread Mikhail Khludnev
Otis, Do you run 4bn docs SolrCloud or ElasticSearch or aware of somebody who do? 10.02.2013 4:54 пользователь "Otis Gospodnetic" написал: > Exceeding 2B is no problem. But it won't happen in a single Lucene index > any time soon, so... > > Otis > Solr & ElasticSearch Support > http://sematext.c

Re: Help! How to remove shards from SolrCloud. They keep come back...

2013-02-11 Thread Rene Nederhand
Hi Mark, Thanks for you response. I did delete the data directory, but that didn't help. However, upon checking my zookeeper installation I found a "clusterstate.json" item that contained references to core data directories that didn't exist anymore. I wiped this item and it seems to work fine no

Re: [Solrj 4.0] How use JOIN

2013-02-11 Thread Roman Slavik
Hi, thanks for advice. But I need to use parent_condition and child_condition in same time. Parent condition is: (name:Thomas AND age:40) Child condition: (name:John AND age:17) join from=parent to=id So something like: (name:Thomas AND age:40) AND {!join from=parent to=id}(name:John AND age:17)

Re: Crawl Anywhere -

2013-02-11 Thread O. Klein
Yes you can run CA on different machines. In "Manage" you have to set target and engine for this to work. I've never done this, so you have to contact the developer for more details. SivaKarthik wrote > Hi All, > in our project, we need to download around millions of pages... > so is there a

Re: Crawl Anywhere -

2013-02-11 Thread Jan Høydahl
Have a look at Nutch2, it is decoupled from HDFS and can store docs in e.g. HBase or other NoSql store. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 11. feb. 2013 kl. 06:16 skrev SivaKarthik : > Dear Erick, > Thanks for ur rel

Re: Faceting on tree structure in SOLR4

2013-02-11 Thread Mikhail Khludnev
Hello, is http://wiki.apache.org/solr/HierarchicalFaceting what are you talking about? On Mon, Feb 11, 2013 at 12:42 PM, Alok Bhandari < alokomprakashbhand...@gmail.com> wrote: > > Hello, > > I have a tree data structure like > > t1 >|-t2 > |-t3 > t4 >|-t5 > > and so on . > >

Faceting on tree structure in SOLR4

2013-02-11 Thread Alok Bhandari
Hello, I have a tree data structure like t1 |-t2 |-t3 t4 |-t5 and so on . And there is no limit on tree depth as well as number of children to each node. What I want is that when I do the faceting for parent node t1 it should also include count of all of its children (t2 and

replication problems with solr4.1

2013-02-11 Thread Bernd Fehling
Hi list, after upgrading from solr4.0 to solr4.1 and running it for two weeks now it turns out that replication has problems and unpredictable results. My installation is single index 41 mio. docs / 115 GB index size / 1 master / 3 slaves. - the master builds a new index from scratch once a week