SOLR - Documents with large number of fields ~ 450

2013-03-20 Thread kobe.free.wo...@gmail.com
Hello All, Scenario: My data model consists of approx. 450 fields with different types of data. We want to index every field, so each record will become a single SOLR document with *450 fields*. The total number of records in the data set is *755K*. We will be using the features li

Re: Advice: solrCloud + DIH

2013-03-20 Thread Rollin.R.Ma (lab.sh04.Newegg) 41099
My result is 2000 docs/s, close to embedded Solr. It can be tuned. Yes, you can know that; you must understand shard partitioning. -- View this message in context: http://lucene.472066.n3.nabble.com/Advice-solrCloud-DIH-tp4047339p4047673.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: SolrCloud with Zookeeper ensemble : fail to restart master server

2013-03-20 Thread Rollin.R.Ma (lab.sh04.Newegg) 41099
Mark very good. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: March 19, 2013 21:57 To: solr-user@lucene.apache.org Subject: Re: SolrCloud with Zookeeper ensemble : fail to restart master server First, the bootstrap_conf and numShards should only be specified the _first_ tim

Re: SolrCloud with Zookeeper ensemble in production environment: SEVERE problems.

2013-03-20 Thread Rollin.R.Ma (lab.sh04.Newegg) 41099
Thx! -Original Message- From: Michael Della Bitta [mailto:michael.della.bi...@appinions.com] Sent: March 20, 2013 20:42 To: solr-user@lucene.apache.org Subject: Re: SolrCloud with Zookeeper ensemble in production environment: SEVERE problems. > 2. As far as I know the better SolrJ interface to index with So

Index generation using SOLR 4.2 configurations

2013-03-20 Thread aniljayanti
Hi, Presently I am generating an employee details index using a Solr 3.3 & apache-tomcat-7.0.19 setup. Now I want to try this with the new Solr 4.2 version and have created a new setup for it. d:\apache-tomcat-7.0.19\solr_4_2\collection1\ contains the directories below. 1) bin 2) conf >> solrconfig.xml,

Re: opinion: Stats over the faceting component

2013-03-20 Thread Tirthankar Chatterjee
We have done something similar. Please read http://lucene.472066.n3.nabble.com/How-to-modify-Solr-StatsComponent-to-support-stats-query-td4028991.html https://plus.google.com/101157854606139706613/posts/HmYYit3RABM If this is something you wanted. On Mar 20, 2013, at 7:08 PM, Anirudha Jadhav wro

Re: Error while starting Solr on Websphere

2013-03-20 Thread rulinma
E org.apache.solr.common.SolrException log null:java.lang.NoSuchMethodError looks like a jar that was not found; it has no relation to WebSphere. -- View this message in context: http://lucene.472066.n3.nabble.com/Error-while-starting-Solr-on-Websphere-tp4049583p4049612.html

Error while starting Solr on Websphere

2013-03-20 Thread divz80
Hi, I'm attempting to set up Solr 4.2.0 on IBM Websphere 8.5. I've deployed the solr.war and when I try to access the admin page, I get this error. *Error 503: Server is shutting down* The log file has this error: /[3/20/13 18:56:33:564 EDT] 0061 HttpClientUti I org.apache.solr.client.solrj

opinion: Stats over the faceting component

2013-03-20 Thread Anirudha Jadhav
I want to get an opinion here. Instead of having statistics as an independent component, which is always limited by faceting features (e.g. it does not support date ranges, custom ranges, pivots, etc.), why not have a parameter on the facet component to compute and return stats? e.g. facet.stats=true,f

Re: How to Run Solr Within Intellij IDEA

2013-03-20 Thread Steve Rowe
Hi kamaci, see my answer to your question in the d...@lucene.apache.org mailing list. - Steve On Mar 20, 2013, at 6:40 PM, kamaci wrote: > I want to extend Solr and forked it from github. I want to develop on it at > Intellij IDEA. I opened the project however there are many modules and I > don

SynonymFilterFactory and Punctuation

2013-03-20 Thread M W
I have been reading threads all day regarding this topic and nothing seems to work the way it says it should. :) I appreciate any and all help in this matter. Solr 4 is working perfectly in all regards with this one exception. My requirement from Solr 4 is very simple. I am storing a documen

Re: Did something change with Payloads?

2013-03-20 Thread jimtronic
Actually, this is more like the code I've got in place: http://sujitpal.blogspot.com/2011/01/payloads-with-solr.html Jim -- View this message in context: http://lucene.472066.n3.nabble.com/Did-something-change-with-Payloads-tp4049561p4049566.html

Did something change with Payloads?

2013-03-20 Thread jimtronic
I've been using Payloads through several versions of Solr including 4.0, but now they are no longer working correctly on 4.2 I had originally followed Grant's article here: http://searchhub.org/2009/08/05/getting-started-with-payloads/ I have a custom query plugin {!payload} that will return the

How to Run Solr Within Intellij IDEA

2013-03-20 Thread kamaci
I want to extend Solr and forked it from github. I want to develop on it at Intellij IDEA. I opened the project however there are many modules and I don't know which one is the main module and how can I run it? Also is there any problem to use it as Maven project or it is not supported and I should

RE: Query.toString printing binary in the output...

2013-03-20 Thread Andrew Lundgren
I have not. Just guessing, but that looks like code that walks a query and uses the schema to figure out what the types should be. That looks like the call I should be using. Any idea of how much of performance impact this has compared to just the Query.toString call (that admittedly doesn't

Re: How to share config files in SolrCloud between multiple cores(collections)

2013-03-20 Thread Shawn Heisey
On 3/20/2013 1:28 PM, Li, Qiang wrote: I just want to share the solrconfig.xml and schema.xml. As there should be differences between collections for other files, such as the DIH's configurations. I believe that SolrCloud treats each config set as a completely separate entity, with no abilit

RE: How to share config files in SolrCloud between multiple cores(collections)

2013-03-20 Thread Li, Qiang
I just want to share the solrconfig.xml and schema.xml. As there should be differences between collections for other files, such as the DIH's configurations. -Original Message- From: Mark Miller [mailto:markrmil...@gmail.com] Sent: Tuesday, March 19, 2013 11:19 AM To: solr-user@lucene.ap

Re: Facets with 5000 facet fields

2013-03-20 Thread Mark Miller
On Mar 20, 2013, at 11:29 AM, Chris Hostetter wrote: > Not true ... per segment FieldCache support is available in solr > faceting, you just have to specify facet.method=fcs (FieldCache per > Segment) Also, if you use docvalues in 4.2, Robert tells me it uses a new per seg faceting method

how to search for special characters

2013-03-20 Thread Niklas Langvig
Hi, how can I search in Solr for all documents that contain parts of the word e-mail? This seems to work: localhost:8080/document/select?q="e\-mail" But I don't want an exact match; instead I want to do localhost:8080/document/select?q=*e\-mail* But that doesn't work. How should I search fo
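The escaping used in the question above can be sketched in Python. This is a minimal illustration, not an answer from the thread: the helper names `escape_query_term` and `wildcard_contains` are hypothetical, and the character set is the list of Lucene query-syntax special characters. Whether the resulting wildcard query actually matches still depends on how the field is tokenized at index time, which the truncated message does not show.

```python
# Characters that have special meaning in the Lucene/Solr query syntax.
LUCENE_SPECIAL_CHARS = set('+-&|!(){}[]^"~*?:\\/')


def escape_query_term(term: str) -> str:
    """Backslash-escape every query-syntax character in `term` (hypothetical helper)."""
    return "".join("\\" + ch if ch in LUCENE_SPECIAL_CHARS else ch for ch in term)


def wildcard_contains(term: str) -> str:
    """Escape `term`, then wrap it in leading and trailing wildcards."""
    return "*" + escape_query_term(term) + "*"


if __name__ == "__main__":
    print(escape_query_term("e-mail"))   # the escaped term, as in q="e\-mail"
    print(wildcard_contains("e-mail"))   # the escaped term with wildcards
```

The wildcards are added after escaping so that they keep their special meaning while the hyphen loses its own.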

Re: Facets with 5000 facet fields

2013-03-20 Thread Chris Hostetter
: > I seem to recall that facet cache is not per segment so every time the : > index is updated the facet cache will need to be re-computed. : : That is correct. We haven't experimented with segment based faceting Not true ... per segment FieldCache support is available in solr faceting, you ju

Re: change default solr url /solr to /prodsolr

2013-03-20 Thread Chris Hostetter
: currently when we deploy, default url will be http://host:port/solr : : how can i change it to http://host:port/prodsolr? : : i am using jboss server. what you're asking about is the name, sometimes referred to as the "context path", of the "servlet context" for the solr application. how

Re: Extracting excerpt from solr

2013-03-20 Thread Luis
Never mind, I found a solution. I created an excerpt field in schema.xml, then used copyField with the maxChars parameter to copy the content into it, limited to the number of characters I wanted. Thanks anyway.

Re: transientCacheSize not working

2013-03-20 Thread didier deshommes
Thanks. Is there a way to pass loadOnStartup and/or transient as parameters to the core admin http api? This doesn't seem to work: curl http://localhost:8983/solr/admin/cores?action=CREATE&transient=true&name=c1 On Tue, Mar 19, 2013 at 7:29 PM, Mark Miller wrote: > I don't think SolrCloud works
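One thing worth ruling out with the curl command above: in a POSIX shell an unquoted `&` ends the command and runs it in the background, so curl would only receive the URL up to `action=CREATE`. The sketch below builds the same URL programmatically; `core_admin_url` is a hypothetical helper, and whether the CoreAdmin CREATE action honors `transient` or `loadOnStartup` is exactly the open question in this thread, so the code only shows how the parameters would be passed.

```python
from urllib.parse import urlencode


def core_admin_url(base: str, **params: str) -> str:
    """Build a CoreAdmin request URL with properly encoded parameters (hypothetical helper)."""
    return base.rstrip("/") + "/admin/cores?" + urlencode(params)


if __name__ == "__main__":
    # Parameter names are taken from the message above; their effect is not confirmed here.
    url = core_admin_url(
        "http://localhost:8983/solr",
        action="CREATE",
        name="c1",
        transient="true",
    )
    print(url)
```

When pasting the printed URL into a shell, wrap it in single quotes so the `&` characters reach curl intact.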

Re: Retrieving Term vectors

2013-03-20 Thread Sarita Nair
Thanks much, Koji. That was indeed the problem.  I did not realize that one has to do all three of these fieldType.setStoreTermVectors(true); fieldType.setStoreTermVectorPositions(true); fieldType.setStoreTermVectorOffsets(true); in order to store positions and offsets. -Sarita

Re: Bitwise operation

2013-03-20 Thread Walter Underwood
I agree. Your first step should not be trying to make Solr work the way you think it should. Try really hard to use the existing features, they are there because they solve a LOT of problems. Updates are pretty fast, really. wunder On Mar 20, 2013, at 2:36 AM, Jan Høydahl wrote: > Don't try

Re: Solr 4.1 monitoring with /solr/replication?command=details - indexVersion?

2013-03-20 Thread Rafał Radecki
Yes, this is solr 4.1. 2013/3/20 Mark Miller : > Hmm, I'd have to look, but first to make, this subject says 4.1? > > In 4.1 the slave will be ahead because it commits after installing the index. > In 4.2 it shouldn't. > > Your on? > > - Mark > > On Mar 20, 2013, at 4:03 AM, Rafał Radecki wrote:

change default solr url /solr to /prodsolr

2013-03-20 Thread vrparekh
Hello, currently when we deploy, default url will be http://host:port/solr how can i change it to http://host:port/prodsolr? i am using jboss server. -- View this message in context: http://lucene.472066.n3.nabble.com/change-default-solr-url-solr-to-prodsolr-tp4049339.html

Re: Solr 4.1 monitoring with /solr/replication?command=details - indexVersion?

2013-03-20 Thread Mark Miller
Hmm, I'd have to look, but first to make, this subject says 4.1? In 4.1 the slave will be ahead because it commits after installing the index. In 4.2 it shouldn't. Your on? - Mark On Mar 20, 2013, at 4:03 AM, Rafał Radecki wrote: > Thanks for the info. > I understand that the latest "replica

RE: Getting back highlights almost always works...

2013-03-20 Thread Van Tassell, Kristian
That was it, thank you! -Original Message- From: Koji Sekiguchi [mailto:k...@r.email.ne.jp] Sent: Tuesday, March 19, 2013 5:17 PM To: solr-user@lucene.apache.org Subject: Re: Getting back highlights almost always works... (13/03/20 6:14), Van Tassell, Kristian wrote: > ...but I'm finding

Re: SolrCloud with Zookeeper ensemble in production environment: SEVERE problems.

2013-03-20 Thread Michael Della Bitta
> 2. As far as I know the better SolrJ interface to index with SolrCloud is CloudSolrServer, not ConcurrentUpdateSolrServer. If you have many instances of CloudSolrServer and you correctly balance them with a Round Robin or something similar you'll get better performance in SolrCloud scenarios. A

Re: had query regarding the indexing and analysers

2013-03-20 Thread Rohan Thakur
Hi Jack, I was using text_en_splitting initially, but it changes my query as well. For example: if I search for the term "ace", it is taken as "ac", thus giving the split "ac" a higher score... see debug statement: "debug":{ "rawquerystring":"ace", "querystring":"ace

Re: Keeping query index until a complete reindex is done

2013-03-20 Thread Jan Høydahl
Hi, Yes. Two alternatives 1) If you just want to reindex, no big schema changes etc Just reindex everything on top of the old index. Search will still be live all the time 2) Create a new core/collection Reindex everything to that collection Use the SWAP core admin command when finished http://w

Keeping query index until a complete reindex is done

2013-03-20 Thread Silvio Hermann
Hi there, is Solr capable of keeping a query index until a complete reindex is done (within a single core)? I am thinking of something like: 1. start reindex/rebuild through a REST push (not the data import handler) 2. Solr starts indexing on a fresh index; queries are handled by the old index 3. w

Spatial Search with document score as distance between two points

2013-03-20 Thread Harshvardhan Ojha
Hi All, I am using spatial search from mobile to find offers near a given lat/long. Query: q={!func}geodist()&sfield=latlng&pt=28.635308,77.22496&sort=score+asc Using this query I get the distance in km as the score of the Solr documents. Problem: For those documents which don't have a latlng field,
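To make the score in the query above concrete, here is a small sketch of a great-circle (haversine) distance in kilometres, which is how geodist() is generally described for lat/lon fields. The function name `haversine_km` and the Earth radius constant are assumptions for illustration, so results may differ slightly from Solr's own values. The sketch does not address documents without a latlng field.

```python
import math

# Assumed mean Earth radius in km; Solr's internal constant may differ slightly.
EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in km between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


if __name__ == "__main__":
    # Distance from the pt= value in the query to a point 0.01 degrees further north.
    print(haversine_km(28.635308, 77.22496, 28.645308, 77.22496))
```

Sorting by `score asc` on such a value puts the nearest offers first, which matches the intent of the query in the message.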

Re: Bitwise operation

2013-03-20 Thread Jan Høydahl
You could implement filtering on both user and group levels: you record in the document's ACL fields which group(s) it belongs to, and when people search you find what groups they are entitled to see and add that as a filter. So if the rights for a group change, then you don't need to reindex the
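The group-filter idea described above can be sketched as a function that turns a user's groups into a filter query string. This is only an illustration under assumptions: the field name `acl_groups` and the helper `acl_filter_query` are hypothetical, and how the user's groups are looked up is left to the application.

```python
from typing import Iterable


def acl_filter_query(groups: Iterable[str], field: str = "acl_groups") -> str:
    """Build an fq-style clause matching documents tagged with any of `groups`.

    `field` is a hypothetical multi-valued field holding the groups allowed
    to see each document.
    """
    unique = sorted(set(groups))
    if not unique:
        # Refuse to build a filter that could accidentally match everything.
        raise ValueError("user belongs to no groups")
    # Quote each group as a phrase, escaping backslashes and double quotes.
    quoted = ['"' + g.replace("\\", "\\\\").replace('"', '\\"') + '"' for g in unique]
    return field + ":(" + " OR ".join(quoted) + ")"


if __name__ == "__main__":
    print(acl_filter_query(["staff", "editors"]))
```

Because the filter is computed at query time from the user's current groups, changing who belongs to a group changes the filter only, not the indexed documents.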

Re: dataimporthandler: nested query is called multiple times

2013-03-20 Thread Alexandre Rafalovitch
There was something like this on Stack Overflow: http://stackoverflow.com/questions/15164166/solr-filelistentityprocessor-is-executing-sub-entities-multiple-times Upgrading Solr helped partially, but the conclusion was not fully satisfactory. Regards, Alex. Personal blog: http://blog.outerth

dataimporthandler: nested query is called multiple times

2013-03-20 Thread patrick
hi, the dataimport-config-file i'm using with solr3.6.2 uses a nested select statement. the first query retrieves the documents while the nested one retrieves the corresponding properties. when running the dataimporthandler with the verbo

Re: Bitwise operation

2013-03-20 Thread Christopher ARZUR
Actually my goal is to integrate Zend Framework ACL in Solr. My problem mainly concerns the inheritance, if rights of a group of documents are changed, I can not go through all the documents for the group and update these.

Saravanan Chinnadurai/Actionimages is out of the office.

2013-03-20 Thread Saravanan . Chinnadurai
I will be out of the office starting 13/03/2013 and will not return until 08/04/2013. Please email to itsta...@actionimages.com for any urgent issues. Action Images is a division of Reuters Limited and your data will therefore be protected in accordance with the Reuters Group Privacy / Data P

Re: Bitwise operation

2013-03-20 Thread Jan Høydahl
Don't try to optimize something which is not a problem. This is what "everyone" does - update documents when ACLs for those documents change, even with multi-million documents. It works like a charm. Or do you have a special usecase where permissions for an average document changes several time

Re: Facets with 5000 facet fields

2013-03-20 Thread Toke Eskildsen
On Wed, 2013-03-20 at 10:12 +0100, Andy wrote: > Are you doing NRT updates? No. Startup/re-open time is around 1 minute for the Solr instance, but due to we are currently doing nightly updates only. > I seem to recall that facet cache is not per segment so every time the > index is updated the f

Re: SolrCloud with Zookeeper ensemble in production environment: SEVERE problems.

2013-03-20 Thread Luis Cappa Banda
Thank you for answering. Some notes: 1. The Java engine I've developed that wraps SolrJ 4.1 with some business logic only executes search queries, not index/update operations, so the problem is not related to concurrent updates, or something similar. 2. As far as I know the better SolrJ int

Re: Facets with 5000 facet fields

2013-03-20 Thread Andy
That's impressive performance. Are you doing NRT updates? I seem to recall that facet cache is not per segment so every time the index is updated the facet cache will need to be re-computed. And that's going to kill performance. Have you run into that problem?

Re: Bitwise operation

2013-03-20 Thread Christopher ARZUR
Hello and thank you for your answers. I'll try to explain my problem a little better: The goal is to manage ACLs via Solr without reindexing the documents on every permission change. I have hundreds of thousands of documents, users and groups and permissions (allow / denied) or each of these gr

Re: Facets with 5000 facet fields

2013-03-20 Thread Toke Eskildsen
On Wed, 2013-03-20 at 07:19 +0100, Andy wrote: > What about the case where there's only a small number of fields (a > dozen or two) but each field has hundreds of thousands or millions of > values? Would Solr be able to handle that? We do that on a daily basis at State and University Library, Denm

Re: Solr 4.1 monitoring with /solr/replication?command=details - indexVersion?

2013-03-20 Thread Rafał Radecki
Thanks for the info. I understand that the latest "replicateable" version of index may differ from the actual version of index on master/slave. But why when I use /solr/replication?command=indexversion On master: 0 0 1363263304585 4 And on slave: 0 1 1363263600323 5 Why do I get higher