Re: Multiple schemas in the same SolrCloud ?

2013-10-11 Thread maephisto
Thanks! My only doubt is: upload a new set of configuration files to the same configuration name like so: Initial configuration: zkcli.sh -zkhost localhost:9983 -cmd upconfig -confdir conf_initial/ -confname my_custom_config and afterwards, to change it do: zkcli.sh -zkhost localhost:9983 -cmd upc

Re: Multiple schemas in the same SolrCloud ?

2013-10-11 Thread Furkan KAMACI
Here is a topic you should read it: http://lucene.472066.n3.nabble.com/Reloading-config-to-zookeeper-td4021901.html 2013/10/11 maephisto > Thanks! > My only doubt is: upload a new set of configuration files to the same > configuration name like so: > > Initial configuration: > zkcli.sh -zkhost

Solr Cloud Basic Authentification

2013-10-11 Thread maephisto
I've deployed a SolrCloud cluster in Jetty 9 using solr 4.4.0 and I would like to add some basic authentification. My question is how can I provide the credentials so that they're used in the collection API when creating a new collection or by ZK? Are there any useful docs/wiki on this topic? Than

Re: Solr Cloud Basic Authentification

2013-10-11 Thread primoz . skale
For pre 4.x Solr (aka Solr 3.x) basic authentication works fine. Check this site: http://wiki.apache.org/solr/SolrSecurity Even "master-slave replication architecture" (*not* SolrCloud) works for me. There could be some problems with *cross-shard* queries etc. though (see SOLR-1861, SOLR-3421).

Re: Please help!, Highlighting exact phrases with solr

2013-10-11 Thread Silvia Suárez
Dear Koji, Thanks a lot for your answer and Sorry about my english I tried to configure FastVectorHighlighter However, I have this error: fragCharSize(1) is too small. It must be 18 or higher. java.lang.Illegal

Re: Multiple schemas in the same SolrCloud ?

2013-10-11 Thread xinwu
Hi,kamaci. Is that means I just need to upload new config files ,and do not need to reload every node in solrCloud ,when I want to change my configurations? -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp4094279p4094908.html Sent fro

Re: Solr Cloud Basic Authentification

2013-10-11 Thread Furkan KAMACI
Here is more information about security that you can use: http://wiki.apache.org/solr/SolrSecurity 2013/10/11 maephisto > I've deployed a SolrCloud cluster in Jetty 9 using solr 4.4.0 and I would > like to add some basic authentification. > My question is how can I provide the credentials so th

Re: Solr Cloud Basic Authentification

2013-10-11 Thread maephisto
Thank you! I'm more interested in the SolrCloud architecture, with shards, shards replicas and distributed index and search. This are the features i use and would like to protect by some basic authentification. I imagine that there must be a way to have this, otherwise anybody could mess with or

Re: Please help!, Highlighting exact phrases with solr

2013-10-11 Thread Furkan KAMACI
Here is a similar question: http://search-lucene.com/m/vnMGKACGM1/%252218+or+higher.%2522&subj=FastVectorHighlighter+and+hl+fragsize+parameter+set+to+zero+causes+exception and a related fixed issue: https://issues.apache.org/jira/browse/SOLR-1268 2013/10/11 Silvia Suárez > Dear Koji, > > Than

Re: Solr Cloud Basic Authentification

2013-10-11 Thread primoz . skale
One possible solution is to "firewall" access to SolrCloud server(s). Only proxy/load-balacing servers should have unrestricted access to Solr infrastructure. Then you can implement basic/advanced authentication on the proxy/LB side. Primož From: maephisto To: solr-user@lucene.apache.

Re: Solr Cloud Basic Authentification

2013-10-11 Thread maephisto
Thank you, But I'm afraid that wiki page does not cover my topic of interest -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Cloud-Basic-Authentification-tp4094903p4094915.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud Basic Authentification

2013-10-11 Thread primoz . skale
If you want to deploy basic authentication in a way that a login is required when creating collections it is only a simple matter of constrainting a url pattern (eg. /solr/admin/collections/*). Maybe this link will help: http://stackoverflow.com/questions/5323855/jetty-webserver-security/533204

Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread yriveiro
Hi, I have some cores with lot of folder with format index.X, my question is why? The collateral effect of this are shards with 50% of size than replicas in other nodes. There is any way to delete this folders to free space? It's a bug? /Yago - Best regards -- View this message in c

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread primoz . skale
I think this is connected to replications being made? I also have quite some of them but currently I am not worried :) Primož From: yriveiro To: solr-user@lucene.apache.org Date: 11.10.2013 11:54 Subject:Cores with lot of folders with prefix index.XXX Hi, I have some c

solrnet sample

2013-10-11 Thread Kishan Parmar
i want to change the schema file of solrnet sample and want to add xml file and want to facet data so what i have to need to do in sample file??? Regards, Kishan Parmar Software Developer +91 95 100 77394 Jay Shree Krishnaa !!

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread Yago Riveiro
I have ssd's therefor my space is like gold, I can have 30% of my space waste in failed replications, or replications that are not cleaned. The question for me is if this a normal behaviour or is a bug. If is a normal behaviour I have a trouble because a ssd with more than 512G is expensive. -

Re: Please help!, Highlighting exact phrases with solr

2013-10-11 Thread Silvia Suárez
Hi, Thanks for your answer Furkan, I'm sorry, I don't understand the proposed solution... I did this: 1. eliminate hl.useHighlighter parameter 2. introduce hl.useFastVectorHighlighter However the result is the same... is something missing? Thanks a lot in advance for your help... Sil

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread primoz . skale
Do you have a lot of failed replications? Maybe those folders have something to do with this (please see the last answer at http://stackoverflow.com/questions/3145192/why-does-my-solr-slave-index-keep-growing ). If your disk space is valuable check index.properties file under data folder and try

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread Yago Riveiro
The thread that you point is about master / slave - replication, Is this issue valid on SolrCloud context? I check the index.properties and indeed the variable index=index.X point to a folder, the others can be deleted without any scary side effect? -- Yago Riveiro Sent with Sparrow (h

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread Shalin Shekhar Mangar
There are open issues related to extra index.XXX folders lying around if replication/recovery fails. See https://issues.apache.org/jira/browse/SOLR-4506 On Fri, Oct 11, 2013 at 4:06 PM, Yago Riveiro wrote: > The thread that you point is about master / slave - replication, Is this > issue valid o

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
Not so hard switching it to Oracle JDK 7u40. Just download it and change the JAVA_HOME path in /etc/default/jetty, so it's not nescessary to switch java version with update-java-alternatives The machine is 64bit :) 2013/10/11 Bill Bell > Does this work ? > I can suggest -XX:-UseLoopPredicate

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread primoz . skale
Honestly I don't know for sure if you can delete then. Maybe make a backup then delete them and see if it still works :) Replication works differently in SolrCloud world as I currently know. I don't think there are any additional index.* folders because fallback does not work in SolrCloud (some

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread primoz . skale
Thanks, I guess I was wrong after all in my last post. Primož From: Shalin Shekhar Mangar To: solr-user@lucene.apache.org Date: 11.10.2013 12:43 Subject:Re: Cores with lot of folders with prefix index.XXX There are open issues related to extra index.XXX folders lying ar

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
So the main problem was that the libs must be copied to the WEB-INF/lib directory insteed of the jetty lib/ext directory. Is the fact that you should you use WEB-INF/lib somewhere documented? 2013/10/11 Peter Schmidt > Not so hard switching it to Oracle JDK 7u40. > Just download it and change t

Re: Re: feedback on Solr 4.x LotsOfCores feature

2013-10-11 Thread Erick Erickson
bq: sharing the underlying solrconfig object the configset introduced in JIRA SOLR-4478 seems to be the solution for non-SolrCloud mode SOLR-4478 will NOT share the underlying config objects, it simply shares the underlying directory. Each core will, at least as presently envisioned, simply read t

Re: Find documents that are composed of % words

2013-10-11 Thread Erick Erickson
bq: but you cannot ask this to client. You _can_ ask this of a client. IMO you are obligated to. A gentle way to do that is say something like: "Solr doesn't do that out-of-the-box. I estimate it will take me XXX weeks to implement that in custom code. I will be unable to make progress on feature

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
I can report that jetty is running now with this options: JAVA_OPTIONS="-Djava.awt.headless=true -Dfile.encoding=UTF-8 -Xms256m -Xmx256m -XX:+UseG1GC -XX:MaxGCPauseMillis=50 -XX:+OptimizeStringConcat -XX:+UseStringCache -Dsolr.solr.home=/usr/share/solr $JAVA_OPTIONS" @Guido: I reduced the min/max

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Guido Medina
I can't tell for sure if that is documented somewhere, I did that straight forward cause of years I have been developing Java webapps, a class not found usually means that some jar/class is missing somewhere, because of all the issues I have seen with parent-child class loaders, my 1st choice i

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Guido Medina
Remember the "-server" which for Java webapps or dedicated Java services will improve things. Guido. On 11/10/13 12:26, Peter Schmidt wrote: I can report that jetty is running now with this options: JAVA_OPTIONS="-Djava.awt.headless=true -Dfile.encoding=UTF-8 -Xms256m -Xmx256m -XX:+UseG1GC -X

Re: Find documents that are composed of % words

2013-10-11 Thread shahzad73
Eric agreed Solr + Nutch solution was proposed by myself and had never used these technologies, this is first time i handle these 2. My initial response to client's requirments were to try to work out existing industry tools and then modify it according to client requirements instead of re-inve

Re: Find documents that are composed of % words

2013-10-11 Thread shahzad73
Aloke Ghoshal i'm trying to work out your equation. i am using standard scheme provided by nutch for solr and not aware of how to calculate myfieldwordcount in first query.no idea where this count will come from. is there any filter that will store number of tokens generated for a speci

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
@Guido: Itried it before and than i thought you marked just the server options Because the -sever causes a: sudo service jetty start * Starting Jetty servlet engine. jetty Invalid option -server Cannot parse command line arguments Or should i substitute server with ...? Options with -server:

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Guido Medina
It is JVM parameter, example: JAVA_OPTIONS="-Djava.awt.headless=true -Dfile.encoding=UTF-8 -server -Xms256m -Xmx256m" If you want to concatenate more JVM parameters you do it like this: JAVA_OPTIONS="-Dsolr.solr.home=/usr/share/solr $JAVA_OPTIONS" Take a good look at the format, Guido. On 11

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
Strange. When i add "-server" to the arguments, i got everytime the error on jetty startup Invalid option -server Cannot parse command line arguments 2013/10/11 Guido Medina > It is JVM parameter, example: > > JAVA_OPTIONS="-Djava.awt.**headless=true -Dfile.encoding=UTF-8 -server > -Xms256m -X

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
Oh, i got it http://stackoverflow.com/a/5273166/326905 "at least 2 cores and at least 2 GB physical memory" Until know i'm using a VM with single core and 1GB RAM. So this will be later for production :) Thank you Guido. 2013/10/11 Peter Schmidt > Strange. When i add "-server" to the argume

SolrCloud on SSL

2013-10-11 Thread Christopher Gross
I have 3 SolrCloud nodes (call them idx1, idx2, idx3), and the boxes have SSL & certs configured on them to protect the Solr Indexes. Right now, I can do queries on idx1 and it works fine. If I try to query on idx3, I get: org.apache.solr.common.SolrException: org.apache.sorl.client.solrj.SolrServ

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Guido Medina
If your single core is at 32bits use Oracle JDK 7u25 or Ubuntu Open JDK 7, the JDK 7u40 for 32bits will corrupt indexes as stated on the lucene bug report. Guido. On 11/10/13 15:13, Peter Schmidt wrote: Oh, i got it http://stackoverflow.com/a/5273166/326905 "at least 2 cores and at least 2 G

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Peter Schmidt
no it is 64bit and just a development VM. In production the solr will use multicore, also 64bit and some gb ram. 2013/10/11 Guido Medina > If your single core is at 32bits use Oracle JDK 7u25 or Ubuntu Open JDK 7, > the JDK 7u40 for 32bits will corrupt indexes as stated on the lucene bug > repo

Problems using DataImportHandler and TikaEntityProcessor

2013-10-11 Thread Peter Bleackley
Starting Solr with the command line java -Dsolr.solr.home=example-DIH/solr -jar start.jar and then trying to import some data with java -Durl=http://localhost:8983/solr/tika/update -Dtype=application/pdf -jar post.jar *.pdf fails with error SimplePostTool: WARNING: Solr returned an error

Re: Multiple schemas in the same SolrCloud ?

2013-10-11 Thread maephisto
Upload the new configuration and the use the collection API to reload you collection https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-ReloadaCollection -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp409

Re: Questions developing custom functionquery

2013-10-11 Thread JT
Hey Mikhail, Thanks for responding. Field: resourcename Field-Type: org.apache.solr.schema.TextField All 9 boxes checked (indexed, tokenized, stored). I have various other fields (including MD5-checksums) in my Schema. When I use a md5sum field (which is a field, but doesn't have spaces, forwa

Re: Problems using DataImportHandler and TikaEntityProcessor

2013-10-11 Thread Furkan KAMACI
There may be a problem with you schema. Could you send your solr logs? 2013/10/11 Peter Bleackley > Starting Solr with the command line > > > java -Dsolr.solr.home=example-DIH/**solr -jar start.jar > > > and then trying to import some data with > > java > -Durl=http://localhost:8983/**solr/tik

Re: SolrCloud on SSL

2013-10-11 Thread Shawn Heisey
On 10/11/2013 8:17 AM, Christopher Gross wrote: > I have 3 SolrCloud nodes (call them idx1, idx2, idx3), and the boxes have > SSL & certs configured on them to protect the Solr Indexes. > > Right now, I can do queries on idx1 and it works fine. > If I try to query on idx3, I get: > org.apache.solr

Re: Cores with lot of folders with prefix index.XXXXXXX

2013-10-11 Thread Shawn Heisey
On 10/11/2013 4:36 AM, Yago Riveiro wrote: > The thread that you point is about master / slave - replication, Is this > issue valid on SolrCloud context? > > I check the index.properties and indeed the variable index=index.X point > to a folder, the others can be deleted without any scary

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Guido Medina
Then I think you downloaded the wrong JDK 7 (32bits JDK?), if you are running JDK 7 64bits the -server flag should be recognized. According to the stackoverflow link you mentioned before. Guido. On 11/10/13 15:48, Peter Schmidt wrote: no it is 64bit and just a development VM. In production th

Solr Slave warning: No content recieved for file

2013-10-11 Thread Arcadius Ahouansou
Hello. We are running a master-slave solr 3.x and we are seeing more and more of this in the slave log file: *"* *Oct 10, 2013 10:17:00 PM org.apache.solr.handler.SnapPuller$FileFetcher fetchPackets* *WARNING: No content recieved for file: {name=_56l.prx, lastmodified=1381443413000, size=0}* *"*

SOLR Cloud on JBOSS

2013-10-11 Thread Branham, Jeremy [HR]
Hello - This wiki page is gone - https://wiki.apache.org/solr/SolrCloud%20using%20Jboss I have been able to configure an external instance of Zookeeper, and an instance of SOLR in JBOSS.. But I am unsure how to point my SOLR instance to the ZK instance and upload the configuration. All the exa

Re: Problems using DataImportHandler and TikaEntityProcessor

2013-10-11 Thread PeteBleackley
kamaci wrote > There may be a problem with you schema. Could you send your solr logs? > > > 2013/10/11 Peter Bleackley < > bleackleyp@.co > > > >> Starting Solr with the command line >> >> >> java -Dsolr.solr.home=example-DIH/**solr -jar start.jar >> >> >> and then trying to import some data w

Re: Solr 4.4.0 on Ubuntu 10.04 with Jetty 6.1 from package Repository

2013-10-11 Thread Shawn Heisey
On 10/11/2013 4:55 AM, Peter Schmidt wrote: > So the main problem was that the libs must be copied to the WEB-INF/lib > directory insteed of the jetty lib/ext directory. Is the fact that you > should you use WEB-INF/lib somewhere documented? Actually, jetty's lib/ext is preferred, modifying the .w

Re: SolrCloud on SSL

2013-10-11 Thread Christopher Gross
On Fri, Oct 11, 2013 at 11:08 AM, Shawn Heisey wrote: > On 10/11/2013 8:17 AM, Christopher Gross wrote: > > Is there a spot in a Solr configuration that I can set this up to use > HTTPS? > > From what I can tell, not yet. > > https://issues.apache.org/jira/browse/SOLR-3854 > https://issues.a

Re: SOLR Cloud on JBOSS

2013-10-11 Thread Shawn Heisey
On 10/11/2013 9:24 AM, Branham, Jeremy [HR] wrote: > This wiki page is gone - > https://wiki.apache.org/solr/SolrCloud%20using%20Jboss > > I have been able to configure an external instance of Zookeeper, and an > instance of SOLR in JBOSS.. > But I am unsure how to point my SOLR instance to the

Re: SolrCloud on SSL

2013-10-11 Thread Guido Medina
You could resolve that with SSH tunnels. Autossh with the right parameters works like a charm. HTH, Guido. On 11/10/13 16:08, Shawn Heisey wrote: On 10/11/2013 8:17 AM, Christopher Gross wrote: I have 3 SolrCloud nodes (call them idx1, idx2, idx3), and the boxes have SSL & certs configured o

Re: What's the purpose of the bits option in compositeId (Solr 4.5)?

2013-10-11 Thread Brett Hoerner
Thanks folks, As an update for future readers --- the problem was on my side (my logic in picking the _route_ was flawed) as expected. :) On Tue, Oct 8, 2013 at 7:35 PM, Yonik Seeley wrote: > On Tue, Oct 8, 2013 at 8:27 PM, Shawn Heisey wrote: > > There is also the "distrib=false" parameter t

Re: Problems using DataImportHandler and TikaEntityProcessor

2013-10-11 Thread Furkan KAMACI
Here is a similar conversation: http://search-lucene.com/m/GeXcg1YfgQ32/Re%253A+Solr+4.0+error+message%253A+%2522Unsupported+ContentType%253A+Content-type%253Atext%252Fxml%2522&subj=Re+Solr+4+0+error+message+Unsupported+ContentType+Content+type+text+xml+ Could you change -Dauto into -Dtype=applica

Re: Problems using DataImportHandler and TikaEntityProcessor

2013-10-11 Thread Shawn Heisey
On 10/11/2013 9:32 AM, PeteBleackley wrote: > I tried changing the options to -Dauto -Dfiletypes=pdf. This gave me a 404 > error, apparently caused by post.jar adding /extract to the end of the URL In order to use post.jar, you would need the /update/extract handler, which is not defined in the ti

Re: Question about plug-in update handler failure

2013-10-11 Thread Jack Park
Issue resolved. Not a Solr issue; a really hard to discover missing library in my installation. On Thu, Oct 10, 2013 at 7:10 PM, Jack Park wrote: > I have an "interceptor" which grabs SolrDocument instances in the > update handler chain. It feeds those documents as a JSON string out to > an agent

RE: Solr 4.4 - Master/Slave configuration - Replication Issue with Commits after deleting documents using Delete by ID

2013-10-11 Thread Akkinepalli, Bharat (ELS-CON)
Hi Otis, Thanks for the response. The log files can be found here. MasterLog : http://pastebin.com/DPLKMPcF Slave Log: http://pastebin.com/DX9sV6Jx One more point worth mentioning here is that when we issue the commit with expungeDeletes=true, then the delete by id replication is successful

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-11 Thread Utkarsh Sengar
Interestingly this URL by Jack works: 1. curl ' http://localhost/solr/prodinfo/update/csv?commit=true&f.merchantList.split=true&f.merchantList.separator=%3A&f.merchantList.encapsulator=%22&stream.contentType=text/csv&stream.file=/tmp/test.csv ' But this doesn't (i.e. it doesn't split the column):

Re: Using split in updateCSV for SolrCloud 4.4

2013-10-11 Thread Jack Krupansky
There is this note for escape: "If an escape is specified, the encapsulator is not used unless also explicitly specified since most formats use either encapsulation or escaping, not both." -- Jack Krupansky -Original Message- From: Utkarsh Sengar Sent: Friday, October 11, 2013 4:35 P

Setting SolrCloudServer collection

2013-10-11 Thread Mark
If using one static SolrCloudServer how can I add a bean to a certain collection. Do I need to update setDefaultCollection() each time? I doubt that thread safe? Thanks

Re: Solr Cloud hangs when replicating updates

2013-10-11 Thread mewmewball
Hey guys, We just hit a deadlock similar to this one on 4.5, and it seems to be related to leaked connections probably due to https://issues.apache.org/jira/browse/SOLR-4327. We're going to apply the suggested change to add "method.abort()" in the finally block and see if it fixes things. Jessica

Replace NULL with 0 while Indexing

2013-10-11 Thread keshari.prerna
Hello, One of my indexing field have NULL values and i want it to be replaces with 0 while indexing itself. So that when i search after indexing it gives me 0 instead of NULL. This is my data-config.xml and duration is the field which has null values.

Re: Setting SolrCloudServer collection

2013-10-11 Thread Mark Miller
Set the collection param per request. It only uses the default if you don't set it. - Mark On Oct 11, 2013, at 5:26 PM, Mark wrote: > If using one static SolrCloudServer how can I add a bean to a certain > collection. Do I need to update setDefaultCollection() each time? I doubt > that threa

Re: Solr's Filtering approaches

2013-10-11 Thread David Philip
Groups are pharmaceutical research expts.. User is presented with graph view, he can select some region and all the groups in that region gets included..user can modify the groups also here.. so we didn't maintain group information in same solr index but we have externalized. I looked at post filte

Storing 2 dimension array in Solr

2013-10-11 Thread David Philip
Hi, I have a 2 dimension array and want it to be persisted in solr. How can I do that? Sample case: disease1disease2 disease3 group1exist slight not found groups2 slightnot foundexist group2slight exist exist-1 not found - 2