Possible Bug - MDC handling in org.apache.solr.common.util.ExecutorUtil.MDCAwareThreadPoolExecutor.execute(Runnable)

2016-01-11 Thread Konstantin Hollerith
Hi, I'm using SLF4J MDC to log additional Information in my WebApp. Some of my MDC-Parameters even include Line-Breaks. It seems, that Solr takes _all_ MDC parameters and puts them into the Thread-Name, see org.apache.solr.common.util.ExecutorUtil.MDCAwareThreadPoolExecutor.execute(Runnable). Whe

Pro and cons of using Solr Cloud vs standard Master Slave Replica

2016-01-11 Thread Gian Maria Ricci - aka Alkampfer
Hi guys, a customer need a comprehensive list of all pro and cons of using standard Master Slave replica VS using Solr Cloud. I'm interested especially in query performance consideration, because in this specific situation the rate of new documents is really slow, but the amount of data is abou

Solr has multiple log lines for single search

2016-01-11 Thread Syed Mudasseer
Hi, I have solr configured on cloud with the following details: Every collection has 3 shards and each shard consists of 3 replicas. Whenever I search for any field in solr, having faceting and highlighting query checked, then I get more than 2 search logs stored in the log file. (sometimes, it goe

Re: Bad return type exception

2016-01-11 Thread Asanka Sanjaya Herath
Hi Shawn, Thank you for your explanation. Yes, without Oozie the project runs successfully. On Mon, Jan 11, 2016 at 1:03 PM, Shawn Heisey wrote: > On 1/10/2016 11:56 PM, Asanka Sanjaya Herath wrote: > > I tried to create a solr client using following code. > > > > solrClient = new CloudSolrCl

collapse filter query

2016-01-11 Thread sara hajili
hi all i have a MLT query and i wanna to use collapse filter query. and i wanna to use collapse expand nullPolicy. in this way when i used it : {!collapse field=original_post_id nullPolicy=expand} i got my appropriate result . (in solr web UI) but in regular search handler "/select",when i used {!

Re: Pro and cons of using Solr Cloud vs standard Master Slave Replica

2016-01-11 Thread Rahul Ramesh
Please have a look at this post https://support.lucidworks.com/hc/en-us/articles/201298317-What-is-SolrCloud-And-how-does-it-compare-to-master-slave- We dont use Master slave architecture, however we use solr cloud and standalone solr for our documents. Indexing is a bit slow in cloud when compa

Re: how to search miilions of record in solr query

2016-01-11 Thread Mugeesh Husain
Thanks Erick, "You have to cache (or something) somewhere to make this work."-- Actually they are not interested to use cache mechanism. they dont need paging,they want only 10 records with 1 millions ID search in background etc. As of now i have implemented terms query parser but result perform

RE: Spellcheck response format differs between a single core and SolrCloud

2016-01-11 Thread Dyer, James
Ryan, The json response format changed for Solr 5.0. See https://issues.apache.org/jira/browse/SOLR-3029 . Is the single-core solr running a 4.x version with the cloud solr running 5.x ? If they are both on the same major version, then we have a bug. James Dyer Ingram Content Group -O

Re: Possible Bug - MDC handling in org.apache.solr.common.util.ExecutorUtil.MDCAwareThreadPoolExecutor.execute(Runnable)

2016-01-11 Thread Mark Miller
Not sure I'm onboard with the first proposed solution, but yes, I'd open a JIRA issue to discuss. - Mark On Mon, Jan 11, 2016 at 4:01 AM Konstantin Hollerith wrote: > Hi, > > I'm using SLF4J MDC to log additional Information in my WebApp. Some of my > MDC-Parameters even include Line-Breaks. >

Re: Solr has multiple log lines for single search

2016-01-11 Thread Mark Miller
Two of them are sub requests. They have params isShard=true and distrib=false. The top level user query will not have distrib or isShard because they default the other way. - Mark On Mon, Jan 11, 2016 at 6:30 AM Syed Mudasseer wrote: > Hi, > I have solr configured on cloud with the following de
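
To make the distinction above concrete, here is a minimal Python sketch that labels a logged request as a top-level query or a shard sub-request from its parameters. It assumes the parameters have already been extracted from the log line as a URL-style query string; the function name and the sample strings are illustrative only and are not taken from the original thread.

```python
from urllib.parse import parse_qs

def classify_request(param_string):
    """Label a request as 'sub-request' or 'top-level' from its parameters.

    A request counts as a sub-request only when it carries both
    isShard=true and distrib=false; a missing parameter is treated
    as the opposite value, matching the defaults described above.
    """
    params = parse_qs(param_string)
    is_shard = params.get("isShard", ["false"])[0].lower() == "true"
    distrib_off = params.get("distrib", ["true"])[0].lower() == "false"
    return "sub-request" if (is_shard and distrib_off) else "top-level"

# Hypothetical parameter strings for one user query and one per-shard request.
top = "q=title:solr&facet=true&hl=true"
sub = "q=title:solr&facet=true&distrib=false&isShard=true"

print(classify_request(top))
print(classify_request(sub))
```

Filtering a log on this label should leave one line per user search, with the remaining lines being the per-shard requests.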

WArning in SolrCloud logs

2016-01-11 Thread Gian Maria Ricci - aka Alkampfer
I’ve configured three nodes in solrcloud, everything seems ok, but in the log I see this kind of warning: SolrCloud is enabled for core xxx_shard3_replica1 but so is old-style replication. Make sure you intend this behavior, it usually indicates a mis-configuration. Master setting is true an

Re: [More Like This] Query building

2016-01-11 Thread Alessandro Benedetti
Hi guys, the patch seems fine to me. I didn't spend much more time on the code but I checked the tests and the pre-commit checks. It seems fine to me. Let me know , Cheers On 31 December 2015 at 18:40, Alessandro Benedetti wrote: > https://issues.apache.org/jira/browse/LUCENE-6954 > > First dra

Re: Spellcheck response format differs between a single core and SolrCloud

2016-01-11 Thread Ryan Yacyshyn
That solves the mystery. The single-core is running 4.10.1 and SolrCloud on 5.3.1. Thanks James. On Mon, 11 Jan 2016 at 22:24 Dyer, James wrote: > Ryan, > > The json response format changed for Solr 5.0. See > https://issues.apache.org/jira/browse/SOLR-3029 . Is the single-core > solr run

Change leader in SolrCloud

2016-01-11 Thread Gian Maria Ricci - aka Alkampfer
I've a test solrCloud installation consisting of Three CentOS machines, each one running one zookeeper node and one solr instance. I've created a collection with 3 shards and 2 replica per each shard, then, after some tests, rebooted all three machines. Due to the different reboot times probabl

Re: Change leader in SolrCloud

2016-01-11 Thread Alessandro Benedetti
This is an interesting point. Unfortunately I am not sure it is possible to configure anywhere to avoid the leaders co-location. I think zookeeper ideally assumes each solr node is on a separate machine. Curious to know if we can optimize the colocation through config. Cheers On 11 January 2016 at

Re: WArning in SolrCloud logs

2016-01-11 Thread Alessandro Benedetti
To be honest it seems to me more a wrong usage of java environment variables. Is it possible you are sending the enable master/slave config to the node ? Strictly talking about the replication request handler, it is required for SolrCloud ( there are scenarios where old style replication is still u

Re: Change leader in SolrCloud

2016-01-11 Thread Shawn Heisey
On 1/11/2016 8:45 AM, Gian Maria Ricci - aka Alkampfer wrote: > Due to the different reboot times probably, I’ve noticed that upon > reboot all three leader shards are on a single machine. I’m expecting > shard leaders to be distributed evenly between machines, because if > all shard leader are on

Re: Change leader in SolrCloud

2016-01-11 Thread Erick Erickson
Shawn is spot-on, here's a little bit of "color commentary" bq: all new documents to index will be routed to the same machine, thus indexing load is not subdivided This is something of a misconception. Indexing is always done on all nodes, leaders and replicas alike in SolrCloud. The leader is r

Re: Querying only replica's

2016-01-11 Thread Alessandro Benedetti
mmm i think there is a misconception here : On 10 January 2016 at 19:00, Robert Brown wrote: > I'm thinking more about how the external load-balancer will know if a node > is down, as to take it out the pool of active servers to even attempt > sending a query to. > This is SolrCloud responsibili

RE: Change leader in SolrCloud

2016-01-11 Thread Gian Maria Ricci - aka Alkampfer
Thanks. This raises a different question: when I index a document, it is assigned to one of the three shards based on the value of the ID field. Actually indexing a document is usually CPU and RAM intensive work to parse text, tokenize, etc. How does this work in SolrCloud? I probably incorrectly a

Re: Querying only replica's

2016-01-11 Thread Robert Brown
We won't be using SolrJ, etc. anytime soon unfortunately. We'll be using a hardware load-balancer to send requests into the cloud/pool of servers. The LB therefore needs to know when a node is down, otherwise a query wouldn't get anywhere. The solr.PingRequestHandler is what I was after.
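
As a rough illustration of the health check a load balancer could perform against the ping handler, here is a small Python sketch. It assumes the handler is registered at the conventional `/admin/ping` path of each core and that a healthy JSON response carries a top-level `"status": "OK"`; the base URL and core name are hypothetical, and the HTTP call itself is left to whatever client or load-balancer probe is in use.

```python
import json

def ping_url(base_url, core):
    """Build the ping URL for one core (path is an assumed, conventional default)."""
    return f"{base_url.rstrip('/')}/{core}/admin/ping?wt=json"

def is_healthy(http_status, body):
    """Treat a node as up only on HTTP 200 with a JSON body reporting status OK."""
    if http_status != 200:
        return False
    try:
        return json.loads(body).get("status") == "OK"
    except (ValueError, AttributeError):
        # Body was not JSON, or was JSON but not an object.
        return False

# Hypothetical node and core names.
print(ping_url("http://solr1:8983/solr/", "products"))
print(is_healthy(200, '{"responseHeader": {"status": 0}, "status": "OK"}'))
```

A hardware load balancer would normally be configured with the same URL and an expected-content match rather than running code like this; the sketch only spells out the decision rule.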

Re: Pro and cons of using Solr Cloud vs standard Master Slave Replica

2016-01-11 Thread Shawn Heisey
On 1/11/2016 4:28 AM, Gian Maria Ricci - aka Alkampfer wrote: > a customer need a comprehensive list of all pro and cons of using > standard Master Slave replica VS using Solr Cloud. I’m interested > especially in query performance consideration, because in this > specific situation the rate of new

Re: Change leader in SolrCloud

2016-01-11 Thread Erick Erickson
You have to assign the preferredLeader role first. You can do that node-by-node via ADDREPLICAPROP or have the system do it for you with BALANCESHARDUNIQUE. As I said before, in SolrCloud the leader forwards the raw document to each follower. There is no pre-processing, analysis anything else done
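
The two options named above can be sketched as Collections API request URLs. This is a minimal Python example that only builds the URLs and sends nothing; the base URL, collection, shard and replica names are hypothetical placeholders, and the parameter names reflect the Collections API as I understand it for Solr 5.x, so check them against the reference guide for your version.

```python
from urllib.parse import urlencode

def collections_api_url(base_url, params):
    """Build a Collections API URL from a dict of request parameters."""
    return f"{base_url.rstrip('/')}/admin/collections?{urlencode(params)}"

def add_preferred_leader_url(base_url, collection, shard, replica):
    """Node-by-node option: mark one replica as preferredLeader via ADDREPLICAPROP."""
    return collections_api_url(base_url, {
        "action": "ADDREPLICAPROP",
        "collection": collection,
        "shard": shard,
        "replica": replica,
        "property": "preferredLeader",
        "property.value": "true",
    })

def balance_preferred_leader_url(base_url, collection):
    """Automatic option: let Solr spread preferredLeader across nodes."""
    return collections_api_url(base_url, {
        "action": "BALANCESHARDUNIQUE",
        "collection": collection,
        "property": "preferredLeader",
    })

# Hypothetical names, for illustration only.
print(add_preferred_leader_url("http://solr1:8983/solr", "test", "shard1", "core_node1"))
print(balance_preferred_leader_url("http://solr1:8983/solr", "test"))
```

Both calls only assign the property; they mark which replica should lead each shard.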

collection configuration stored in Zoo Keeper with solrCloud

2016-01-11 Thread Jim Shi
Hi, I have question regarding collection configurations stored Zoo Keeper with solrCloud. All collection configurations are stored at Zoo Keeper. What happens if you want to restart all Zoo Keeper instances? Does the Zoo Keeper persists data on disk and can restore all configurations from disk?

Re: WArning in SolrCloud logs

2016-01-11 Thread Shawn Heisey
On 1/11/2016 8:08 AM, Gian Maria Ricci - aka Alkampfer wrote: > > I've configured three node in solrcloud, everything seems ok, but in > the log I see this kind of warning > > > > SolrCloud is enabled for core xxx_shard3_replica1 but so is old-style > replication. Make sure you intend this behavi

Solr /export handler is exporting only unique values from multivalued field?

2016-01-11 Thread Alok Bhandari
Hello , I am using solr /export handler to export search results and it is performing well. Today I faced an issue , actually there are 2 multivalued fields I am fetching lets say which holds list of items and which holds list of sellers. here I am storing information such that seller for 1st i

Re: collection configuration stored in Zoo Keeper with solrCloud

2016-01-11 Thread Shawn Heisey
On 1/11/2016 11:13 AM, Jim Shi wrote: > Hi, I have question regarding collection configurations stored Zoo Keeper > with solrCloud. > All collection configurations are stored at Zoo Keeper. What happens if you > want to restart all Zoo Keeper instances? Does the Zoo Keeper persists data > on dis

Re: Solr /export handler is exporting only unique values from multivalued field?

2016-01-11 Thread Joel Bernstein
The /export handler is using DocValues for export, which stores the multi-value fields as a sorted set. So the sorting is the expected behavior. If you have duplicates in the multi-value field this could account for the list being of different sizes. Joel Bernstein http://joelsolr.blogspot.com/
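
The effect described above can be imitated in a few lines of Python. This is only a model of sorted-set semantics using a plain `sorted(set(...))`, not Solr or Lucene code, and the field contents are made up for illustration.

```python
def as_sorted_set(values):
    """Mimic a sorted-set representation: duplicates dropped, values ordered."""
    return sorted(set(values))

# Two hypothetical parallel multi-valued fields: seller[i] sold item[i].
items = ["pen", "book", "pen"]
sellers = ["zed", "amy", "bob"]

exported_items = as_sorted_set(items)
exported_sellers = as_sorted_set(sellers)

print(exported_items)
print(exported_sellers)
```

After the round trip the two lists differ in length and their positions no longer line up, which is why position-based pairing between multi-valued fields does not survive such an export.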

Re: Solr /export handler is exporting only unique values from multivalued field?

2016-01-11 Thread Joel Bernstein
Perhaps you can achieve what you're trying to do with a prefix on the data so the sort is maintained and duplicates are not eliminated. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 11, 2016 at 1:42 PM, Joel Bernstein wrote: > The /export handler is using DocValues for export, which
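
One possible reading of the prefix suggestion is a zero-padded position prefix on every value, so that sorting reproduces the original order and no two values are equal. The sketch below is an assumption about what such a prefix could look like, not a format given in the thread; the helper names and the `_` separator are hypothetical, and `sorted(set(...))` stands in for the sorted-set storage.

```python
def add_position_prefix(values, width=4):
    """Prefix each value with its zero-padded position, e.g. '0002_pen'."""
    return [f"{i:0{width}d}_{v}" for i, v in enumerate(values)]

def strip_position_prefix(values):
    """Remove the position prefix added by add_position_prefix."""
    return [v.split("_", 1)[1] for v in values]

items = ["pen", "book", "pen"]

# Index time: store prefixed values; a sorted set now keeps order and duplicates.
stored = sorted(set(add_position_prefix(items)))

# Client side, after export: strip the prefix again.
restored = strip_position_prefix(stored)

print(stored)
print(restored)
```

The fixed width matters: without zero padding, position 10 would sort before position 2.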

Re: collection configuration stored in Zoo Keeper with solrCloud

2016-01-11 Thread Jeff Courtade
Yes, it's stored in the directories configured in zoo.cfg. Jeff Courtade M: 240.507.6116 On Jan 11, 2016 1:16 PM, "Jim Shi" wrote: > Hi, I have question regarding collection configurations stored Zoo Keeper > with solrCloud. > All collection configurations are stored at Zoo Keeper. What happens if

Re: collapse filter query

2016-01-11 Thread Joel Bernstein
Which version of Solr are you using? Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 11, 2016 at 6:39 AM, sara hajili wrote: > hi all > i have a MLT query and i wanna to use collapse filter query. > and i wanna to use collapse expand nullPolicy. > in this way when i used it : > {!colla

Re: collapse filter query

2016-01-11 Thread sara hajili
I am using solr 5.3.1 On Jan 11, 2016 10:30 PM, "Joel Bernstein" wrote: > Which version of Solr are you using? > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Mon, Jan 11, 2016 at 6:39 AM, sara hajili > wrote: > > > hi all > > i have a MLT query and i wanna to use collapse filter query

Re: collapse filter query

2016-01-11 Thread Joel Bernstein
I believe this is a bug. I think the reason this is occurring is that you have an index segment with no values at all in the collapse field. If you could create a jira ticket for this I will look at resolving the issue. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 11, 2016 at 2:03 PM,

Re: collapse filter query

2016-01-11 Thread sara hajili
Tnx. How can I create a jira ticket? On Jan 11, 2016 10:42 PM, "Joel Bernstein" wrote: > I believe this is a bug. I think the reason this is occurring is that you > have an index segment with no values at all in the collapse field. If you > could create a jira ticket for this I will look at resolv

Re: collection configuration stored in Zoo Keeper with solrCloud

2016-01-11 Thread Erick Erickson
Do be a little careful though. The sample zookeeper config that comes with an Apache install of Zookeeper defaults to storing the data in /tmp/zookeeper which is _not_ a place you want persistent data on *nix systems. Note, this is _not_ the default for embedded Zookeeper in Solr. And the othe
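
A quick way to check for the pitfall described above is to read the `dataDir` entry from zoo.cfg and flag it if it points under `/tmp`. The following Python sketch assumes the usual `key=value` layout of zoo.cfg; the sample content is illustrative and the helper names are hypothetical.

```python
def read_data_dir(zoo_cfg_text):
    """Return the dataDir value from zoo.cfg text, or None if it is not set."""
    for line in zoo_cfg_text.splitlines():
        line = line.strip()
        if line.startswith("#") or "=" not in line:
            continue  # skip comments and lines that are not key=value
        key, value = line.split("=", 1)
        if key.strip() == "dataDir":
            return value.strip()
    return None

def is_volatile_location(data_dir):
    """True when the directory is /tmp or lives under it."""
    return data_dir is not None and (data_dir == "/tmp" or data_dir.startswith("/tmp/"))

# Illustrative config resembling the sample file shipped with ZooKeeper.
sample = "tickTime=2000\n# snapshot directory\ndataDir=/tmp/zookeeper\nclientPort=2181\n"

data_dir = read_data_dir(sample)
print(data_dir, is_volatile_location(data_dir))
```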

Re: collapse filter query

2016-01-11 Thread Joel Bernstein
I'll create it later today and update this thread with the Jira number. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 11, 2016 at 2:15 PM, sara hajili wrote: > Tnx.How I can create a jira ticket? > On Jan 11, 2016 10:42 PM, "Joel Bernstein" wrote: > > > I believe this is a bug. I th

Re: collapse filter query

2016-01-11 Thread Susheel Kumar
You can go to https://issues.apache.org/jira/browse/SOLR/ and create Jira ticket after signing in. Thanks, Susheel On Mon, Jan 11, 2016 at 2:15 PM, sara hajili wrote: > Tnx.How I can create a jira ticket? > On Jan 11, 2016 10:42 PM, "Joel Bernstein" wrote: > > > I believe this is a bug. I thin

Re: Kerberos ticket not renewing when storing index on Kerberized HDFS

2016-01-11 Thread Ishan Chattopadhyaya
Not sure how reliably renewals are taken care of in the context of kerberized HDFS, but here's my 10-15 minute analysis. Seems to me that the auto renewal thread is not spawned [0]. This relies on kinit. Not sure if having a login configuration with renewTGT is sufficient (which seems to be passed

RE: WArning in SolrCloud logs

2016-01-11 Thread Gian Maria Ricci - aka Alkampfer
Actually that is a collection I've created uploading into Zookeeper a configuration I used for single node, with a replication handler activated to backup the core. I did not send any master/slave config actually, I just created the collection using collection API and the warning is immediately

multiple solr-config.xml files per core

2016-01-11 Thread techqnq
I assume distinct solr-config.xml file is allowed for every solr core, but I got suspicious based upon the data size of the core. So thought to get my facts confirmed/corrected here: Q. Can Solr Server have different/multiple solr-config.xml file per core? Use Case: - For one core solr-config.x

RE: Change leader in SolrCloud

2016-01-11 Thread Gian Maria Ricci - aka Alkampfer
Ok, this imply that if I have X replica of a shard, the document is indexed X+1 times? one for each replica plus the leader shard? It seems to me a huge wasting of resources. In a Master/slave scenario indexing takes places only on master node, then slave replicates analyzed data. -- Gian

Re: Change leader in SolrCloud

2016-01-11 Thread Shawn Heisey
On 1/11/2016 1:23 PM, Gian Maria Ricci - aka Alkampfer wrote: > Ok, this imply that if I have X replica of a shard, the document is indexed > X+1 times? one for each replica plus the leader shard? It seems to me a huge > wasting of resources. > > In a Master/slave scenario indexing takes places

Re: collapse filter query

2016-01-11 Thread Joel Bernstein
I went to go work on the issue and found it was already fixed 7 weeks ago. The bug fix is available in Solr 5.4. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 11, 2016 at 3:12 PM, Susheel Kumar wrote: > You can go to https://issues.apache.org/jira/browse/SOLR/ and create Jira > ticke

Re: Possible Bug - MDC handling in org.apache.solr.common.util.ExecutorUtil.MDCAwareThreadPoolExecutor.execute(Runnable)

2016-01-11 Thread Chris Hostetter
: Not sure I'm onboard with the first proposed solution, but yes, I'd open a : JIRA issue to discuss. we should standardize the context keys to use fully qualified (org.apache.solr.*) java class name prefixes -- just like we do with the logger names themselves. : : - Mark : : On Mon, Jan
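
The prefix idea can be illustrated with a small Python model in which the logging context is a plain dict. This is not the SLF4J API and not Solr's implementation; the key names are hypothetical, and collapsing whitespace in values is an extra assumption added here to address the line-break problem from the original report.

```python
SOLR_PREFIX = "org.apache.solr."

def solr_context_only(mdc):
    """Keep only context entries whose key carries the Solr prefix."""
    return {k: v for k, v in mdc.items() if k.startswith(SOLR_PREFIX)}

def thread_name_suffix(mdc):
    """Build a one-line thread-name suffix from Solr-owned context entries."""
    parts = []
    for key, value in sorted(solr_context_only(mdc).items()):
        short_key = key[len(SOLR_PREFIX):]
        one_line = " ".join(str(value).split())  # collapse line breaks and runs of spaces
        parts.append(f"{short_key}:{one_line}")
    return " ".join(parts)

# Hypothetical context: two Solr-owned keys plus one set by the web application.
mdc = {
    "org.apache.solr.collection": "c1",
    "org.apache.solr.shard": "shard1",
    "webapp.request": "line1\nline2",
}

print(thread_name_suffix(mdc))
```

With a namespace on the keys, entries set by the surrounding web application never reach the thread name, whatever they contain.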

indexing rich data with solr 5.3

2016-01-11 Thread kostali hassan
Files such as MS Word and PDF are not being indexed using dataimport. I have this error: Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to read content Processing Document # 2 at org.apache.solr.handl

Problems using MapReduceIndexerTool with multiple reducers

2016-01-11 Thread Douglas Rapp
Hello, I am using Solr 4.10.4 in SolrCloud mode, but so far with only a single instance (so just a single shard - not very cloud-like..). I have been experimenting using the MapReduceIndexerTool to handle batch indexing of CSV files in HDFS. I got it working on a weaker single-node Hadoop test sy

Re: Problems using MapReduceIndexerTool with multiple reducers

2016-01-11 Thread Erick Erickson
Hmm, it looks like you created your collection with the "implicit" router. Does the same thing happen when you use the default compositeId router? Note, this should be OK with either, this is just to gather more info. Other questions: 1> Are you running MRIT over Solr indexes that are actually ho

Re: indexing rich data with solr 5.3

2016-01-11 Thread Erick Erickson
Looks like a bad file. Do you have any success using DIH on any files? What happens if you just send that particular file through the ExtractingRequestHandler? Best, Erick On Mon, Jan 11, 2016 at 3:51 PM, kostali hassan wrote: > such files msword and pdf donsnt indexing using *dataimoprt i have

Re: Change leader in SolrCloud

2016-01-11 Thread Erick Erickson
bq: It seems to me a huge wasting of resources. How else would you guarantee consistency? Especially taking in to account Lucene's write-once segments? Master/Slave sidesteps the problem by moving entire, closed segments to the slave, but as Shawn says if the master goes down the slaves don't hav

Re: multiple solr-config.xml files per core

2016-01-11 Thread Erick Erickson
bq: Can Solr Server have different/multiple solr-config.xml file per core? Yes. Each separate core can (and usually does) have its own configs, solrconfig.xml, schema and the like. Your question could be interpreted as asking if you can have multiple solrconfig.xml files in the _same_ core, the

Re: WArning in SolrCloud logs

2016-01-11 Thread Erick Erickson
Just show us the solrconfig.xml file, particularly anything referring to replication, it's easier than talking past each other. Best, Erick. On Mon, Jan 11, 2016 at 12:18 PM, Gian Maria Ricci - aka Alkampfer wrote: > Actually that is a collection I've created uploading into Zookeeper a > confi

Re: multiple solr-config.xml files per core

2016-01-11 Thread techqnq
Thanks Erick, for confirming and putting it correctly in your response. Appreciate your help! -- View this message in context: http://lucene.472066.n3.nabble.com/multiple-solr-config-xml-files-per-core-tp4250009p4250050.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr UIMA Custom Annotator PEAR file installation on Linux

2016-01-11 Thread techqnq
Hi, Tommaso Teofili: any help on this ^

Re: solrcloud -How to delete a doc at a specific shard

2016-01-11 Thread elvis鱼人
hi Erick ,i really want to know too. i remembered change zookeeper,may be correlated with it shard1: 192.168.100.210:7001-leader 192.168.100.211:7001-replica shard2: 192.168.100.211:7002:leader 192.168.100.212:7001:replica shard3: 192.168.

Solr 5.3.1 ArrayIndexOutOfBoundsException while running a query

2016-01-11 Thread Kelly, Frank
Using Solr 5.3.1 in Solr Cloud mode deployed on AWS (each Solr instance has -Xmx 1024m and the server has 8GB of RAM) Am getting a 500 error running a query via the UI Looking in the logs I just see this with no stack trace 2016-01-12 02:04:22.181 ERROR (qtp59559151-7313) [c:qa_us-east-1_here_a

Re: Solr search and index rate optimization

2016-01-11 Thread Zap Org
hello dear thanks for replying it means 3 ZK instances are more than enough in my case On Fri, Jan 8, 2016 at 10:07 PM, Erick Erickson wrote: > Here's a longer form of Toke's answer: > > https://lucidworks.com/blog/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/

Re: Solr search and index rate optimization

2016-01-11 Thread Zap Org
Thanks for replying. Currently my machine specs are: 32 GB RAM, 4 core processor, Windows Server 2008 64bit, 500 GB HD, 16 GB swap memory. Now the already running machine, with CPU usage not more than 10%, has already consumed all the RAM and has started to use swap memory. What my guess is my server will cho

Re: Solr 5.3.1 ArrayIndexOutOfBoundsException while running a query

2016-01-11 Thread Erick Erickson
The Solr logs should have a much more complete stack trace if you can locate them. 1G of memory is very little for any serious Solr. I'm assuming you restarted Solr after the OOM, but Java isn't entirely reliable after an OOM. FWIW, Erick On Mon, Jan 11, 2016 at 6:34 PM, Kelly, Frank wrote: >

Re: Solrcloud for Java 1.6

2016-01-11 Thread Zap Org
hello shawn yes it is written in a style readable with java 7 what i have done is, altered the syntax into java 6 and then compile and it is 100% working. if you need, i can send you the jar. On Fri, Jan 8, 2016 at 11:59 AM, Shawn Heisey wrote: > On 1/7/2016 10:59 PM, Zap Org wrote: > > i have so

Re: solrcloud -How to delete a doc at a specific shard

2016-01-11 Thread Erick Erickson
OK, what exactly do you mean you "changed zookeeper"? If you went in and reassigned IP addresses to nodes then all bets are off. So do you have just a single (or a few) docs that are dups or lots? And by "lots", I'm thinking if all the duplicate IDs are documents that have been indexed since you "

Re: Solr /export handler is exporting only unique values from multivalued field?

2016-01-11 Thread Alok Bhandari
Thanks Joel.

solr in action - multiple language content in one field

2016-01-11 Thread vidya
Hi I have gone through solr in action 14th chapter which tells - "searching content in multiple languages" . But i have a doubt that when i put documents in solr web UI, it recognises every language and gives me the result when queried for it. What exactly did they depict in that chapter. can't so

Re: solrcloud -How to delete a doc at a specific shard

2016-01-11 Thread vidya
Hi I am new to solr and having a doubt on how one can know that a particular shard is in that particular node or ip address. Thanks in advance

Re: solrcloud -How to delete a doc at a specific shard

2016-01-11 Thread elvis鱼人
try config core.properties

Re: solrcloud -How to delete a doc at a specific shard

2016-01-11 Thread elvis鱼人
i mean i changed zookeeper server from one ip address to the other ip address. it is too hard understand,what do you mean "lots"?