Re: Replacing a document in Solr5

2015-12-18 Thread Debraj Manna
Ok. Then what is the difference between addField & setField

Re: Replacing a document in Solr5

2015-12-18 Thread Andrea Gazzarini
As far as I know, this is how Solr works (e.g. it replaces the whole document): how do you replace only a part of a document? Just send a SolrInputDocument with an existing (i.e. already indexed) id and the document (on Solr) will be replaced. Andrea 2015-12-19 8:16 GMT+01:00 Debraj Manna : > C

Replacing a document in Solr5

2015-12-18 Thread Debraj Manna
Can someone let me know how can I replace a document on each update in Solr 5.2.1 using SolrJ? I don;t want to update parts of the document. On doing update it should replace the entire document.

RE: How for distributed search only log collective search response

2015-12-18 Thread Koorosh Vakhshoori
It turns out there is a better way to do this. It does not require code change in Solr, if you are using log4j. However, you need to migrate to log4j.xml file format. The solution is to use the filter feature. Here is what my console appender looks like with the filter:

facet component and uninverted field

2015-12-18 Thread Jamie Johnson
I recently saw that the new JSON Facet API supports controlling the facet method that is used and was wondering if there was any support for doing the same thing in the original facet component? Also is there a plan to deprecate one of these components over the other or is there an expectation tha

Some problems when upload data to index in cloud environment

2015-12-18 Thread 周建二
Hello everyone: I am building a solr cloud production environment. My solr version is 5.3.1. The environment consists three nodes running CentOS 6.5. First I build the zookeeper environment by the three nodes, and then run solr on the three nodes, and at last build a collection consists of th

Re: Load-balancing Solr instances

2015-12-18 Thread Upayavira
And they do the right thing far faster than a load balancer would. In one test I made, zookeeper updated th cluster state within 200ms. It may well have been less than that, I didn't check. I had requests going to a cluster in a loop, and my client (pysolr with PR #138) retried on connection failur

Re: Admin Optimize

2015-12-18 Thread Erick Erickson
I looked and it's the new UI. Usually only visible on the "core admin" tab and it's not there in either Cloud mode or stand-alone. definitely visible on the old UI. Erick On Fri, Dec 18, 2015 at 11:46 AM, Upayavira wrote: > I did not deliberately remove that button, so if it is missing, it co

Re: Please add me to the ContributorsGroup

2015-12-18 Thread Erick Erickson
Done, thanks for contributing! On Fri, Dec 18, 2015 at 11:58 AM, Alan Thompson wrote: > I would like to contribute to the Solr wiki. I am user "AlanThompson > " > > Thanks, > > Alan > > > - > > Con

RE: Permutations of entries in a multivalued field

2015-12-18 Thread Allison, Timothy B.
Duh, didn't realize you could set inOrder in Solr. Y, that's the better solution. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Friday, December 18, 2015 2:27 PM To: solr-user Subject: Re: Permutations of entries in a multivalued field The other thin

Please add me to the ContributorsGroup

2015-12-18 Thread Alan Thompson
I would like to contribute to the Solr wiki. I am user "AlanThompson " Thanks, Alan - Contact a member of the wiki admin and ask to be added to the ContributorsGroup

Re: Admin Optimize

2015-12-18 Thread Upayavira
I did not deliberately remove that button, so if it is missing, it could be a bug. Is it missing on the old, or new, or both UIs? Thanks! Upayavira On Fri, Dec 18, 2015, at 07:08 PM, Erick Erickson wrote: > Right, the whole optimize thing is in a bit of a state of flux. For > indexes that change

Re: Permutations of entries in a multivalued field

2015-12-18 Thread Erick Erickson
The other thing to check is the ComplexPhraseQueryParser, see: https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-ComplexPhraseQueryParser It uses the Span queries to build up the query... Best, Erick On Fri, Dec 18, 2015 at 11:23 AM, Allison, Timothy B. wrote: > Hi Joh

RE: Permutations of entries in a multivalued field

2015-12-18 Thread Allison, Timothy B.
Hi Johannes, I suspect that Scott's answer would be more efficient than the following, and I may be misunderstanding the problem! This type of search is supported at the Lucene level by a SpanNearQuery with inOrder set to false. So, how do you get a SpanQuery in Solr? You might want to l

Re: Admin Optimize

2015-12-18 Thread Erick Erickson
Right, the whole optimize thing is in a bit of a state of flux. For indexes that change quite regularly, it's something of a trap as making one big segment gets in the way of the merging algorithm. It'll still work, but it's not all that useful. For static indexes there's anecdotal evidence that i

Admin Optimize

2015-12-18 Thread Rick Leir
Hi all, The Solr Admin new UI in 5.4 lost the Optimise button in core admin. What did it do, and is the 'Optimize' more automatic? Google is not helping me, but then perhaps I should go past the first results page! It is in the screendump here: https://cwiki.apache.org/confluence/display/solr/Core

Re: Permutations of entries in a multivalued field

2015-12-18 Thread Scott Stults
Johannes, I think your best bet is to create a QParserPlugin that orders the terms of the incoming query. It sounds like you have control over the way that field is indexed, so you could enforce the same ordering when the document comes into Solr. If that's not the case then you'll also want to wr

Re: SolR 5.3.1 deletes index files

2015-12-18 Thread Erick Erickson
Andreas: Let me see if I understand correctly: You have two Solr instances pointing at the _same_ NFS-mounted directory. The lock type of "single" implies this. And you're totally and absolutely sure that only _one_ Solr instance writes to that directory _ever_, right? It's not even the case that

Re: Issues when indexing PDF files

2015-12-18 Thread Zheng Lin Edwin Yeo
Hi Erick, Thanks for your reply. However, it is unlikely to be the browser issue, as the same result occurs when I tried it in the Tika app. Regards, Edwin On 18 December 2015 at 23:39, Erick Erickson wrote: > This could also simply be your browser isn't set up to > display UTF-8, the charac

RE: SolR 5.3.1 deletes index files

2015-12-18 Thread Moll, Dr. Andreas
Hi Shawn, >To figure out what's going on, we will need information about your >server, exactly how you installed Solr, how it is started, how it is >stopped, etc. thanks for your interest in our problem. I currently start SolR on Linux via solr-5.3.1/bin/solr start -f because we are still in a

Re: While idexing millions of data Getting error

2015-12-18 Thread Jack Krupansky
Deep in that stack trace: "Suppressed: java.io.IOException: No space left on device". Out of disk, apparently. Seems unlikely for the big disks on most systems these days. Are you using SSD? They can be relatively small, especially if on a box that has been virtualized into multiple VMs. Some disc

Re: Issues when indexing PDF files

2015-12-18 Thread Erick Erickson
This could also simply be your browser isn't set up to display UTF-8, the characters may be just fine. Best, Erick On Fri, Dec 18, 2015 at 12:58 AM, Zheng Lin Edwin Yeo wrote: > Thanks for all your replies. > > I did chance upon this question from stackoverflow which it says is able to > solve t

Re: Load-balancing Solr instances

2015-12-18 Thread Erick Erickson
You're over-complicating it, the complexity is already in Solr ;)... First, if your using a SolrJ client (assuming you're accessing Solr from your app), use the CloudSolrClient class. This takes a ZK ensemble and does it's own load balancing via a software load balancer. If you're not using SolrJ

Re: Jetty Server in Production

2015-12-18 Thread Upayavira
On Fri, Dec 18, 2015, at 02:57 PM, Mugeesh Husain wrote: > > >> You are *expected* to use the inbuilt Jetty rather than anything else > Yes i am deciding to use inbuilt Jetty server instead of any other. > Question: > If we use inbuit jetty server what type of expected issue may be occurred > ?

Re: While idexing millions of data Getting error

2015-12-18 Thread Toke Eskildsen
Mugeesh Husain wrote: > could you tell me the maximum number of limit for posting data to solr. The data size can be at most 2GB, possibly minus a few bytes. It is due to the HttpUrlComponent used inside of Solr, which only accepts a signed integer as size. As for the number of documents, the

Re: While idexing millions of data Getting error

2015-12-18 Thread Mugeesh Husain
Thanks Toke Eskildsen could you tell me the maximum number of limit for posting data to solr. -- View this message in context: http://lucene.472066.n3.nabble.com/While-idexing-millions-of-data-Getting-error-tp4246320p4246327.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Jetty Server in Production

2015-12-18 Thread Mugeesh Husain
>> You are *expected* to use the inbuilt Jetty rather than anything else Yes i am deciding to use inbuilt Jetty server instead of any other. Question: If we use inbuit jetty server what type of expected issue may be occurred ? or can we use inbuilt jetty server in production ? >> What do you mea

Re: While idexing millions of data Getting error

2015-12-18 Thread Toke Eskildsen
Mugeesh Husain wrote: > I am indexing around 2 millions of data(220) while Api, i am getting > following error. > badMessage: java.lang.IllegalStateException: too much data after closed for > HttpChannelOverHttp@2da6188{r=1,​c=false,​a=IDLE,​uri=-} Sounds like you are trying post more than 2

While idexing millions of data Getting error

2015-12-18 Thread Mugeesh Husain
Hi, I am indexing around 2 millions of data(220) while Api, i am getting following error. badMessage: java.lang.IllegalStateException: too much data after closed for HttpChannelOverHttp@2da6188{r=1,​c=false,​a=IDLE,​uri=-} also i am sharing below Dashboard info. Physical Memory 94.8% Swap Spa

Re: faceting is unusable slow since upgrade to 5.3.0

2015-12-18 Thread Jamie Johnson
Also can we get the capability to choose the method of faceting in the older faceting component? I'm not looking for complete feature parity just the ability to specify the method. As always thanks. On Fri, Dec 18, 2015 at 8:04 AM, Jamie Johnson wrote: > Can we still specify the cache implemen

Re: pf2 pf3 and stopwords

2015-12-18 Thread elisabeth benoit
ok, thanks a lot for your advice. i'll try that. 2015-12-17 10:05 GMT+01:00 Binoy Dalal : > For this case of inversion in particular a slop of 1 won't cause any issues > since such a reverse match will require the slop to be 2 > > On Thu, 17 Dec 2015, 14:20 elisabeth benoit > wrote: > > > Inv

Re: Jetty Server in Production

2015-12-18 Thread Upayavira
On Fri, Dec 18, 2015, at 01:46 PM, Mugeesh Husain wrote: > Hello > > Suggestion Please > 1.) can we use inbuilt jetty server in production > 2.)Can we use solr instance without using service 1) You are *expected* to use the inbuilt Jetty rather than anything else 2) What do you mean "service"

Jetty Server in Production

2015-12-18 Thread Mugeesh Husain
Hello Suggestion Please 1.) can we use inbuilt jetty server in production 2.)Can we use solr instance without using service -- View this message in context: http://lucene.472066.n3.nabble.com/Jetty-Server-in-Production-tp4246309.html Sent from the Solr - User mailing list archive at Nabble.co

Re: faceting is unusable slow since upgrade to 5.3.0

2015-12-18 Thread Jamie Johnson
Can we still specify the cache implementation for the field cache? When this change occurred to faceting (uninverting reader vs field ) it prevented us from moving to 5.x but if we can get the 4.x functionality using that api we could look to port to the latest. Jamie On Dec 17, 2015 9:18 AM, "Yo

Re: Sold Integration tests

2015-12-18 Thread Mikhail Khludnev
fwiw, There is TestCrossCoreJoin, I'm not sure how it's useful for you. On Fri, Dec 18, 2015 at 3:25 PM, Alessandro Benedetti wrote: > Hi Guys, > was investigating a little bit about the use of the SolrJettyTestBase as I > had to run in my integration tests a multi core Solr instance ( is then t

RE: Load-balancing Solr instances

2015-12-18 Thread Markus Jelsma
Hello - a simple load balancer will do just fine. Or more sophisticated tools such as Varnish, HAProxy or Nginx, which we use. A hardware loadbalancer would obviously also do the job Markus -Original message- > From:Andrej van der Zee > Sent: Friday 18th December 2015 13:20 > To: sol

Re: Sold Integration tests

2015-12-18 Thread Alessandro Benedetti
Hi Guys, was investigating a little bit about the use of the SolrJettyTestBase as I had to run in my integration tests a multi core Solr instance ( is then the application than applies the proper " collection" each time it uses the services ) Unfortunately the current implementation of the SolrJet

Load-balancing Solr instances

2015-12-18 Thread Andrej van der Zee
Hi, Could someone please inform me about best practices when load-balancing queries over Solr instances? We will have many shards each with multiple replications. I understand that sending my request to one particular Solr instance will be routed appropriately, but requests will still be sent to

Re: Issues when indexing PDF files

2015-12-18 Thread Zheng Lin Edwin Yeo
Thanks for all your replies. I did chance upon this question from stackoverflow which it says is able to solve the issues: http://stackoverflow.com/questions/32354209/apache-tika-extract-scanned-pdf-files/ However, when I tried to run it, it still get the same "?" output in the content, the s

Re: Expected mime type application/octet-stream but got text/html

2015-12-18 Thread Andrej van der Zee
Hi, > Fixed on erick's behalf because he had to run to a meeting... > > https://cwiki.apache.org/confluence/display/solr/Distributed+Requests > > ...i also wen't ahead to shift the examples to more emphasize using shard > Ids since that's probably safer/cleaner for most people. > Thanks for the