Re: Is solr reindex all documents for each commit?

2013-02-20 Thread Rafał Kuć
Hello! You don't need to re-index all the documents, you only need to index the new ones. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch > Hi > We have a requirement to add 10 or 100 new documents to solr on a daily > basis. Our solr appl

Re: difference between q=field:"value1 value2" and q=field:value1 value2

2013-02-20 Thread Sebastian Saip
I see.. this is exactly as lucene then. Thanks Erick! On 20 February 2013 03:04, Erick Erickson wrote: > Of course you are > (field1:article test OR field2:article test OR ...) > > parses as > > field1:article defaultfield:test OR field2:article defaultfield:test > probably with an implie

Re: SOLR4 SAN vs Local Disk?

2013-02-20 Thread Toke Eskildsen
On Tue, 2013-02-19 at 18:39 +0100, chamara wrote: > Hi Thanks Shawn for the Input, Yes i am using SolrCloud to replicate the > index to another server running with the same spec with 32cores and 72GB RAM > on each machine. I have to test the performance of RAID 10? Have you ever > done a deployment

Re: Newbie question on recurring theme: Dynamic Fields

2013-02-20 Thread Erik Hatcher
You need to use not , that's all :) Erik On Feb 20, 2013, at 4:06, Erik Dybdahl wrote: > Hi, > I'm currently assessing lucene/solr as a search front end for documents > currently stored in an rdbms. > The data has been made searchable to clients, in a way so that each > client/customer ma

Re: Newbie question on recurring theme: Dynamic Fields

2013-02-20 Thread Toke Eskildsen
On Wed, 2013-02-20 at 10:06 +0100, Erik Dybdahl wrote: > However, after definining > stored="true" multiValued="true"/> Seems like a typo to me: You need to write "

array of array in document response structure

2013-02-20 Thread Danny
Hello, I'm wondering if it is possible to have arrays of arrays in the response structure. For example, I can use DataImportHandler to index this XML file : into this structure : But as you can see, in this index I lost the links between "two" elements and "one" elements. So I'm wondering

RE: Edismax odd results

2013-02-20 Thread David Quarterman
Hi Shawn, Schema's at http://justpaste.it/davidqhog. It's the basic SOLR 4.0 with additions! Regards, DQ -Original Message- From: Shawn Heisey [mailto:s...@elyograg.org] Sent: 19 February 2013 18:32 To: solr-user@lucene.apache.org Subject: Re: Edismax odd results On 2/19/2013 11:16

solr bug feedback

2013-02-20 Thread ????
solr: 3.6.1 ---> Class: SolrRequestParsers --->line: 75 hava a manual mistake: ??long uploadLimitKB = 1048; // 2MB default?? should to ??long uploadLimitKB = 2048; // 2MB default thinks for open source!!!

Re: Newbie question on recurring theme: Dynamic Fields

2013-02-20 Thread Erik Dybdahl
Excellent, works like a charm! Though embarassing, it's still a good thing the only problem was me being blind :-) Thank you, Toke and Erik. On Wed, Feb 20, 2013 at 11:47 AM, Toke Eskildsen wrote: > On Wed, 2013-02-20 at 10:06 +0100, Erik Dybdahl wrote: > > However, after definining > >

RE: Edismax odd results

2013-02-20 Thread David Quarterman
Hi Erick, Debug=all posted on http://justpaste.it/davidqhogdebug. Can't see anything obvious myselfbut then I'm not an expert! Regards, DQ -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: 20 February 2013 02:02 To: solr-user@lucene.apache.org Subject:

Re: WIKI: Does JSON Update format actually support single-object submit?

2013-02-20 Thread Dmitry Kan
Looks like a mistake in wiki. At least when compared with yonik's post: http://yonik.com/solr/optimistic-concurrency/ i.e. [{}] syntax is used for optimistic concurrency update. Regards, Dmitry On Tue, Feb 19, 2013 at 3:09 PM, Alexandre Rafalovitch wrote: > Right you are. I meant to say that

Re: Storing all attributes in the document so that I can avoid a distributed cache?

2013-02-20 Thread Erick Erickson
bq: Does the new compressed stored field format in Solr 4.1 do anything to reduce the number of disk seeks required to retrieve all document fields? Probably, but I doubt by a whole lot. Although I confess I really don't know the guts. Let's assume that all the stored content for a doc is contiguo

Re: Threads running while querrying

2013-02-20 Thread Erick Erickson
Silly question perhaps, but are you feeding queries at Solr with a single thread? Because Solr uses multiple threads to search AFAIK. Best Erick On Wed, Feb 20, 2013 at 4:01 AM, Manuel Le Normand < manuel.lenorm...@gmail.com> wrote: > More to it, i do see 75 more threads under the process of t

Re: Edismax odd results

2013-02-20 Thread Erick Erickson
OK, first: wildcarding and stemming don't get along well together. Since you've stemmed the field, enginee* would not match the stemmed term engin. This is actually pretty tricky to try to implement. For instance, how would enginee stem? So the fqs you posted are going to mislead you in that regard

RE: solr bug feedback

2013-02-20 Thread Uwe Schindler
This is already fixed in Solr 4.1! - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de From: 虛客 [mailto:itemdet...@qq.com] Sent: Wednesday, February 20, 2013 11:17 AM To: solr-user Subject: solr bug feedback

Re: html parsing with dataimport handler from sql column

2013-02-20 Thread Gora Mohanty
On 20 February 2013 18:31, Rohan Thakur wrote: > hi all > > I have data stored in HTML format in a column in sql database and want to > index the data from that field to solr how can I do that any one has idea > please help. right now i am treating it as a string which is indexing > complete HTML

Re: html parsing with dataimport handler from sql column

2013-02-20 Thread Rohan Thakur
hi yes I dont have html as documents I have data saved in sql data base in HTML format and I want to index it on solr but not as complete string that is with tags but just want to index the actual text in it...that is strip off the tags. regards Rohan On Wed, Feb 20, 2013 at 6:40 PM, Gora Mohant

Nagios

2013-02-20 Thread Arkadi Colson
Anyone running Nagios monitoring without JMX on Solr 4.0 or 4.1? Thx! -- Best regards Arkadi Colson

Re: Threads running while querrying

2013-02-20 Thread Manuel Le Normand
Yes, i made a one threaded script which sends a querry by a post request to the shard's url, gets back the response and posts the next querry. How can it matter? Manuel On Wednesday, February 20, 2013, Erick Erickson wrote: > Silly question perhaps, but are you feeding queries at Solr with a sin

Re: Solr 4 Spatial: NoClassDefFoundError: com/vividsolutions/jts/geom/Geometry

2013-02-20 Thread Dotan Cohen
Note that the issue is present in Solr 4.1 as well. I did find this post, which is not very encouraging: http://grokbase.com/t/lucene/solr-user/128sz03jdk/recursiveprefixtreestrategy-class-not-found Might the name of the class be simply a typo that is easily rectified? How might one go about chec

Re: SOLR4 SAN vs Local Disk?

2013-02-20 Thread Shawn Heisey
On 2/20/2013 1:12 AM, Toke Eskildsen wrote: On Tue, 2013-02-19 at 18:39 +0100, chamara wrote: Hi Thanks Shawn for the Input, Yes i am using SolrCloud to replicate the index to another server running with the same spec with 32cores and 72GB RAM on each machine. I have to test the performance of R

Re: Threads running while querrying

2013-02-20 Thread Erick Erickson
Well, it matters because your single-threaded client is firing one request, waiting for the response, then firing another. There's no opportunity for Solr to use more than one thread for queries if there's only a single thread on a single client ever making requests Or I misunderstand what you

Re: array of array in document response structure

2013-02-20 Thread Erick Erickson
None of your additions came through my e-mail client, can you put them somewhere (maybe paste bin?) and provide a link? Best Erick On Wed, Feb 20, 2013 at 6:20 AM, Danny wrote: > Hello, > > I'm wondering if it is possible to have arrays of arrays in the response > structure. > > For example, I

Re: array of array in document response structure

2013-02-20 Thread Danny
Ok :) I'm wondering if it is possible to have arrays of arrays in the response structure. For example, I can use DataImportHandler to index this XML file : http://pastebin.com/j2s3iVmk into this structure : http://pastebin.com/MJ7YYh0L But as you can see, in this index I lost the links between

Re: [solr cloud 4.1] Issue with order in a batch of commands

2013-02-20 Thread Mark Miller
It's because of how we currently handle batched requests - we buffer a different number of deletes thqn we do adds and flush them separately - mainly because the size of each is likely to be so different, at one point we would buffer a lot more deletes. So currently, you want to break these up

Re: Dynamic field searching with edismax

2013-02-20 Thread adityab
thanks Erick, looks like i need to generate some part of the query in Application layer to handle this. -- View this message in context: http://lucene.472066.n3.nabble.com/Dynamic-field-searching-with-edismax-tp4041461p4041551.html Sent from the Solr - User mailing list archive at Nabble.com.

Parallel indexing and swapping

2013-02-20 Thread Shankar Sundararaju
Hi All, I am using Solr 4.1. I have a Solr cluster of 3 leaders and 3 replicas hosting collection1 consisting of thousands of documents currently serving the search requests. I would like re-index all the documents in another collection, say collection2 in this same solr cluster and swap it with

Re: array of array in document response structure

2013-02-20 Thread Erick Erickson
This isn't possible that I know of, and I'm skeptical that it would be embraced. The problem here is that this would break the current format pretty severely I think, making back-compat a pain. And then there would be the arrays-of-arrays-of-arrays, and supporting N-dimensional arrays seems like a

Re: edismax and fields with spaces - SOLR 3.6

2013-02-20 Thread no spam
Thanks, it's what I have suggested. The vendor we use has a layer on top of SOLR and that makes it non-trivial to change field names. On Tue, Feb 19, 2013 at 6:53 PM, Otis Gospodnetic < otis.gospodne...@gmail.com> wrote: > Hi, > > I think you'll save yourself a ot of trouble if you just rename y

Re: Japanese mm parameter in Solr3.6.2 generated lots of results with big performance hit

2013-02-20 Thread kirpakaro
Hi Jack, The requestHandler in solrconfig.xml is defined as, thanks for your response AND schema.xml: Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/Japanese-mm-parameter-in-Solr3-6-2-generated-lots-of-results-with-big-performance-hit-tp40412

Re: array of array in document response structure

2013-02-20 Thread Michael Della Bitta
If you don't need to query based on correlations between associated values, you can store parallel arrays in separate fields in Solr and get them back in insertion order. So like this: https://gist.github.com/mdellabitta/4996336 Michael Della Bitta ---

Re: Storing all attributes in the document so that I can avoid a distributed cache?

2013-02-20 Thread Steve Rowe
See also Adrien Grand's blog post on this feature (he implemented it): Steve On Feb 20, 2013, at 7:22 AM, Erick Erickson wrote: > bq: Does the new compressed stored field format in Solr 4.1 do anything to > redu

Re: [solr cloud 4.1] Issue with order in a batch of commands

2013-02-20 Thread Vinay Pothnis
Thanks for the reply. In my case, the order is definitely critical. It would be great if this can be fixed. And yes, even SolrJ deals with deletes first and then the add/updates. And that was the reason why I switched from SolrJ to plain http. There is a ticket with SolrJ as well https://issues.ap

Re: array of array in document response structure

2013-02-20 Thread Danny
Erick : you're right, allowing nested arrays would be like opening Pandora's Box :) Michaël : having parallel arrays and losing their relations is what I want to avoid, actually :) I guess I'll have to find another way. Thanks, Danny. -- View this message in context: http://lucene.472066.n3

Re: Threads running while querrying

2013-02-20 Thread Walter Underwood
In production, you should have requests arriving at Solr simultaneously. Those simultaneous requests will be processed in parallel. For each query, there are many ways to improve response time. It depends on the query and the schema. What query response time are you seeing? wunder On Feb 20,

Running SOLR over SSL

2013-02-20 Thread Daire Mac Mathúna
Hi, I'm running SOLR 3.6.1, using SOLRJ 3.6.1 in my application to run queries against it. Two queries: 1. I presume there is no problem running SOLR over SSL / HTTPS and communicating with it using SOLRJ? 2. We are looking at implementing SOLR Replication using one master node and one slave nod

RE: Edismax odd results

2013-02-20 Thread David Quarterman
Hi Erick, I understand the wildcard issue - that was more desperation on our part than logic! TermsComponent showed 222 197 so the term is in the index. Using the explainOther, I can see that the relevance of documents with 'engineer boots' in the name is low compared to the

If we Open Source our platform, would it be interesting to you?

2013-02-20 Thread Marcelo Elias Del Valle
Hello All, I’m sending this email because I think it may be interesting for Solr users, as this project have a strong usage of Solr platform. We are strongly considering opening the source of our DMP (Data Management Platform), if it proves to be technically interesting to other developers / comp

Re: How to use shardId

2013-02-20 Thread Gary Yngve
the param in solr.xml should be shard, not shardId. i tripped over this too. -g On Mon, Jan 14, 2013 at 7:01 AM, starbuck wrote: > Hi all, > > I am trying to realize a solr cloud cluster with 2 collections and 4 shards > each with 2 replicates hosted by 4 solr instances. If shardNum parm is s

RE: If we Open Source our platform, would it be interesting to you?

2013-02-20 Thread Dyer, James
I only looked at your link super fast, but this seems like a very viable alternative to Solr's DIH. DIH does the job fairly well but we've struggled to have developers who are willing to maintain it. The problem, I think, is that DIH appeals to non-programmers who want to index their data with

Re: If we Open Source our platform, would it be interesting to you?

2013-02-20 Thread Alexandre Rafalovitch
How does this compare to Pentaho Kettle? http://kettle.pentaho.com/ I am not terribly familiar with this space, so - if the question is stupid, feel free to address it appropriately. :-) Regards, Alex. Personal blog: http://blog.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexand

Re: If we Open Source our platform, would it be interesting to you?

2013-02-20 Thread Marcelo Elias Del Valle
Hello James, First of all, thanks for your feedback. I will try to clarify some questions bellow. 2013/2/20 Dyer, James > I only looked at your link super fast, but this seems like a very viable > alternative to Solr's DIH. DIH does the job fairly well but we've > struggled to have develo

Re: Running SOLR over SSL

2013-02-20 Thread Michael Della Bitta
I don't think SSL is a problem, but you're going to have to take care to make sure your SSL certs are either signed by a cert trusted by the JVM, or imported into the JVM's keystore. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New

Re: array of array in document response structure

2013-02-20 Thread Michael Della Bitta
Danny, You don't lose their relations with parallel arrays, it's just that you can't *query* those relations. So for example, if you query: one:1 AND two:2, you'll get documents where both those values are set, but not ones where both of those values are *paired*. Michael Della Bitta ---

Re: array of array in document response structure

2013-02-20 Thread Michael Della Bitta
So then you'd be fine. You get the document back and iterate over both multivalued fields in parallel. Insertion order is the same as retrieval order. Unless you forget to stick an empty value in for a null or something like that, you're golden. Michael Della Bitta ---

Solr UIMA

2013-02-20 Thread jazz
Hi, I managed to get Solr and UIMA work together. When I send a document to Solr it annotates the field "contents" and adds the result of the UIMA annotations to e.g. a field "location". My question is: how do I annotate the contents of an already existing solr database without triggering an /u

Re: Solr UIMA

2013-02-20 Thread Alexandre Rafalovitch
Why not just reindex? I could think of a couple of ugly hacks, but reindex is better first option. Regards, Alex On 20 Feb 2013 15:43, "jazz" wrote: > Hi, > > I managed to get Solr and UIMA work together. When I send a document to > Solr it annotates the field "contents" and adds the result

Re: If we Open Source our platform, would it be interesting to you?

2013-02-20 Thread Marcelo Elias Del Valle
Hello Alexandre, 2013/2/20 Alexandre Rafalovitch > How does this compare to Pentaho Kettle? http://kettle.pentaho.com/ > > I am not terribly familiar with this space, so - if the question is stupid, > feel free to address it appropriately. :-) Don't worry, it's not stupid. I will make my best

Re: Timestamp field is changed on update

2013-02-20 Thread Isaac Hebsh
Nobody responded my JIRA issue :( Should I commit this patch into SVN's trunk, and set the issue as Resolved? On Sun, Feb 17, 2013 at 9:26 PM, Isaac Hebsh wrote: > Thank you Alex. > Atomic Update allows you to "add" new values into multivalued field, for > example... It means that the original

Re: RequestHandler init failure

2013-02-20 Thread Mingfeng Yang
Chris, My config file did include the section of loading related plugin. Ming On Tue, Feb 19, 2013 at 10:42 AM, Chris Hostetter wrote: > > : Found it by myself. It's here > : > http://mirrors.ibiblio.org/maven2/org/apache/solr/solr-dataimporthandler/4.1.0/ > : > : Download and move the jar fi

Re: Multivalued non-indexed field?

2013-02-20 Thread Steve Rowe
Hi Isaac, Can you describe the problem? E.g., what do you mean by "ignores"? Usually sample inputs and expected outputs help. Steve On Feb 20, 2013, at 4:25 PM, Isaac Hebsh wrote: > Hi. > > I'm trying to configure a stored-, non-indexed field, to be multiValued. > It seems that Solr ignores

(info)solrCloud swap core query lost data and cannot insert data

2013-02-20 Thread Rollin.R.Ma (lab.sh04.Newegg) 41099
Hi I am a newer to solrCloud, I use solr4.1 solrCloud and swap core to change index. For example (1) Collection1 has 2 shards and collection2 same to collection1. (2) Add content to collection2 (3) Swap collection1 & collection2 (4) Query collection1 (below ), sometimes

To get Term Offsets of a term per document

2013-02-20 Thread vempap
Hello, Is there a way to get Term Offsets of a given term per document without enabling the termVectors ? Is it that Lucene index stores the positions but not the offsets by default - is it correct ? Thanks, Phani. -- View this message in context: http://lucene.472066.n3.nabble.com/To-get-

Re: (info)solrCloud swap core query lost data and cannot insert data

2013-02-20 Thread Mark Miller
Swap is unsupported - really it should throw an exception right now. There is a JIRA issue to add support for swap in SolrCloud mode of some kind. - Mark On Feb 20, 2013, at 7:59 PM, Rollin.R.Ma (lab.sh04.Newegg) 41099 wrote: > > Hi > > I am a newer to solrCloud, I use solr4.1 solrClou

Re: Custom shard key, shard partitioning

2013-02-20 Thread Mark Miller
On Feb 19, 2013, at 9:16 AM, Markus Jelsma wrote: > Ah, thanks. Got a Jira? I don't think i'm watching that one right now. https://issues.apache.org/jira/browse/SOLR-3154 - Mark

Re: Parallel indexing and swapping

2013-02-20 Thread Mark Miller
There is an open issue somewhere for this type of support. We don't have a simple way to do it currently. We also will be looking at adding index alias', which is probably another feature you could use to solve this. Currently, you would need some kind of load balancer to achieve this nicely I

solr shards

2013-02-20 Thread rulinma
Hi Use SolrCloud4.1 with one collection and 2 shards. I add 4 docs with id(1,2,3,4) set. I found that they appeard in shard1 or shard2 together, not (1,3) in shard1 and (2,4) in shard2, I want to know why? who can give me some advice. Thanks! -- View this message in context: http://lu

Re: solr shards

2013-02-20 Thread Mark Miller
Can you give some more details? When you look at the cloud tab of the admin UI, does the cluster visualization look right? Are all the nodes green? Perhaps the shard is a leader and a replica single shrad and you just think it's 2 shards? - Mark On Feb 20, 2013, at 8:26 PM, rulinma wrote: > H

答复: solr shards

2013-02-20 Thread Rollin.R.Ma (lab.sh04.Newegg) 41099
Hi I user below code is right: SolrInputDocument doc1 = new SolrInputDocument(); doc1.addField("id", "1"); doc1.addField("name", "11s"); SolrInputDocument doc2 = new SolrInputDocument();

Re: solr shards

2013-02-20 Thread Mark Miller
On Feb 20, 2013, at 9:47 PM, Rollin.R.Ma (lab.sh04.Newegg) 41099 wrote: > I also see " Concerning CloudSolrServer, there is a JIRA to make it hash and > send updates to the "right" leader, but currently it still doesn't - it just > favors leaders in general over non leaders currently. " > >

Document update question

2013-02-20 Thread Jack Park
>From what I can read about partial updates, it will only work for singleton fields where you can set them to something else, or multi-valued fields where you can add something. I am testing on 4.1 I ran some tests to prove to me that you cannot do anything else to a multi-valued field, like remov

SolrCloud as my primary data store

2013-02-20 Thread jimtronic
Now that I've been running Solr Cloud for a couple months and gotten comfortable with it, I think it's time to revisit this subject. When I search for the topic of using Solr as a primary db online, I get lots of discussions from 2-3 years ago and usually they point out a lot of hurdles that have

答复: solr shards

2013-02-20 Thread Rollin.R.Ma (lab.sh04.Newegg) 41099
Now it is ok truly, but I met the trouble yesterday, I guess the environment problem. Anyone can give me some advice about the algorithm of the hash router. I test id(1,4) go to shard1 (2,3) go to shard2. What is the algorithm ? Thanks a lot. -邮件原件- 发件人: Mark Miller [mailto:markrmil...@

Solr UIMA

2013-02-20 Thread jazzsalsa
Reposted because I did not arrive at the list (I didn't see it)On Feb 20, 2013, at 12:42 PM, jazz wrote:Hi, I managed to get Solr and UIMA work together. When I send a document to Solr it annotates the field "contents" and adds the result of the UIMA annotations to e.g. a field "location". My que

Re: Nagios

2013-02-20 Thread Otis Gospodnetic
Hi, I'm not, but Googling shows a few links that look relevant. You may also want to check out http://sematext.com/spm/solr-performance-monitoring/index.html which has both monitoring and alerting it in and will soon be getting an API for Nagios and Sensu scripts to hit. Otis -- Solr & ElasticSe