Re: Delete in Solr based on foreign key (like SQL delete from … where id in (select id from…)

2014-10-09 Thread Matthew Nigl
I was going to say that the below should do what you are asking: {!join from=docid_s to=foreign_key_docid_s}(message_state_ts:[* TO 2014-10-05T00:00:00Z} AND message_state_ts:{2014-10-01T00:00:00Z TO *]) But I get the same response as in https://issues.apache.org/jira/browse/SOLR-6357 I can't th
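As a rough sketch of the request shape only: the join query above could be wrapped in a JSON delete-by-query body for Solr's update handler. The field names and date bounds come from the message; the helper names `build_join_query` and `build_delete_body` are hypothetical. Per the message, this kind of delete runs into the response described in SOLR-6357, so this illustrates the query syntax rather than a working fix.

```python
import json


def build_join_query(from_field, to_field, date_field, start, end):
    """Build a {!join} query selecting documents strictly between start and end.

    In Solr range syntax '[' and ']' include the bound, '{' and '}' exclude it.
    """
    before_end = date_field + ":[* TO " + end + "}"      # everything before `end`, exclusive
    after_start = date_field + ":{" + start + " TO *]"   # everything after `start`, exclusive
    join = "{!join from=" + from_field + " to=" + to_field + "}"
    return join + "(" + before_end + " AND " + after_start + ")"


def build_delete_body(query):
    """Wrap a query string in the JSON delete-by-query envelope."""
    return json.dumps({"delete": {"query": query}})


query = build_join_query(
    "docid_s", "foreign_key_docid_s", "message_state_ts",
    "2014-10-01T00:00:00Z", "2014-10-05T00:00:00Z",
)
body = build_delete_body(query)
```

The body would be posted to the collection's update handler with a JSON content type; whether the join is honoured there is exactly what the JIRA issue is about.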

Re: Is it possible to replicate just the solrconfig.xml file

2014-10-09 Thread Erick Erickson
You can set up a config files (confFiles) section in solrconfig.xml on the master, something like: schema.xml,stopwords.txt I'm not totally sure whether this only replicates the files if they've changed, but even if not, it's not so much network traffic that I'd worry about it. bq: ... schema.xml, solrconfig.xm

Is it possible to replicate just the solrconfig.xml file

2014-10-09 Thread Tang, Rebecca
I have a master-slave setup. Most of the time when I replicate, I want to replicate the index as well as some of the config files like schema.xml, solrconfig.xml, etc. I have this set up and it works well. But sometimes, I make a small tweak to solrconfig.xml and deploy it to the master. Af

Re: Data Import Handler for CSV file

2014-10-09 Thread Ahmet Arslan
Hi, I think you can define field names in the first line of the csv. Why don't you use curl to index the csv? I don't have a full working example with DIH, but I have the following example that indexes every line as a separate Solr document. You need to add a transformer that splits each line according to co
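A minimal sketch of the curl-style approach mentioned above, assuming the CSV update handler is mounted at `/update/csv` and that the column names are passed with the `fieldnames` parameter rather than read from a header line. The base URL, core name and the helper `csv_update_url` are illustrative assumptions, not taken from the thread.

```python
from urllib.parse import urlencode


def csv_update_url(base_url, fieldnames, separator=",", header=False, commit=True):
    """Build the request URL for posting a CSV file to the CSV update handler."""
    params = {
        "fieldnames": ",".join(fieldnames),          # column names, in file order
        "separator": separator,                      # field delimiter used in the file
        "header": "true" if header else "false",     # whether line 1 holds field names
        "commit": "true" if commit else "false",     # commit once the upload finishes
    }
    return base_url.rstrip("/") + "/update/csv?" + urlencode(params)


url = csv_update_url("http://localhost:8983/solr/collection1", ["id", "name"])
```

The CSV file itself would then be sent as the request body, for example with curl's `--data-binary` option and a `text/csv` content type.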

Re: Data Import Handler for CSV file

2014-10-09 Thread Alexandre Rafalovitch
You could always define the parameters in the solrconfig.xml on a custom handler. Don't have to pass the same values over and over again. Regards, Alex On 09/10/2014 5:26 pm, "nabil Kouici" wrote: > Hi Ahmet, > > Thank you for this reply. Agree with you that csv update handler is fast > bu

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Ahmet Arslan
Yes, your plain string queries will automatically match in index. This is always true. If you don't strip html, html tags are considered part of the document and would cause false matches. For example q=bold,code,class, etc. On Friday, October 10, 2014 12:35 AM, Vishal Sharma wrote: I think

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Vishal Sharma
I think I didn't get you completely. I am really sorry for asking this again. New to the Solr world :) Are you saying that if I don't strip html, my plain string queries will automatically match in the index?

Re: Data Import Handler for CSV file

2014-10-09 Thread nabil Kouici
Hi Ahmet, Thank you for this reply. I agree with you that the csv update handler is fast, but we always need to specify the columns in the http request. In addition, I can't find documentation on how to use csv update from SolrJ. Could you please send me an example of DIH to load a CSV file? Regards, Nabil.

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Ahmet Arslan
It's up to you. If you strip html using a char filter, it won't match html tags. But the original document, when requested using the fl= parameter, will be html. If you do not strip html at all, q=html will return all documents. Ahmet On Friday, October 10, 2014 12:01 AM, Vishal Sharma wrote:

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Vishal Sharma
Ahmet, So it's not necessary to strip html? Are you saying that plain text query strings will automatically match the html content indexed to Solr?

Re: Stripping html from text before indexing to solr

2014-10-09 Thread Ahmet Arslan
Hi Vishal, Stripping html is not mandatory. Solr indexes it just like other text. By the way, there are two places where you can strip html. i) at analysis : char filter ii) before analysis : Update processor, html strip transformer Ahmet On Thursday, October 9, 2014 11:50 PM, Vishal Sharma
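To illustrate what stripping before analysis amounts to, here is a small client-side sketch that removes markup before a document is sent to Solr. This is a different technique from the char filter and update processor named above (it runs in the indexing client, not in Solr), and the names `_TextExtractor` and `strip_html` are made up for the example.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment, dropping all tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_html(markup):
    """Return the text content of `markup` with whitespace runs collapsed."""
    parser = _TextExtractor()
    parser.feed(markup)
    parser.close()
    return " ".join("".join(parser.parts).split())


plain = strip_html('<p class="code">Hello <b>bold</b> world</p>')
```

With the tags gone, terms such as "class" or "code" from the markup no longer end up in the indexed text, which is the false-match problem described elsewhere in this thread. Note that this sketch keeps the contents of script and style elements, which a real pipeline would probably want to drop.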

Stripping html from text before indexing to solr

2014-10-09 Thread Vishal Sharma
Is stripping html always required before sending content to Solr, or does it accept html-based data as well? If yes, in that scenario how does the match happen? Looking for the best foolproof way of indexing html data to Solr fields where it would always be ready for match with query string

Delete in Solr based on foreign key (like SQL delete from … where id in (select id from…)

2014-10-09 Thread Luis Festas Matos
Given the following Solr data: 1008rs1cz0icl2pk 2014-10-07T14:18:29.784Z h60fmtybz0i7sx87 1481314421768716288 u42xyz1cz0i7sx87 h60fmtybz0i7sx87 1481314421768716288 u42xyz1cz0i7sx87 h60fmtybz0i7sx87 1481314421448900608 I would like to know how to *DELETE docum

SuggestComponent in distributed (SolrCloud) environment

2014-10-09 Thread Frank Wesemann
Hi, I'm about to integrate the SuggestComponent in our application and noticed some behavior I didn't expect. My Solr version is 4.9. 1. The component returns common terms shards-n times. 2. Due to how the suggestions from each shard are collected, the "exactMatchFirst" Parameter on the LookupImpl

Re: Data Import Handler for CSV file

2014-10-09 Thread Ahmet Arslan
Hi Nabil, what's wrong with the csv update handler? It is quite fast. By the way, DIH has a line entity processor, so yes, it is doable with existing DIH components. Ahmet On Thursday, October 9, 2014 9:58 PM, nabil Kouici wrote: Hi All, Is it possible to have in solr a DIH to load from CSV file.

Data Import Handler for CSV file

2014-10-09 Thread nabil Kouici
Hi All, Is it possible to have a DIH in Solr that loads from a CSV file? Currently I'm using the update/csv handler, but it doesn't meet my needs. Regards, NKI.

Facets for Child Documents?

2014-10-09 Thread Edwards, Joshua
Is it possible to use a facet to filter parent documents based on a child field? For example, if I have Authors as my main record, and Books as the child record, would it be possible to have a facet that filtered Authors by Book publication date (with publication date existing on the Book docum

Re: does one need to reindex when changing similarity class

2014-10-09 Thread Ahmet Arslan
How about SweetSpotSimilarity? Length norm is saved at index time? On Thursday, October 9, 2014 5:44 PM, Jack Krupansky wrote: The similarity class is only invoked at query time, so it doesn't participate in indexing. -- Jack Krupansky -Original Message- From: Markus Jelsma Sent

Re: SolrCloud - Cloud tab on admin dashboard not loading

2014-10-09 Thread Shawn Heisey
On 10/9/2014 9:35 AM, Erick Erickson wrote: > Hmmm, works fine for me. But I'm a little puzzled where the /zookeeper > is coming from in your URL, that isn't the URL sent by the admin API > that I know of. > > Bottom line: It Works On My Machine. > > given that you do have 8080 in your URL I'm gues

Re: [ANNOUNCE] Luke 4.10.1 released

2014-10-09 Thread Dmitry Kan
Hi Bernd, Thanks for checking out these warnings. Would you like to create a pull request on github? Or alternatively, create an issue there and describe what you did to fix this. Thanks, Dmitry On Thu, Oct 9, 2014 at 12:00 PM, Bernd Fehling < bernd.fehl...@uni-bielefeld.de> wrote: > Thanks fo

Re: Solr Cloud has lower performance with more servers

2014-10-09 Thread Erick Erickson
Just to check: your index is NOT sharded, correct? Assuming not sharded, is it SolrCloud? If not SolrCloud, how are the indexes kept in synch? Master/slave? Manual copy? But for an unchanging index, this is definitely odd. Best, Erick On Thu, Oct 9, 2014 at 7:40 AM, Walter Underwood wrote: > I

Re: SolrCloud - Cloud tab on admin dashboard not loading

2014-10-09 Thread Erick Erickson
Hmmm, works fine for me. But I'm a little puzzled where the /zookeeper is coming from in your URL, that isn't the URL sent by the admin API that I know of. Bottom line: It Works On My Machine. given that you do have 8080 in your URL I'm guessing you're on Tomcat or some such? Maybe there's some p

Re: Solr Index to Helio Search

2014-10-09 Thread Yonik Seeley
Hmmm, I imagine this is due to the lucene back compat bugs that were in 4.10, and the fact that the last release of heliosearch was branched off of the 4x branch. I just tried moving an index back and forth between my local heliosearch copy and solr 4.10.1 and things worked fine. Here's the snaps

Re: does one need to reindex when changing similarity class

2014-10-09 Thread Jack Krupansky
The similarity class is only invoked at query time, so it doesn't participate in indexing. -- Jack Krupansky -Original Message- From: Markus Jelsma Sent: Thursday, October 9, 2014 6:59 AM To: solr-user@lucene.apache.org Subject: RE: does one need to reindex when changing similarity cl

Re: Solr Cloud has lower performance with more servers

2014-10-09 Thread Walter Underwood
Is this a production log of queries, with lots of repeats? If so, you may be seeing the normal effect of lower cache hit rates. Check the hit rate for the query result cache in the two setups. With a single machine, the second occurrence of a query will be a cache hit. With two machines, it wil
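The cache effect described above can be illustrated with a toy simulation. It assumes strict round-robin load balancing, an unbounded per-server query result cache, and a made-up query log in which every query appears twice in a row; none of these details come from the thread, and `hit_rate` is a hypothetical helper.

```python
from itertools import cycle


def hit_rate(queries, num_servers):
    """Fraction of queries answered from cache under round-robin balancing."""
    if not queries:
        return 0.0
    caches = [set() for _ in range(num_servers)]  # one cache per server
    hits = 0
    # Pair each query with the server whose turn it is.
    for query, cache in zip(queries, cycle(caches)):
        if query in cache:
            hits += 1
        else:
            cache.add(query)  # first time this server sees the query
    return hits / len(queries)


log = ["a", "a", "b", "b", "c", "c"]
single = hit_rate(log, 1)  # every repeat lands on the same, already warm, cache
double = hit_rate(log, 2)  # each repeat lands on the other server's cold cache
```

In this contrived log the single server answers half the queries from cache while the pair of servers answers none, which is the kind of gap the query result cache statistics would reveal.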

Re: Solr Cloud has lower performance with more servers

2014-10-09 Thread Charlie Hull
On 09/10/2014 14:06, Yannick wrote: Hello good Solr people, I have the following surprising situation. I created a group of 2 Solr servers with a load-balancer in front (Haproxy). I have a batch client that sends requests (read-only) continuously to the load-balancer. The problem is: the perfor

SolrCloud - Cloud tab on admin dashboard not loading

2014-10-09 Thread arild.nils...@gmail.com
I'm trying to set up SolrCloud with embedded Zookeeper for Solr 4.10.1. The logs seem fine when starting up with a single Solr instance creating and using an embedded Zookeeper instance. I'm also able to create collections and query collections via curl. However, there is an HTTP 404 not found when

Re: Solr Cloud has lower performance with more servers

2014-10-09 Thread Yannick
Hi Toke, thanks for your suggestion - definitely an interesting idea. But unfortunately no, no indexing job is running; those are static indexes being queried. The execution time is also very consistent in each condition, I did quite a few tests. Yann On Thursday, October 9, 2014 3:56 PM, T

Re: Solr Cloud has lower performance with more servers

2014-10-09 Thread Toke Eskildsen
On Thu, 2014-10-09 at 15:06 +0200, Yannick wrote: > I created a group of 2 Solr servers with a load-balancer in front > (Haproxy). I have a batch client that sends requests (read-only) > continuously to the load-balancer. The problem is: the performance is > slower with 2 servers than it is with a

RE: Using Velocity with Child Documents?

2014-10-09 Thread Edwards, Joshua
I just realized that Solr supports returning child records with the parent starting in version 4.9. I was on 4.8, so I will be upgrading to latest before continuing on this. I think it will then make it easier to show the results in Velocity (in case anyone else needs to do this). Thanks, Jos

Re: eDisMax parser and special characters

2014-10-09 Thread Lanke,Aniruddha
Is there a way to override this default behavior? — Lanke On Oct 8, 2014, at 4:55 PM, Jack Krupansky wrote: > Hyphen is a "prefix operator" and is normally followed by a term to indicate > that the term "must not" be present. So, your query has a syntax error. The > two query parsers differ i
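One common client-side workaround, not proposed in this thread, is to backslash-escape query syntax characters in user input so that a lone hyphen is treated as literal text instead of the "must not" prefix operator. The sketch below is similar in spirit to SolrJ's `ClientUtils.escapeQueryChars`, except that it leaves whitespace alone; the function name and character list are assumptions made for the example.

```python
# Characters with special meaning in the Lucene/Solr query syntax.
_SPECIAL_CHARS = set('\\+-!():^[]"{}~*?|&;/')


def escape_query_chars(text):
    """Prefix every query-syntax character in `text` with a backslash."""
    return "".join("\\" + ch if ch in _SPECIAL_CHARS else ch for ch in text)


escaped = escape_query_chars("foo - bar")
```

Escaping everything also disables operators the user may have intended, such as quotes for phrases, so whether this is acceptable depends on the application.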

Re: per field similarity not working with solr 4.2.1

2014-10-09 Thread elisabeth benoit
ok thanks. I think something is not working here (I'm quite sure my similarity class is not being used because when I use SchemaSimilarityFactory and a custom fieldtype similarity definition with NoTFSimilarity, I don't get the same scoring as when I use NoTFSimilarity as global similarity; but

RE: Using Velocity with Child Documents?

2014-10-09 Thread Edwards, Joshua
Hey, Erick - Thanks for the response. Yes, I've played around with Velocity before, and I've been able to get some good results. However, with how Solr stores (and returns) child documents, I don't know of a way to get a response that is similar to the initial Json going in - with each parent

Solr Cloud has lower performance with more servers

2014-10-09 Thread Yannick
Hello good Solr people, I have the following surprising situation. I created a group of 2 Solr servers with a load-balancer in front (Haproxy). I have a batch client that sends requests (read-only) continuously to the load-balancer. The problem is: the performance is slower with 2 servers than

RE: per field similarity not working with solr 4.2.1

2014-10-09 Thread Markus Jelsma
Well, it is either the output of your calculation or writing something to System.out Markus -Original message- > From:elisabeth benoit > Sent: Thursday 9th October 2014 13:31 > To: solr-user@lucene.apache.org > Subject: Re: per field similarity not working with solr 4.2.1 > > Thanks

Re: SolrCloud with client ssl

2014-10-09 Thread Sindre Fiskaa
This is output from overseer with severity set to INFO: 942420 [http-nio-443-exec-7] INFO org.apache.solr.handler.admin.CollectionsHandler ? Creating Collection : numShards=3&createNodeSet=vt-searchln03:443_solr,vt-searchln04:443_solr,vt-searchln01:443_solr,vt-searchln02:443_solr,vt-searchln05:443_s

Re: Edismax parser and boosts

2014-10-09 Thread Pawel Rog
Hi, Thank you for your response. I checked it in Solr 4.8 but I think this has worked as I described for a very long time. I'm not 100% sure if it is really a bug or not. When I run a phrase query like "foo^1.0 bar" this works very similar to what happens in edismax with set *pf* parameter (boost part is not

Re: per field similarity not working with solr 4.2.1

2014-10-09 Thread elisabeth benoit
Thanks for the information! I've been struggling with that debug output. Any other way to know for sure my similarity class is being used? Thanks again, Elisabeth 2014-10-09 13:03 GMT+02:00 Markus Jelsma : > Hi - it should work, not seeing your implementation in the debug output is > a known iss

RE: per field similarity not working with solr 4.2.1

2014-10-09 Thread Markus Jelsma
Hi - it should work, not seeing your implementation in the debug output is a known issue. -Original message- > From:elisabeth benoit > Sent: Thursday 9th October 2014 12:22 > To: solr-user@lucene.apache.org > Subject: per field similarity not working with solr 4.2.1 > > Hello, > > I

RE: does one need to reindex when changing similarity class

2014-10-09 Thread Markus Jelsma
Hi - no, you don't have to, although you may need to if you changed how norms are encoded. Markus -Original message- > From:elisabeth benoit > Sent: Thursday 9th October 2014 12:26 > To: solr-user@lucene.apache.org > Subject: does one need to reindex when changing similarity class > > I've

does one need to reindex when changing similarity class

2014-10-09 Thread elisabeth benoit
I've read somewhere that we do have to reindex when changing similarity class. Is that right? Thanks again, Elisabeth

per field similarity not working with solr 4.2.1

2014-10-09 Thread elisabeth benoit
Hello, I am using Solr 4.2.1 and I've tried to use a per field similarity, as described in https://apache.googlesource.com/lucene-solr/+/c5bb5cd921e1ce65e18eceb55e738f40591214f0/solr/core/src/test-files/solr/collection1/conf/schema-sim.xml so in my schema I have and a custom similarity in f

Re: SolrCloud with client ssl

2014-10-09 Thread Jan Høydahl
We also have another bug here, that the request responds with status=0, which means success, when only parts of the distributed request succeeded, but not all. That probably warrants its own JIRA issue. The logs you printed are from the client. Can you also dig up the corresponding logs from th

Re: [ANNOUNCE] Luke 4.10.1 released

2014-10-09 Thread Bernd Fehling
Thanks for keeping this up to date. When starting luke-4.10.1.jar I get: SLF4J: Failed to load class “org.slf4j.impl.StaticLoggerBinder” May I suggest adding that class directly to luke as well? And another one, I get a warning that log4j.properties should not use "org.apache.hadoop.metrics.jvm.Ev

Re: Advise on an architecture with lot of cores

2014-10-09 Thread Aditya
Hi Manoj There are advantages to both approaches. I recently read an article, http://lucidworks.com/blog/podcast-solr-at-scale-at-aol/ . AOL uses Solr and it uses one core per user. Having one core per customer helps you 1. Easily migrate / backup the index 2. Load the core as and when require