Re: Running Solr 6.3+ on Tomcat : Help Please

2017-02-22 Thread Prashant Saraswat
Hi Shawn, The problem was described in my original post: On 6.2.0: This works: http://localhost:8080/solr/index.html On 6.3+ http://localhost:8080/solr/index.html returns an HTTP error code 403 The Tomcat versions are the same so I don't think it's a Tomcat problem. Although there are no errors i

Re: requiring both

2017-02-22 Thread Rick Leir
Google handles this query (apparently) by having '&' as a synonym for AND, then finding the 3 word phrase "johnson & johnson". On 2017-02-22 10:35 AM, John Blythe wrote: hi all. how would you handle a query like "johnson AND johnson"? i don't want something that has "author: linden b. johnson

Re: Division in JSON Facet

2017-02-22 Thread Zheng Lin Edwin Yeo
No worries. Thanks for creating the JIRA too. Regards, Edwin On 18 February 2017 at 06:24, Chris Hostetter wrote: > > I'm sorry I didn't see your original question last month, but as you've > since realized you can't use functions like "div" in this way. > > The syntax for getting "stats" in t

Re: Select TOP 10 items from Solr Query

2017-02-22 Thread Zheng Lin Edwin Yeo
Hi Emir, Thanks for your suggestion. This should meet my current needs. Regards, Edwin On 20 February 2017 at 18:22, Emir Arnautovic wrote: > Json facets also have subfacets: http://yonik.com/solr-subfacets/ (check > if something changed in version that you use) > > Emir > > > > On 20.02.2017

Re: CloudSolrClient stuck in a loop with a recurring exception

2017-02-22 Thread Chetas Joshi
Yes, it is scala. And yes, I just wanted to confirm that I had to add exception handling and break out of the loop. Chetas. On Wed, Feb 22, 2017 at 4:25 PM, Shawn Heisey wrote: > On 2/22/2017 4:59 PM, Chetas Joshi wrote: > > 2017-02-22 15:27:06,994 ERROR o.a.s.c.solrj.impl.CloudSolrClient ~ > R
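
The fix confirmed in this reply, catching the exception and leaving the loop, might look roughly like the sketch below. It is written in Python rather than the Scala/SolrJ code used in the thread, and `fetch_page` is a hypothetical callable standing in for the real client query. Only the `cursorMark` convention (start at `*`, stop when the returned mark equals the one sent) is taken from Solr's cursor paging.

```python
from typing import Callable, Iterator, List, Tuple

# Hypothetical page fetcher: takes a cursor mark, returns (docs, next_cursor_mark).
FetchPage = Callable[[str], Tuple[List[dict], str]]


def iter_cursor(fetch_page: FetchPage, max_failures: int = 3) -> Iterator[dict]:
    """Walk a Solr-style cursor, giving up after repeated consecutive failures."""
    cursor = "*"  # Solr's starting cursor mark
    failures = 0
    while True:
        try:
            docs, next_cursor = fetch_page(cursor)
        except Exception as exc:
            failures += 1
            if failures >= max_failures:
                # Break out instead of retrying forever against a cluster that is down.
                raise RuntimeError(
                    f"giving up after {failures} failed requests"
                ) from exc
            continue
        failures = 0  # a successful request resets the failure counter
        yield from docs
        if next_cursor == cursor:
            return  # the cursor did not advance, so the result set is exhausted
        cursor = next_cursor
```

The retry limit is an arbitrary choice for illustration; a real client would probably also sleep between attempts.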

Re: CloudSolrClient stuck in a loop with a recurring exception

2017-02-22 Thread Shawn Heisey
On 2/22/2017 4:59 PM, Chetas Joshi wrote: > 2017-02-22 15:27:06,994 ERROR o.a.s.c.solrj.impl.CloudSolrClient ~ Request > to collection xx failed due to (510) org.apache.solr.common. > SolrException: Could not find a healthy node to handle the request., retry? > > Here is my code snippet. I go t

CloudSolrClient stuck in a loop with a recurring exception

2017-02-22 Thread Chetas Joshi
Hello, I am using Solr 5.5.1. Solr Cloud of 80 nodes deployed on HDFS. To get back results from Solr, I use the cursor approach and the cloudSolrClient object. While a query was running, I took the solr Cloud down. The client got stuck in a loop with the following exception: 2017-02-22 15:27:06,

Get docs with same value in one other field ?

2017-02-22 Thread Bruno Mannina
Hello all, I’m facing a problem that I would like to know if it’s possible to do it with one request in SOLR. I have SOLR 5. I have docs with several fields but here two are useful for us. Field 1 : id (unique key) Field 2 : fid (family Id) i.e: id:XXX fid: 1254 id: YYY fid: 125

RE: Get docs with same value in one other field ?

2017-02-22 Thread Bruno Mannina
OK Alex, I will look for a better solution. I'm afraid of getting an OOM with a huge number of ids. And yes, I already use a POST query; it was just to show my problem. Anyway, thanks for pointing this out as well. -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.

Re: Get docs with same value in one other field ?

2017-02-22 Thread Alexandre Rafalovitch
A thousand IDs could be painful to send and perhaps to run against. At minimum, look into splitting your query into multiple variables (so you could reuse the list in both direct and join query). Look also at using the terms query parser that specializes in lists of IDs. You may also need to
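
A rough sketch of that suggestion follows: the ID list is placed in its own request parameter and handled by the terms query parser, and the join query pulls it in by reference. The parameter name `idq` and the helper function are made up for illustration, the field names `id` and `fid` come from the original question, and none of this has been verified against the poster's Solr 5 setup.

```python
from urllib.parse import urlencode


def build_family_query(ids):
    """Build Solr params: match docs by id, then join on fid to get each family."""
    id_list = ",".join(ids)
    return {
        # Join from fid to fid; the inner query is read from the idq parameter.
        "q": "{!join from=fid to=fid v=$idq}",
        # The terms query parser takes a comma-separated list of values for one field.
        "idq": "{!terms f=id}" + id_list,
    }


params = build_family_query(["XXX", "YYY"])
body = urlencode(params)  # form-encoded, suitable as the body of a POST to /select
```

Keeping the list in a separate parameter means the same `idq` value can be reused in a second clause without sending the IDs twice.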

RE: Get docs with same value in one other field ?

2017-02-22 Thread Bruno Mannina
Just one more thing: I need to request up to 1000 ids. Currently I test with 2 or 3 and it takes time (my db is around 100 000 000 docs, 128 GB RAM). Do you think it could cause an OOM error if I test with up to 1000 ids? -Original Message- From: Bruno Mannina [mailto:bmann...@free.fr]

RE: Get docs with same value in one other field ?

2017-02-22 Thread Bruno Mannina
Yes, it's perfect! It works. Thanks David & Alexandre! -Original Message- From: David Hastings [mailto:hastings.recurs...@gmail.com] Sent: Wednesday, 22 February 2017 23:00 To: solr-user@lucene.apache.org Subject: Re: Get docs with same value in one other field ? sorry embedded link

Re: Get docs with same value in one other field ?

2017-02-22 Thread David Hastings
sorry embedded link: q={!join+from=fid+to=fid}id:ZZZ On Wed, Feb 22, 2017 at 4:58 PM, David Hastings < hastings.recurs...@gmail.com> wrote: > for a reference to some examples: > > https://wiki.apache.org/solr/Join > > so you'd want something like: > > q={!join+from=fid+to=fid}i >
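
For reference, the join local-params syntax takes both a `from` and a `to` field. The sketch below builds such a query and URL-encodes it; the host, port, and collection name in the URL are placeholders, not values from this thread.

```python
from urllib.parse import quote


def join_query(doc_id, field="fid"):
    """Return a Solr join query: find doc_id, then all docs sharing its field value."""
    return "{!join from=%s to=%s}id:%s" % (field, field, doc_id)


q = join_query("ZZZ")
# Placeholder endpoint; adjust host, port and collection for a real install.
url = "http://localhost:8983/solr/collection1/select?q=" + quote(q)
```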

Re: Get docs with same value in one other field ?

2017-02-22 Thread David Hastings
for a reference to some examples: https://wiki.apache.org/solr/Join so you'd want something like: q={!join+from=fid+to=fid}id:ZZZ I don't have much experience with this function, however On Wed, Feb 22, 2017 at 4:40 PM, Alexan

Re: Get docs with same value in one other field ?

2017-02-22 Thread Alexandre Rafalovitch
Sounds like two clauses with the second clause being a JOIN search where you match by ID and then join on FID. Would that work? Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 22 February 2017 at 16:27, Bruno Mannina wrote: > > > Hello all,

Get docs with same value in one other field ?

2017-02-22 Thread Bruno Mannina
Hello all, I'm facing a problem that I would like to know if it's possible to do it with one request in SOLR. I have SOLR 5. I have docs with several fields but here two are useful for us. Field 1 : id (unique key) Field 2 : fid (family Id) i.e: id:XXX fid: 1254 id: YYY fid: 1

Re: Solr on HDFS: AutoAddReplica does not add a replica

2017-02-22 Thread Hendrik Haddorp
I'm also not really an HDFS expert but I believe it is slightly different: The HDFS data is replicated, let's say 3 times, between the HDFS data nodes, but for an HDFS client it looks like one directory and it is hidden that the data is replicated. Every client should see the same data. Just lik

Re: Question about best way to architect a Solr application with many data sources

2017-02-22 Thread Tim Casey
I would possibly extend this a bit further. There is the source, then the 'normalized' version of the data, then the indexed version. Sometimes you realize you miss something in the normalized view and you have to go back to the actual source. This will be as likely as there are number of sources

Nested Document (aka Block Join) Query with Multiple Criteria

2017-02-22 Thread Chris Bell
I am trying to perform a Nested Document query containing multiple criteria, which can be treated as an AND in regards to the parent document. With the structure: ParentDocument: { id:1 ChildDocument: { id:1.1 field1:value1, field2:value2 }, ChildDocument:

Re: JapanesePartOfSpeechStopFilterFactory

2017-02-22 Thread Alexandre Rafalovitch
On 22 February 2017 at 12:27, Noriyuki TAKEI wrote: > field2:★ What does this (star) part of the query mean to you in business terms? Are you doing a wildcard search (see previous email exchange I replied to)? Or is this a specific special character you are trying to search for? Regards, Ale

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Alexandre Rafalovitch
Actually just doing a debug-enabled query with real keywords would show you what happens, as it will list the analyzed keywords and against which fields they run. So, if your stopword is present in debug, it got through the chain. If it is not - it has not. But I am glad that at least your puzzle
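
The check described here can be scripted. The sketch below assumes a JSON response obtained with `debugQuery=on` and reads the `parsedquery_toString` entry from its `debug` section. The helper names and the sample response are invented for illustration, and splitting the parsed string on whitespace is a simplification that will not handle every query shape.

```python
import json


def parsed_terms(response_text):
    """Return the field:term tokens listed in the debug section's parsed query."""
    data = json.loads(response_text)
    parsed = data.get("debug", {}).get("parsedquery_toString") or ""
    # Strip boolean prefixes and parentheses that decorate individual clauses.
    return [token.strip("+-()") for token in parsed.split()]


def term_survived(response_text, field, word):
    """True if field:word is still present after query-time analysis."""
    return f"{field}:{word}" in parsed_terms(response_text)


# Invented sample: the query was "the quick fox" and "the" is a stopword.
sample = json.dumps(
    {"debug": {"parsedquery_toString": "Description_note:quick Description_note:fox"}}
)
```

With this sample, `term_survived(sample, "Description_note", "the")` is false, meaning the stopword was removed by the analysis chain before the query ran.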

JapanesePartOfSpeechStopFilterFactory

2017-02-22 Thread Noriyuki TAKEI
Hi all, I would like to execute the query below. field1:someword AND field2:★ But it seems that the following query is executed instead. field1:someword I guess that the "solr.JapanesePartOfSpeechStopFilterFactory" filter excluded the word ★ from indexing and as a result the query "AND field:★" is exc

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Pratik Patel
That explains why I was getting back the results. Thanks! I was doing that query only to test whether stopwords are being indexed or not but apparently the query I had would not serve the purpose. I should rather have a document field with just the stop word and search against it without using wil

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Chris Rogers
Thanks Erick, Shawn, that was exactly it! On 22/02/2017, 16:29, "Erick Erickson" wrote: This should do it: bin/solr zk upconfig -z 172.28.128.9:2181/solr -n tolkien -d /home/bodl-tei-svc/solr-6.4.0/server/solr/configsets/tolkien_config Or use the 'bin/solr cp -r...' c

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Alexandre Rafalovitch
StopFilterFactory (and WordDelimiterFilterFactory and maybe others) are NOT multiterm aware. Using wildcards triggers the edge-case third type of analyzer chain that is automatically constructed unless you specify it explicitly. You can see the full list of analyzers and whether they are multiter

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Pratik Patel
Asterisks were not for formatting, I was trying to use a wildcard operator. Here is another example query and "parsed_query toString" entry for that. Query : http://localhost:8081/solr/collection1/select?debugQuery=on&indent=on&q=Description_note:*their*&wt=json "parsedquery_toString":"Descriptio

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Erick Erickson
This should do it: bin/solr zk upconfig -z 172.28.128.9:2181/solr -n tolkien -d /home/bodl-tei-svc/solr-6.4.0/server/solr/configsets/tolkien_config Or use the 'bin/solr cp -r...' command and specify the destination as zk:/solr/tolkien or something. upconfig/downconfig is just a form of cp design

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Chris Rogers
Hi Shawn, Thanks. I am indeed using a chroot. In my solr.in.sh file I have the following: ZK_HOST="172.28.128.9/solr" I think I understand you’re saying that I need to specify this chroot dir in the upconfig command? Where should this be specified? Something like: bin/solr zk upconfig -z 172.

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Erick Erickson
That's not what I'm looking for. Way down near the end there should be an entry like "parsed_query toString" This line is pretty suspicious: 82, "params":{ "q":"Description_note:* and *" Are you really searching for asterisks (I'd originally interpreted that as bolding which sometimes happens). P

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Erick Erickson
Shawn's nailed it. I wanted to add that in 6.4 there are additional commands in bin/solr to explore ZK, e.g. "ls" which also has a recurse option. Erick On Wed, Feb 22, 2017 at 8:15 AM, Shawn Heisey wrote: > On 2/22/2017 8:25 AM, Chris Rogers wrote: >> … as uploaded with upconfig. However, in th

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Shawn Heisey
On 2/22/2017 8:25 AM, Chris Rogers wrote: > … as uploaded with upconfig. However, in the /solr directory we have a > second /configs folder with the old schema.xml within it. I presume > this is the collection config being used, as it is named simply > ‘tolkien’ as defined in the create collection

Re: How to figure out whether stopwords are being indexed or not

2017-02-22 Thread Pratik Patel
Hi Erick, Thanks for the reply! Following is the relevant part of response header with debugQuery on. { "responseHeader":{ "status":0, "QTime":282, "params":{ "q":"Description_note:* and *", "indent":"on", "wt":"json", "debugQuery":"on", "_":"1487773835305"}}, "response":{"numFound":81771,"start":

Re: requiring both

2017-02-22 Thread Chris Hostetter
: how would you handle a query like "johnson AND johnson"? i don't want : something that has "author: linden b. johnson" to hit, only things that : actually have two occurrences. I'm not even sure if/how that would be possible using the underlying lucene Query objects available -- IIUC the Boole

Re: Running Solr 6.3+ on Tomcat : Help Please

2017-02-22 Thread Shawn Heisey
On 2/17/2017 12:46 PM, ps40 wrote: > Thanks for the quick response. Unfortunately, slf4j has nothing to do with > this. I removed the duplicate jars and the problem persists. Here are the > updated logs. None of the log messages you've shared indicate any errors, and you haven't described the prob

Re: Solr - example for using percentiles

2017-02-22 Thread Chris Hostetter
: > Using the stats component makes short work of things. : > : > stats=true&stats.field=foo : : The stats component has been rendered obsolete by the newer and shinier : json.facet stuff. json.facet still doesn't support multi-shard refinement, so saying stats (and/or) facet components are ob

requiring both

2017-02-22 Thread John Blythe
hi all. how would you handle a query like "johnson AND johnson"? i don't want something that has "author: linden b. johnson" to hit, only things that actually have two occurrences. currently using standard handler, not e/dismax. only thing i can think of currently is to migrate to dismax and when

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Chris Rogers
Hi Erick, Thanks for your reply. 1> Looking in the Solr UI I have only a single set of config files. 2> No, browser is not caching the page. Tried in anonymous browser and result is the same. 3> No, I don’t have a managed Schema file or managed schema factory definition. I am using Further in

Re: Solr on HDFS: AutoAddReplica does not add a replica

2017-02-22 Thread Erick Erickson
bq: in the none HDFS case that sounds logical but in the HDFS case all the index data is in the shared HDFS file system That's not really the point, and it's not quite true. The Solr index is unique _per replica_. So replica1 points to an HDFS directory (that's triply replicated to be sure). replica2

Re: Additional information from external database

2017-02-22 Thread Erick Erickson
There really isn't a _Tika_ database connector, Tika parses the structured files. A typical jdbc connector can connect to a DB. You might be thinking of Data Import Handler (DIH). Here's a program that both uses Tika and connects to a DB that might give you a hint. It uses an older version of Solr

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Erick Erickson
Chris: I'd check a couple of things: 1> did the files get copied where you expect? Do you have only one set of files in the zknode configs node (admin UI>>cloud>>tree>>expand the appropriate branches). This seems unlikely if you did the downconfig, but stranger things have happened. 2> Is there

Re: Solr - example for using percentiles

2017-02-22 Thread Bram Van Dam
On 17/02/17 13:39, John Blythe wrote: > Using the stats component makes short work of things. > > stats=true&stats.field=foo The stats component has been rendered obsolete by the newer and shinier json.facet stuff. - Bram
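
Both approaches mentioned in this thread can compute percentiles. The sketch below builds request parameters for each: the stats component with a `percentiles` local param, and `json.facet` with the `percentile()` aggregation. The field name `foo` comes from the thread; the helper names, the facet label `pct`, and the chosen percentile values are illustrative only.

```python
import json


def stats_params(field, percentiles):
    """Stats component request: percentiles are passed as a local param on stats.field."""
    plist = ",".join(str(p) for p in percentiles)
    return {
        "q": "*:*",
        "rows": "0",
        "stats": "true",
        "stats.field": "{!percentiles='%s'}%s" % (plist, field),
    }


def json_facet_params(field, percentiles):
    """JSON Facet request: percentile(field, p1, p2, ...) as an aggregation function."""
    plist = ",".join(str(p) for p in percentiles)
    facet = {"pct": "percentile(%s,%s)" % (field, plist)}
    return {"q": "*:*", "rows": "0", "json.facet": json.dumps(facet)}


stats_request = stats_params("foo", [50, 99])
facet_request = json_facet_params("foo", [50, 99])
```

Either dictionary can be form-encoded and sent to the `/select` handler.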

SolrClient.queryAndStreamResponse - QueryResponse should be used with care

2017-02-22 Thread Sebastian Riemer
Dear Solr users, I am considering switching from SolrClient.execute to SolrClient.queryAndStreamResponse, because I want to display the process of query execution. I've found http://stackoverflow.com/a/15810200/2747410 which seems to be a good starting point for me. However, the docs for SolrC

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Chris Rogers
Hi Gus, Thanks for the reply, but unfortunately this is not the problem. I am using the basic schema factory, with a directly edited schema file. Best, Chris On 22/02/2017, 12:00, "Gus Heck" wrote: Hi Chris, Are you perhaps using (by default) ManagedIndexSchemaFactory? https

Re: bin/solr start - long response on screen

2017-02-22 Thread Alexandre Rafalovitch
I _think_ there was a JIRA about lsof recently which seems to be an issue. What O/S are you on? I don't think this affects startup itself, just monitoring. Regards, Alex On 22 Feb 2017 2:35 AM, "Uchit Patel" wrote: Hi, I have upgraded SOLR to 6.4.0 from 5.1.0. When I am starting my SOLR I

Additional information from external database

2017-02-22 Thread Wilhelm Eger
Hi! I am using a setup of datafari (www.datafari.com), which more or less combines a ManifoldCF file index with Solr as a search engine. My setup consists of ~35 files, which are composed mainly of doc(x), xls(x), msg and pdf files. pdf files are ocr'd externally before they are added to t

Re: Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Gus Heck
Hi Chris, Are you perhaps using (by default) ManagedIndexSchemaFactory? https://cwiki.apache.org/confluence/display/solr/Schema+Factory+Definition+in+SolrConfig If so on first boot the schema.xml file is copied and then subsequently ignored in favor of the managed copy. If you do not wish to use

Problem managing Solr configsets on Zookeeper

2017-02-22 Thread Chris Rogers
Hi all, I’ve got a Zookeeper (3.4.8) / Solrcloud (6.4.0) setup with two nodes. I’ve successfully uploaded a collection through one of the Solr nodes: ~~~ bin/solr create -c tolkien -d tolkien_config -s 2 -rf 2 ~~~ I’ve then changed the schema.xml in the config set for this collection (in the

OBF SSL passwords do not work. [was: SSL Problem solr 6.4.1 - Error from shard - ADDREPLICA failed to create replica]

2017-02-22 Thread Marcel Berteler
OK, so I managed to 'fix' this issue, but I cannot explain why. The problem was that the SSL password was obfuscated on the server, whereas on my personal machine it was not. When starting SOLR 6.4.1 in cloud mode with an obfuscated SSL password, it starts, but is not fully functioning. The fir