Re: Get matched Term in join query

2014-12-08 Thread Mikhail Khludnev
Hello Peter, Let's limit or just fix the problem definition. I've got that dealing with cross core join id mandatory. Is it right? Then, do you need facets (from all resultset) or just a snippets (just from result page)? 09.12.2014 1:23 пользователь "Peter Sturge" написал: > Hi Forum, > > Is it

SOLR shards stay down forever

2014-12-08 Thread Norgorn
I'm using SOLR 4.10.1 in cloud mode with 3 instances, 5 shards per instance without replication. I restarted one SOLR and now all shards from that instance are down, but there are no errors in logs. All I see is 09.12.2014, 11:13:40WARNUpdateLog Starting log replay tlog{file=/opt/dat

Re: Clearing SolrCaches

2014-12-08 Thread Manohar Sripada
How to edit the configuration that is linked to a collection?? I am using SolrCloud and I upload my config to Zookeeper. So, if I modify and upload the config, will that not impact the latest collection as well, if I don't reload the latest collection? Thanks, Manohar On Mon, Dec 8, 2014 at 7:45

Re: Boosting the score using edismax for a non empty and non indexed field.

2014-12-08 Thread S.L
Anyone ? On Mon, Dec 8, 2014 at 2:45 AM, S.L wrote: > Hi All, > > I have a situation where I need to boost the score of a query if a field > (imageURL) in the given document is non empty , I am using edismax so I > know that using bq parameter would solve the problem. However the field > imageUR

Re: CloudSolrServer, concurrency and too many connections

2014-12-08 Thread JoeSmith
Thanks, Shawn. I updated to 7u72 and was not able to reproduce the problem. That was good. But just to be sure about this, I backed back down to 7u55 and again was not able to reproduce. So at least for now, this has gone away even if the reason is inconclusive. On Mon, Dec 8, 2014 at 7:37 AM,

Re: unable to build spellcheck in solr

2014-12-08 Thread Min L
Below is the solrconfig.xml. myfieldName is indexed in solr and is searchable. The spellcheck.build=true works in one env but not in another. suggestDict org.apache.solr.spelling.suggest.Suggester org.

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
Yes, that worked quite well. I still need the "//tagname" but that is the only DIH incantation I need. This will substantially accelerate things. On Mon, Dec 8, 2014 at 5:37 PM, Dan Davis wrote: > The problem is that XPathEntityProcessor implements Xpath on its own, and > implements a subset

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
The problem is that XPathEntityProcessor implements Xpath on its own, and implements a subset of XPath. So, if the input document is small enough, it makes no sense to fight it. One possibility is to apply an XSLT to the file before processing ite This blog post

Get matched Term in join query

2014-12-08 Thread Peter Sturge
Hi Forum, Is it possible for a Solr query to return the term(s) that matched a particular field/query? For example, let's say there's a field like this: raw="This is a raw text field that happens to contain some text that's also in the action field value..." And another field in a different inde

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Alexandre Rafalovitch
I don't believe there are any alternatives. At least I could not get anything but the full path to work. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedi

Re: DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
In experimentation with a much simpler and smaller XML file, it doesn't look like '//health-topic/@url" will not work, nor will '//@url' etc.So far, only spelling it all out will work. With child elements, such as , an xpath of "//title" works fine, but it is beginning to same dangerous. Is t

CLUSTERSTATUS timeout

2014-12-08 Thread Hutchins, Jonathan
We are currently running Solr 4.10.0 in production. We have run into an issue where we cannot have our monitoring system hit the CLUSTERSTATUS api command every five minutes (or even as long as every hour) without getting a significant number of timeouts on the command. Does this command retur

DIH XPathEntityProcessor question

2014-12-08 Thread Dan Davis
When I have a forEach attribute like the following: forEach="/medical-topics/medical-topic/health-topic[@language='English']" And then need to match an attribute of that, is there any alternative to spelling it all out: I suppose I could do "//health-topic/@url" since the document should

Re: Tika HTTP 400 Errors with DIH

2014-12-08 Thread Dan Davis
I would say that you could determine a row that gives a bad URL, and then run it in DIH admin interface (or the command-line) with "debug" enabled The url parameter going into tika should be present in its transformed form before the next entity gets going. This works in a similar scenario for me

Custom Rollup (Join) Query

2014-12-08 Thread Darin Amos
Hello, I posted this question within another thread and I think it got lost so I wanted to start a new thread about it. I have built a small POC for a customization I am hoping to get some validation on in case what I have built is a really bad implementation. I have been doing a lot of digging

Re: AW: Keeping capitalization in suggestions?

2014-12-08 Thread Ahmet Arslan
Hi Clemens, There a a number of ways to implement auto complete/suggest. Some of them pull data from indexed terms, therefore they will be lowercased. Some pull data from stored values, therefore capitalisation is preserved. Here are great resources on this topic. https://lucidworks.com/blog/a

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Yonik Seeley
On Mon, Dec 8, 2014 at 12:01 PM, Erik Hatcher wrote: > debug output tells a lot. Looks like in the last two examples that the > second part (Viewpoint*) is NOT parsed with the complex phrase parser - the > whitespace thwarts it. Actually, it looks like it is, but you're not telling the complex

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Thanks Erik Regards, Dinesh Babu. -Original Message- From: Erik Hatcher [mailto:erik.hatc...@gmail.com] Sent: 08 December 2014 17:02 To: solr-user@lucene.apache.org Subject: Re: How to stop Solr tokenising search terms with spaces debug output tells a lot. Looks like in the last two e

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Erik Hatcher
debug output tells a lot. Looks like in the last two examples that the second part (Viewpoint*) is NOT parsed with the complex phrase parser - the whitespace thwarts it. I’d recommend doing something like this to test that parser out to avoid the “meta” parsing issue. q={!complexphrase v=

Re: Preferred Scema/Config for Chinese Language Cores?

2014-12-08 Thread Tom Zimmermann
I tracked down an example from a sample solr config of a CJK setup with bigrams and no CJK tokenizer: < fieldType name="text_cjk" class="solr.TextField" positionIncrementGap="100"> Seems like it could be a good approach, but I also saw mention of an ICU Tokenizer that might be

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Hi Erik, 1. With search phrase in quotes {!complexphrase}displayName:"RVN Viewpoint*" "debug": { "rawquerystring": "{!complexphrase}displayName:\"RVN Viewpoint*\"", "querystring": "{!complexphrase}displayName:\"RVN Viewpoint*\"", "parsedquery": "ComplexPhraseQuery(\"RVN Viewpoint*\")"

Re: [ANN] Heliosearch 0.09 (JSON Request API + Distrib for Facet API)

2014-12-08 Thread Mike Murphy
Very nice, I like the integration of request parameters, parameter substitution, and the handling of multiple JSON snippets. This is starting to feel nicer than elasticsearch even! You just need to round it out a little more now. --Mike On Fri, Dec 5, 2014 at 12:45 PM, Yonik Seeley wrote: > http:

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Michael Sokolov
Right - allowing Solr to manage these queries (SOLR-6234) seems like the way to go ... OP == original poster (I lost track of who started the discussion) -Mike On 12/08/2014 10:19 AM, Mikhail Khludnev wrote: On Mon, Dec 8, 2014 at 5:38 PM, Michael Sokolov < msoko...@safaribooksonline.com>

AW: Keeping capitalization in suggestions?

2014-12-08 Thread Clemens Wyss DEV
Allthough making use of AnalyzingInfixSuggester I still getting "either or". When lowercase-filter is active I always get suggestions, BUT they are lowercased (i.e. "chamäleon"). When lowercase-filter is not active I only get suggestions when querying "Chamä" my solrconfig.xml ...

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Darin Amos
Hi Mikhail, I was merely posing a thought in an effort to continue to learn and educate myself. Your point about Weight.scorer() being called per segment helps my understanding. I am in the middle of building a POC for a customer of mine that I pointed out in this thread on Dec 5th (shortly aft

Re: Length norm not functioning in solr queries.

2014-12-08 Thread Mikhail Khludnev
It's worth to look into to check particular scoring values. But for most suspect is the reducing precision when float norms are stored in byte vals. See javadoc for DefaultSimilarity.encodeNormValue(float) On Mon, Dec 8, 2014 at 5:49 PM, S.L wrote: > I have two documents doc1 and doc2 and each

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Ahmet Arslan
Hi, May be you have omitTermFreqAndPositions=true set for your fields? Positions are necessary for phrase queries to work. Ahmet On Monday, December 8, 2014 5:20 PM, Dinesh Babu wrote: Hi Yonik, It is a text field ( all our search fields are of type text ). Very unlucky for me that it is

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Mikhail Khludnev
On Mon, Dec 8, 2014 at 5:38 PM, Michael Sokolov < msoko...@safaribooksonline.com> wrote: > I get the impression there was a concern that the caller could hold on to > the query generated by JoinUtil for too long - eg across requests in Solr. Michael, if you still bother, SOLR-6234

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Hi Yonik, It is a text field ( all our search fields are of type text ). Very unlucky for me that it is not working. Will try the NGram solution provided by Jack. Regards, Dinesh Babu. -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent

RE: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Dinesh Babu
Thanks a lot Jack. Will try this Solution. Regards, Dinesh Babu. -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: 07 December 2014 20:38 To: solr-user@lucene.apache.org Subject: Re: How to stop Solr tokenising search terms with spaces Thanks for the clarif

Length norm not functioning in solr queries.

2014-12-08 Thread S.L
I have two documents doc1 and doc2 and each one of those has a field called phoneName. doc1:phoneName:"Details about Apple iPhone 4s - 16GB - White (Verizon) Smartphone Factory Unlocked" doc2:phoneName:"Apple iPhone 4S 16GB for Net10, No Contract, White" Here if I search for q=iphone+4s+16gb&qf

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Michael Sokolov
I get the impression there was a concern that the caller could hold on to the query generated by JoinUtil for too long - eg across requests in Solr. I'm not sure why the OP thinks that would happen, though. -Mike On 12/08/2014 04:57 AM, Mikhail Khludnev wrote: On Fri, Dec 5, 2014 at 10:44 PM,

Re: Clearing SolrCaches

2014-12-08 Thread Shawn Heisey
On 12/8/2014 3:02 AM, Manohar Sripada wrote: > Can anyone please let me know on how to clear caches associated with an > IndexSearcher explicitly? > > In my project, I am creating a collection (say collection_1) which holds > the data for my organizations dataset. I am using filterCache, > queryRe

Re: Question on Solr Caching

2014-12-08 Thread Shawn Heisey
On 12/8/2014 2:42 AM, Manohar Sripada wrote: > Can you please re-direct me to any wiki which describes (in detail) the > differences between MMapDirectoryFactory and NRTCachingDirectoryFactory? I > found this blog > very > he

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Yonik Seeley
On Mon, Dec 8, 2014 at 2:50 AM, Dinesh Babu wrote: > I just tried your suggestion > > {!complexphrase}displayName:"RVN Viewpoint users" > > Even the above did not work. Am I missing any configuration changes for this > parser to work? What is the fieldType of displayName? The complexphrase quer

Re: CloudSolrServer, concurrency and too many connections

2014-12-08 Thread JoeSmith
We will need to update to 7u52, we are using 7u55. On the client side, this happens with zookeeper 3.4.6 and 4.10.2 solrj. And we will need to update both on the server side. What kind of config/setup information would you need to see if we do still have an issue after these updates? On Mon, D

Re: SegmentInfos exposed to /admin/luke

2014-12-08 Thread Mikhail Khludnev
Alexey, I've got that you need to get number of deleted docs in the index http://localhost:8983/solr/admin/mbeans?stats=true&cat=CORE here it goes 27 30 3 if you need to get detailed segmentation, parse StandardDirectoryReader(segments_b:19:nrt _6(4.10.2):C9 _7(4.10.2):C9 _8(4.10.2):C9) note C#

Re: SegmentInfos exposed to /admin/luke

2014-12-08 Thread Dmitry Kan
Hi Alexey, In GUI luke there is an option to "Just expunge deleted docs without re-merging". In case you want to give it a try. Dmitry On Wed, Dec 3, 2014 at 1:35 PM, Alexey Kozhemiakin < alexey_kozhemia...@epam.com> wrote: > Dear All, > > We have a high percentage of deleted docs which do not

Re: Slow queries

2014-12-08 Thread Siegfried Goeschl
Hi, using Jetty is the recommended approach while using Tomcat is not recommend (unless you are a Tomcat shop). But any discussion comes back to the original question - why is it slow now? Are you I/O-bound, are CPU-bound, how many documents are committed/deleted over the time, do you having

Clearing SolrCaches

2014-12-08 Thread Manohar Sripada
Hi, Can anyone please let me know on how to clear caches associated with an IndexSearcher explicitly? In my project, I am creating a collection (say collection_1) which holds the data for my organizations dataset. I am using filterCache, queryResultCache and DocumentCache extensively and these ar

Re: Slow queries

2014-12-08 Thread melb
THnks for the answer A dedicated box will be a great solution but I will wait for that solution, I have restricted sources Is Optimze action can improve performance? Is using default servlet engine Jetty can be harmful for the performance, SHould I use an independant tomcat engine? rgds, -- Vi

Re: Anti-Pattern in lucent-join jar?

2014-12-08 Thread Mikhail Khludnev
On Fri, Dec 5, 2014 at 10:44 PM, Darin Amos wrote: > public Scorer scorer(){ > TermsWithScoreCollector collector = new > TermsWithScoreCollector(); > JoinQuery.this.s.search(JoinQuery.this.q, > collector); > >

Re: Question on Solr Caching

2014-12-08 Thread Manohar Sripada
Thanks Shawn, Can you please re-direct me to any wiki which describes (in detail) the differences between MMapDirectoryFactory and NRTCachingDirectoryFactory? I found this blog very helpful which describes about MMapDirector

Re: How to stop Solr tokenising search terms with spaces

2014-12-08 Thread Erik Hatcher
What's the parsed query? &debug=true > On Dec 8, 2014, at 02:50, Dinesh Babu wrote: > > I just tried your suggestion > > {!complexphrase}displayName:"RVN Viewpoint users" > > Even the above did not work. Am I missing any configuration changes for this > parser to work? > > Regards, > Dine