Re: Optimize SolrCloud without downtime

2015-03-31 Thread Pavel Hladik
When we indexing I see the deleted docs are a bit changing.. I was surprised when developer reindex 120M index, we had around 110M of deleted docs and this number was not falling. As you wrote, the typical behavior should be merging deleted docs to 10-20% of whole index? So it should be after two w

Re: Same schema.xml is loaded for different cores in SolrCloud

2015-03-31 Thread Zheng Lin Edwin Yeo
Yes, I've delete my previous collections, and retried these using zkcli and creating my collections using the collections API thereafter. It's working now. Thanks Erick. Regards, Edwin On 31 March 2015 at 13:55, Erick Erickson wrote: > By now, I wouldn't particularly trust my setup. I'd blow

Re: SolrJ commit with openSearcher=false

2015-03-31 Thread vidit.asthana
Thanks for reply Shawn. I will try it out. The reason that I am forced to do a hard commit through code is to handle a problem I am facing with transaction logs. I am forced to delete tlogs manually at regular interval and hence I want to issue a hard commit before deleting them to ensure that no

Re: Collapse and Expand behaviour on result with 1 document.

2015-03-31 Thread Joel Bernstein
You should be able to use collapse/expand with one result. Does the document in the main result set have group members that aren't being expanded? Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Mar 31, 2015 at 2:00 AM, Derek Poh wrote: > If I want to group the results (by a certain fie

Re: Collapse and Expand behaviour on result with 1 document.

2015-03-31 Thread Joel Bernstein
The way that collapse/expand is designed to be used is as follows: The main result set will contain the collapsed group heads. The expanded section will contain the expanded groups for the page of results. To render the page you iterate the main result set. For each document check to see if ther

What's the need for copyField> when you have "fq"

2015-03-31 Thread Steven White
Hi folks, I'm new to Solr and I have a question about , "q" and "fq". If I have 50 fields in a Solr doc and I index them without doing any to a catch-all-field called "all_text". During search I use "fq" to list all the 50 fields to search on. Now how different is this from not using "fq" and

RE: What's the need for copyField> when you have "fq"

2015-03-31 Thread Toke Eskildsen
Steven White [swhite4...@gmail.com] wrote: > If I have 50 fields in a Solr doc and I index them without doing any > to a catch-all-field called "all_text". During search I use > "fq" to list all the 50 fields to search on. Now how different is this > from not using "fq" and searching against my

RE: how do you replicate solr-cloud between datacenters?

2015-03-31 Thread Davis, Daniel (NIH/NLM) [C]
I got the answer to my most recent question without even asking it! Thanks -Original Message- From: Jack Krupansky [mailto:jack.krupan...@gmail.com] Sent: Monday, March 30, 2015 6:40 PM To: solr-user@lucene.apache.org Subject: Re: how do you replicate solr-cloud between datacenters? That

FYI: danizen and me

2015-03-31 Thread Davis, Daniel (NIH/NLM) [C]
In the wake of Hillary Clinton's email, I'll be asking questions about work related stuff as daniel.da...@nih.gov. https://github.com/danizen/ is both work and personal, as is the norm for github. Disclaimer - posts made my daniel.da...@nih.gov

Re: Optimize SolrCloud without downtime

2015-03-31 Thread Erick Erickson
I really don't have a good explanation here, those are the default values and the folks who set them up no doubt chose them with some care. Afraid I'll have to defer to people who actually know the code... Erick On Mon, Mar 30, 2015 at 11:59 PM, Pavel Hladik wrote: > When we indexing I see the d

solr.DictionaryCompoundWordTokenFilterFactory extracts words in string

2015-03-31 Thread Simon Martinelli
Hi, I configured solr.DictionaryCompoundWordTokenFilterFactory using a dictionary with the following content: - lindor - schlitten - dorsch - filet I want to index the compound words - dorschfilet - lindorschlitten dorschfilet is processed as expected dorsch filet but lindorschlitten is comp

Re: SolrJ commit with openSearcher=false

2015-03-31 Thread Shawn Heisey
On 3/31/2015 2:56 AM, vidit.asthana wrote: > Thanks for reply Shawn. I will try it out. > > The reason that I am forced to do a hard commit through code is to handle a > problem I am facing with transaction logs. > > I am forced to delete tlogs manually at regular interval and hence I want to > iss

Re: What's the need for copyField> when you have "fq"

2015-03-31 Thread Erick Erickson
Yet a third is that is often used when you want to treat the same data different ways. For instance, consider a "title" field. You might want to sort by title, but sorting on a tokenized field is undefined so I might use a copyField from "title" to "title_sort" and analyze the sort field with some

Re: SolrJ commit with openSearcher=false

2015-03-31 Thread Erick Erickson
Hmmm, you really shouldn't have to do this. What have you tried to figure out why the strange node isn't getting cleaned up? Is there anything in the Solr logs that might help? Is it a Windows machine? Some of the delete semantics for Windows can leave things around. What happens if you restart th

Filtering in Solr

2015-03-31 Thread Steven White
Hi folks, I need filtering capability just as described here for Lucene: http://www.javaranch.com/journal/2009/02/filtering-a-lucene-search.html "Filtering is a mechanism of narrowing the search space, allowing only a subset of the documents to be considered as possible hits. They can be used to

Re: Solr 5.0.0 and HDFS

2015-03-31 Thread Joseph Obernberger
I've tried to replicate the issue starting from new, but so far it hasn't happened again. -Joe On 3/28/2015 2:10 PM, Mark Miller wrote: Hmm...can you file a JIRA issue with this info? - Mark On Fri, Mar 27, 2015 at 6:09 PM Joseph Obernberger wrote: I just started up a two shard cluster on

Re: Filtering in Solr

2015-03-31 Thread Shawn Heisey
On 3/31/2015 12:25 PM, Steven White wrote: > I need filtering capability just as described here for Lucene: > http://www.javaranch.com/journal/2009/02/filtering-a-lucene-search.html > > "Filtering is a mechanism of narrowing the search space, allowing only a > subset of the documents to be consider

Re: Spark-Solr in python

2015-03-31 Thread Timothy Potter
You'll need a python lib that uses a python ZooKeeper client to be SolrCloud-aware so that you can do RDD like things, such as reading from all shards in a collection in parallel. I'm not aware of any Solr py libs that are cloud-aware yet, but it would be a good contribution to upgrade https://gith

RE: Spark-Solr in python

2015-03-31 Thread Davis, Daniel (NIH/NLM) [C]
There is a pull request for that - https://github.com/toastdriven/pysolr/pull/138. Depending on how you install Python modules, you could grab the cone for the feature, and run that version. -Original Message- From: Timothy Potter [mailto:thelabd...@gmail.com] Sent: Tuesday, March 31,

Stopwords magic

2015-03-31 Thread Alex Sylka
My stopwords don't works as expected. Here is part of my schema:

How to find out which fields a search came from

2015-03-31 Thread Steven White
Hi folks, When I get my hits back from Solr, is there a way to find out into which fields my search term matched in? For example, if the indexed document is: doc_1: title = From Russia with Love director = Terence Young starting = Sean Connery, Redro Amendariz, Lotte Lenya, mus

Re: Stopwords magic

2015-03-31 Thread Jack Krupansky
Use the Solr Admin UI analysis page to see how the text is analyzed at both index and query time. My e-book does have more narrative and examples for stop word processing: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html -- Jack Kr

RE: How to find out which fields a search came from

2015-03-31 Thread Reitzel, Charles
Highlighting is the way to go. Note, you have options to make it better suit your application. e.g. You can control the delimiters the highlighter uses. You can also choose from a couple different implementations. We have been able to use the highlight results, as is, to pull data from fie

Re: how do you replicate solr-cloud between datacenters?

2015-03-31 Thread Timothy Ehlers
Yes, thank you. On Tue, Mar 31, 2015 at 9:54 AM, Davis, Daniel (NIH/NLM) [C] < daniel.da...@nih.gov> wrote: > I got the answer to my most recent question without even asking it! > Thanks > > -Original Message- > From: Jack Krupansky [mailto:jack.krupan...@gmail.com] > Sent: Monday, March

Re: Collapse and Expand behaviour on result with 1 document.

2015-03-31 Thread Derek Poh
There is only 1 document in the main result set.The expanded section is empty. On 3/31/2015 7:37 PM, Joel Bernstein wrote: You should be able to use collapse/expand with one result. Does the document in the main result set have group members that aren't being expanded? Joel Bernstein http:

Re: Unable to perform search query after changing uniqueKey

2015-03-31 Thread Zheng Lin Edwin Yeo
Thanks Erick. Yes, it is able to work correct if I do not use spaces for the field names, especially for the uniqueKey. Regards, Edwin On 31 March 2015 at 13:58, Erick Erickson wrote: > I would never put spaces in my field names! Frankly I have no clue > what Solr does with that, but it can't

Solr Cloud Security not working for internal authentication

2015-03-31 Thread Swaraj Kumar
I am trying to use Solr Security on Solr 5.0 Cloud. Following process I have used :- 1. Modifying web.xml :- AdminAllowedQueries /admin/* admin BASIC Solr Realm Admin admin 1. Changes in jetty.xml :- Solr Realm /etc/realm.properties 0 2.

RE: Spark-Solr in python

2015-03-31 Thread Chaushu, Shani
There is a package of python with solr-cloud https://pypi.python.org/pypi/solrcloudpy but I don't know if there is possibility to connect it to spark -Original Message- From: Timothy Potter [mailto:thelabd...@gmail.com] Sent: Tuesday, March 31, 2015 23:15 To: solr-user@lucene.apache.org