RE: # of daily/weekly/monthly Solr downloads?

2014-12-09 Thread Alexey Kozhemiakin
Hi, according to slides #3 it's 250,000+ monthly downloads. http://www.slideshare.net/anshumg/ease-of-use-in-apache-solr -Original Message- From: Otis Gospodnetic [mailto:otis.gospodne...@gmail.com] Sent: Wednesday, December 10, 2014 01:25 To: solr-user@lucene.apache.org Subject: # of da

Re: Solr Composite Unique key from existing fields in schema

2014-12-09 Thread Ahmet Arslan
Hi, Once I used template transformer to generate unique id across entities. http://wiki.apache.org/solr/DataImportHandler#TemplateTransformer On Wednesday, December 10, 2014 8:51 AM, Rajesh Panneerselvam wrote: Hi, I'm using DIH to index my entities. I'm facing an issue while delta-import.

Solr Composite Unique key from existing fields in schema

2014-12-09 Thread Rajesh Panneerselvam
Hi, I'm using DIH to index my entities. I'm facing an issue while delta-import. I've declared multiple entities in one data-config.xml. The entities will have different primary key. Now if I want to delta-import how should I mention the UniqueKey in schema.xml. My data-config structure is like t

Re: AW: AW: Keeping capitalization in suggestions?

2014-12-09 Thread Ryan Yacyshyn
Hi Clemens, I recently added typeahead functionality to something I'm playing with and I used the EdgeNGramFilterFactory to help. I just tried this out after adding a doc with "Chamäleon" in my title. I was able to get "Chamäleon", with a capital C, returned I searched for chama, Chama, chamã, an

Re: SOLR shards stay down forever

2014-12-09 Thread Norgorn
The problem is, that hard commit is on, max uncommited docs = 500.000. And tlog size is just about 200 MB per shard - doesn't seem too big for me. The reason of my panic is the fact, that one shard in my old collection is down forever, without any unusual entries in logs. I tried different magic (

Re: Disappearance of post.jar from the new tutorial

2014-12-09 Thread Chris Hostetter
: Subject: Re: Disappearance of post.jar from the new tutorial : : I removed reference to it as the same class is in solr-core's JAR. : : The idea is to hide the details behind bin/post and before end of year : (before 5.0 release at least) to get that taken care of. This doesn't make any sen

Re: Get matched Term in join query

2014-12-09 Thread Peter Sturge
Hi, Your question is a good one - I have added an option to search through results and filter that way, but it's not ideal, as very often there are 10,000 or millions of hits, with only 20 results per page returned. I've realized I run into the classic 'Terms-can't-filtered' issue. To filter Term

# of daily/weekly/monthly Solr downloads?

2014-12-09 Thread Otis Gospodnetic
Hi, Does anyone know the number of daily/weekly/monthly Solr downloads? Thanks, Otis -- Monitoring * Alerting * Anomaly Detection * Centralized Log Management Solr & Elasticsearch Support * http://sematext.com/

Re: CLUSTERSTATUS timeout

2014-12-09 Thread Shalin Shekhar Mangar
Hi Jonathan, That shouldn't happen. The API returns the answer from the Overseer node (via ZK) and should return immediately. The API will timeout after 180s if somehow it cannot get a response from Overseer. I don't see why it would timeout. What's the read timeout on your monitoring system? Abo

Re: Q: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-09 Thread Alexandre Rafalovitch
On 9 December 2014 at 16:05, Chris Hostetter wrote: > (don't even get me started on quora and their bullshit "you can't even > *read* the content foolish people wrote for us for free w/o giving us > access to your social graph") +2 on Quora annoyance. It's even worse than that, because they don't

Re: Q: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-09 Thread Chris Hostetter
: But I am curious about other peoples' experiences with SO. Do you ask : questions in that forum? Do you answer? Why? How do you compare that : "support channel" with this one? Did you migrate from one to another? : Private replies are welcome, though I suspect this topic might be : interesting f

Re: Q: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-09 Thread Michael Sokolov
Alex, I spent some time answering questions there, but got ultimately got turned off by the competitive nature of it. I wanted to increase my score -- fun! But if you are not watching it all the time, the questions go by very fast, and you lose your edge. The typical pattern seems to be: so-so

RE: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-09 Thread Toke Eskildsen
Alexandre Rafalovitch [arafa...@gmail.com] wrote: > But I am curious about other peoples' experiences with SO. Do you ask > questions in that forum? Do you answer? Why? How do you compare that > "support channel" with this one? Did you migrate from one to another? I have answered a few questions o

Q: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-09 Thread Alexandre Rafalovitch
Hello, This is an informal survey trying to understand the community participation patterns. Most of the non-interactive Solr information-gathering activity is happening on Google/Bing/DDG/Yandex/etc. That's probably very common, though I'd love to see Google Analytics stats from websites with la

Re: Comparing Solr & Elasticsearch performance

2014-12-09 Thread Charlie Hull
Yes of course, starting with an OOTB configuration seemed sensible and obviously there is scope for tuning. It occurs to me that a comparison between tuned and OOTB Solr would also be interesting. We do sometimes find Solr configs that are barely modified example files! Cheers Charlie -- Charlie

Re: How to stop Solr tokenising search terms with spaces

2014-12-09 Thread Yonik Seeley
On Tue, Dec 9, 2014 at 12:49 PM, Dinesh Babu wrote: > > But my requirement is A* B* to be A* B* . A* OR B*won't meet my requirement. The syntax is what it is... With the complexphrase parser, if you want at phrase, you need to surround the clauses with double quotes: "A* B*" -Yonik http://heli

Re: "facet.mincount=0" returns facet values with 0 counts for "q=*" query

2014-12-09 Thread Chris Hostetter
in general, a facet count of 0 means the term is in the index but does not match an of the docs in the result set. if you are doing a query that matches all docs, and seeing facet values with a mincount of 0, that means the *term* is still in the index, but the documents that contained those t

"facet.mincount=0" returns facet values with 0 counts for "q=*" query

2014-12-09 Thread Abhishek Sharma
Hi, Can any one help me understand what does it mean to have facet results like this - "values": [ "4th of july flags", 0, "angela moore", 0, "anklets", 0, "applique flags",

RE: How to stop Solr tokenising search terms with spaces

2014-12-09 Thread Dinesh Babu
But my requirement is A* B* to be A* B* . A* OR B*won't meet my requirement. We have chosen the NGram solution and it is working for our rquirement at the moment. Thanks for your input and help Yonik Regards, Dinesh Babu. -Original Message- From: ysee...@gmail.com [mailto:ysee...@gma

Re: Comparing Solr & Elasticsearch performance

2014-12-09 Thread Alexandre Rafalovitch
I guess when you said you did not tune instances, you really really meant it. The Solr one looks like an example one with all the config files and Carrot enabled, etc. I was hoping for a bit more TodoMVC style. I guess that's for the next lull in the client work. Still, great to have it out there.

Comparing Solr & Elasticsearch performance

2014-12-09 Thread Charlie Hull
Hi all, We've been working on a study of any performance differences between Solr and Elasticsearch and we've also published the code we used - here's the background with links to Github http://www.flax.co.uk/blog/2014/12/09/comparing-solr-and-elasticsearch-heres-the-code-we-used/ Cheers

Re: Clearing SolrCaches

2014-12-09 Thread Shawn Heisey
On 12/8/2014 11:10 PM, Manohar Sripada wrote: > How to edit the configuration that is linked to a collection?? I am using > SolrCloud and I upload my config to Zookeeper. So, if I modify and upload > the config, will that not impact the latest collection as well, if I don't > reload the latest coll

Re: AW: AW: Keeping capitalization in suggestions?

2014-12-09 Thread Michael Sokolov
Clemens -- what I do (see suggestions of titles of books on $EMPLOYER's web site) is to define a field with no analysis (type=keyword, use KeywordAnalyzer) and build the suggestions from that. Then tell AIS to use an analyzer internally to pick out word from that (StandardAnalyzer, or Whit

Re: SOLR shards stay down forever

2014-12-09 Thread Erick Erickson
How big is your transaction log? If you don't do a hard commit (openSearcher = true or false doesn't matter), then the tlog can grow and upon restart the tlog gets replayed. I've seen tlogs in the 10s of G range which can take a long time to replay. In the mean time, new updates are written to, you

AW: AW: Keeping capitalization in suggestions?

2014-12-09 Thread Clemens Wyss DEV
Thanks for all the insightful links. I tried http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr but that approach returns searchresults instead of term-suggestions. I have (at the moment) a solution based on http://wiki.apache.org/solr/TermsComponent . But I might want m

Re: Length norm not functioning in solr queries.

2014-12-09 Thread Mikhail Khludnev
I wonder why your explains are so brief, mine looks like 0.4500489 = (MATCH) weight(text:inc in 17) [DefaultSimilarity], result of: 0.4500489 = fieldWeight in 17, product of: 1.0 = tf(freq=1.0), with freq of: 1.0 = termFreq=1.0 2.880313 = idf(docFreq=8, maxDocs=59) 0.15625

Re: Length norm not functioning in solr queries.

2014-12-09 Thread Ahmet Arslan
Hi, Default length norm is not best option for differentiating very short documents, like product names. Please see : http://find.searchhub.org/document/b3f776512ab640ec#b3f776512ab640ec I suggest you to create an additional integer field, that holds number of tokens. You can populate it via u

Re: Length norm not functioning in solr queries.

2014-12-09 Thread S.L
Hi , Mikhail Thanks , I looked at the explain and this is what I see for the two different documents in questions, they have identical scores even though the document 2 has a shorter productName field, I do not see any lenghtNorm related information in the explain. Also I am not exactly clear o

Re: Boosting the score using edismax for a non empty and non indexed field.

2014-12-09 Thread Erik Hatcher
Boosting will need to be done off an indexed field. But maybe rather than indexing the url value, maybe index another new hasImage field as a boolean true. No need to index the false values even. Erik > On Dec 8, 2014, at 02:45, S.L wrote: > > Hi All, > > I have a situation where I