Data Import handler and join select

2014-08-06 Thread Alejandro Marqués Rodríguez
Hi, I have one problem while indexing with data import hadler while doing a join select. I have two tables, one with products and another one with descriptions for each product in several languages. So it would be: Products: ID, NAME, BRAND, PRICE, ... Descriptions: ID, LANGUAGE, DESCRIPTION I

Solarium Extension

2014-08-06 Thread pushkar sawant
Hi All, I have done installation of Solarium Search on Magento 1.7 ver. my Solr 4.9 is also working in background. My Base OS is Ubuntu 13.10 on which solr 4.9 is running. when i go & check the extension in magento admin it only shows Test Connection. Please find attached image. Note - When Instal

Re: SOLRJ Stop Streaming

2014-08-06 Thread Mikhail Khludnev
Hello, Note that Response Streaming is purely client side sugar. Could you please take a stack trace proves your hypothesis? On Wed, Aug 6, 2014 at 11:40 PM, Felipe Dantas de Souza Paiva < cad_fpa...@uolinc.com> wrote: > Hi Guys, > > in version 4.0 of SOLRJ a support for streaming response was

How to grab matching stats in Similarity class

2014-08-06 Thread Hafiz Mian M Hamid
We're using solr 4.2.1 and use an extension of Lucene's DefaultSimilarity as our similarity class. I am trying to figure out how we could get hold of the matching stats (i.e. how many/which terms in the query matched on different fields in the retrieved document set) in our similarity class sinc

Re: how to change field value during index time?

2014-08-06 Thread Jack Krupansky
An update request processor could do the trick. You can use the stateless script update processor to code a JavaScript snippet to do whatever logic you want. Plenty of examples in my e-book: http://www.lulu.com/us/en/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-212

how to change field value during index time?

2014-08-06 Thread abhayd
hi I am indexing a csv file using csv handler. I have two fields f1 and f2. Based on value of f1 i want to set the value of f2. Like if(f1=='T') then f2=True; Is this something i can do during index time? I was reading about javascript transformers but that only seem to work with DIH Any help?

Re: Content-Charset header in HttpSolrServer

2014-08-06 Thread Chris Hostetter
: I was reviewing the httpclient code in HttpSolrServer and noticed that : it sets a "Content-Charset" header. As far as I know this is not a real : header and is not necessary. Anyone know a reason for this to be there? : I'm guessing this was just a mistake when converting from httpclient3 to

Re: Implementing custom analyzer for multi-language stemming

2014-08-06 Thread Rich Cariens
Yes, each token could have a LanguageAttribute on it, just like ScriptAttributes. I didn't *think* a span would be necessary. I would also add a multivalued "lang" field to the document. Searching English documents for "die" might look like: "q=die&lang=eng". The "lang" param could tell the Reques

Re: Anybody uses Solr JMX?

2014-08-06 Thread Erick Erickson
Lots of organizations use monitoring Solr's JMX for dashboards/healthchecks, all that sort of thing. AFAIK they're very light-weight. Best, Erick On Wed, Aug 6, 2014 at 2:18 PM, Paul Libbrecht wrote: > Hello Otis, > > this looks like an excellent idea! > I'm in need of that, erm… last week and

Re: indexing comments with Apache Solr

2014-08-06 Thread Jack Krupansky
Nested documents and block join MAY work, but... I'm not so sure that nutch be be able to send the data in the structure that Solr and Lucene would expect. You may have to do some sort of customer connector between nutch and Solr to do that. I mean, normally the output of nutch is simply a strea

Re: Anybody uses Solr JMX?

2014-08-06 Thread Paul Libbrecht
Hello Otis, this looks like an excellent idea! I'm in need of that, erm… last week and probably this one too. Is there not a risk that reading certain JMX properties actually hogs the process? (or is it by design that MBeans are supposed to be read without any lock effect?). thanks for the hin

Re: Suggestion for term searches

2014-08-06 Thread Jack Krupansky
The term vector component is only needed if you wish to retrieve specific term position and frequency information, but is not needed to perform queries. I think all you need to do is index the first word of the business name as a separate field and then boost the first term of the query for th

SOLRJ Stop Streaming

2014-08-06 Thread Felipe Dantas de Souza Paiva
Hi Guys, in version 4.0 of SOLRJ a support for streaming response was added: https://issues.apache.org/jira/browse/SOLR-2112 In my application, the output for the SOLR input stream is a response stream from a REST web service. It works fine, but if the client closes the connection with the RES

Suggestion for term searches

2014-08-06 Thread Corey Gerhardt
I have an interesting situation of searching Business Names where results should be partially sorted by position. Searching for "Kramer Tractors" will not result in any matches as there no results that exactly match this. However there are business names that start with Kramer and there are als

Re: matching "starts with" only

2014-08-06 Thread Erick Erickson
Right, this is a quirk of phrase queries. For wildcards to work in phrase queries you need SOLR-1604 (ComplexPhraseQueryParser). Or you need to escape your spaces, i.e. black\ cat* Best, Erick On Tue, Aug 5, 2014 at 11:09 PM, zameer wrote: > If we search only "black*" it works but when we use

Re: solr over hdfs for accessing/ changing indexes outside solr

2014-08-06 Thread Erick Erickson
bq: Are you aware of Cloudera search? I know they provide an integrated Hadoop ecosystem. What Cloudera Search does via the MapReduceIndexerTool (MRIT) is create N sub-indexes for each shard in the M/R paradigm via EmbeddedSolrServer. Eventually, these sub-indexes for each shard are merged (perhap

Re: Auto Complete

2014-08-06 Thread Michael Della Bitta
You'd still need to modify that schema to use the ASCII folding filter. Alternatively, if you want something off the shelf, you might check out Sematext's autocomplete product: http://www.sematext.com/products/autocomplete/index.html Michael Della Bitta Applications Developer o: +1 646 532 3062

enabling termVectors for a stored field/MoreLikeThis

2014-08-06 Thread Steven Sian
Hello, I'm currently using Solr's MoreLikeThis functionality. It's working, but slow. I've read that "for best results, use stored TermVectors in schema.xml" (https://wiki.apache.org/solr/MoreLikeThis). Can anyone tell me what other steps I need to perform to enable term vector storage, othe

Re: Anybody uses Solr JMX?

2014-08-06 Thread Gopal Patwa
Another option to get JMX data from Solr to Graphite or Ganglia using jmxtrans https://github.com/jmxtrans/jmxtrans/wiki On Wed, Aug 6, 2014 at 3:09 AM, rulinma wrote: > good job . > > > > -- > View this message in context: > ht

Re: Auto Complete

2014-08-06 Thread benjelloun
Hello thanks for the tutorial i test all schema but its not what i need. what i need is to auto complete with an autocorrection like i said before: q="gene" -->autocomplete "genève" with accent 2014-08-05 18:03 GMT+02:00 Michael Della Bitta-2 [via Lucene] < ml-node+s472066n4151261...@n3.nabble.co

Re: indexing comments with Apache Solr

2014-08-06 Thread Umesh Prasad
griddynamics blog is useful. It has 4 parts which covers block join quite well .. http://blog.griddynamics.com/2012/08/block-join-query-performs.html http://blog.griddynamics.com/2013/09/solr-block-join-support.html http://blog.griddynamics.com/2013/12/grandchildren-and-siblings-with-block.html

Unable to get the class from external jar in Update handler

2014-08-06 Thread Vivekanand Ittigi
Hi, I've made the a jar which contains a class called "ConcatClass" and i've put this jar under lib of solr. And i'm trying to access this class in update-script.groovy in /update handleer. But groovy is not picking up "ConcatClass" class, giving the following error: SEVERE: Unable to create cor

Re: indexing comments with Apache Solr

2014-08-06 Thread Ali Nazemian
Dear Alexandre, Hi, Thank you very much. I think nested document is what I need. Do you have more information about how can I define such thing in solr schema? Your mentioned blog post was all about retrieving nested docs. Best regards. On Wed, Aug 6, 2014 at 5:16 PM, Alexandre Rafalovitch wrote

Re: ExternalFileFieldReloader and commit

2014-08-06 Thread Mikhail Khludnev
Agree with you. In this case dummy docs makes sense, definitely. Don't consider it as a pitch, but there is a prototype of commit-free eff https://issues.apache.org/jira/browse/SOLR-4085 It might be useful for some extreme usecases. On Wed, Aug 6, 2014 at 5:20 PM, Peter Keegan wrote: > The use

Re: ExternalFileFieldReloader and commit

2014-08-06 Thread Peter Keegan
The use case is: 1. A SolrJ client updates the main index (and replicas) and issues a commit at regular intervals. 2. Another component updates the external files at other intervals. Usually, the commits result in a new searcher which triggers the org.apache.solr.schema.ExternalFileFieldReloader,

Re: indexing comments with Apache Solr

2014-08-06 Thread Alexandre Rafalovitch
You can index comments as child records. The structure of the Solr document should be able to incorporate both parents and children fields and you need to index them all together. Then, just search for JOIN syntax for nested documents. Also, latest Solr (4.9) has some extra functionality that allow

Re: indexing comments with Apache Solr

2014-08-06 Thread Jack Krupansky
Indexing decisions should always be based primarily by query and access requirements. So please tell us what your query and access requirements are. For example, what query terms might a user enter and what exactly might they want to see for results? I mean, on the indexing side you can index

Re: ExternalFileFieldReloader and commit

2014-08-06 Thread Mikhail Khludnev
Peter, Providing SOLR-6326 is about a bug in ExternalFileFieldReloader, I'm asking here: Did you try to use org.apache.solr.search.function.FileFloatSource.ReloadCacheRequestHandler ? Let's me know if you need help with it. As a workaround you can reload the core via REST or click a button at Solr

Re: ExternalFileFieldReloader and commit

2014-08-06 Thread Peter Keegan
I entered SOLR-6326 thanks, Peter On Tue, Aug 5, 2014 at 6:50 PM, Koji Sekiguchi wrote: > Hi Peter, > > It seems like a bug to me, too. Please file a JIRA ticket if you can > so that someone can take it. > > Koji > -- > http://soleami.com/blog/

Re: Anybody uses Solr JMX?

2014-08-06 Thread rulinma
good job . -- View this message in context: http://lucene.472066.n3.nabble.com/Anybody-uses-Solr-JMX-tp4134598p4151408.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: indexing comments with Apache Solr

2014-08-06 Thread Ali Nazemian
Dear Gora, I think you misunderstood my problem. Actually I used nutch for crawling websites and my problem is in index side and not crawl side. Suppose page is fetch and parsed by Nutch and all comments and the date and source of comments are identified by parsing. Now what can I do for indexing t

Re: indexing comments with Apache Solr

2014-08-06 Thread Gora Mohanty
On 6 August 2014 14:13, Ali Nazemian wrote: > > Dear all, > Hi, > I was wondering how can I mange to index comments in solr? suppose I am > going to index a web page that has a content of news and some comments that > are presented by people at the end of this page. How can I index these > comment

indexing comments with Apache Solr

2014-08-06 Thread Ali Nazemian
Dear all, Hi, I was wondering how can I mange to index comments in solr? suppose I am going to index a web page that has a content of news and some comments that are presented by people at the end of this page. How can I index these comments in solr? consider the fact that I am going to do some ana

Re: Solr vs ElasticSearch

2014-08-06 Thread Salman Akram
Thanks everyone!! This has been really helpful discussion and in short based on this we have taken the decision to stick to SOLR. On Mon, Aug 4, 2014 at 6:17 PM, Jack Krupansky wrote: > And neither project supports the Lucene faceting module, correct? > > And the ES web site says: "WARNING: Fac