Re: anyone has solrcloud performance numbers?

2012-10-02 Thread Otis Gospodnetic
Hi, Was trying to say you will need to run the benchmark yourself because each context is different. The linkedin tool I referred you to will help you do that - you don't have to bench non-solr engines. I also tried suggesting that while you are benchmarking you really want to be looking at variou

Re: anyone has solrcloud performance numbers?

2012-10-02 Thread varun srivastava
Otis, I am looking for performance benchmark numbers rather than performance monitoring tools. SPM looks like a monitoring tool. Moreover, it compares Solr with ElasticSearch etc.; I want a comparison between Solr 3.6 and SolrCloud. Thanks Varun On Tue, Oct 2, 2012 at 9:15 PM, varun srivastava wrote:

Re: anyone has solrcloud performance numbers?

2012-10-02 Thread varun srivastava
Thanks Otis On Tue, Oct 2, 2012 at 8:06 PM, Otis Gospodnetic wrote: > I don't have the URL handy, but guys at LinkedIn have a benchmark tool for > Solr, ElasticSearch, and Sensei. Check the list archives for URL and my > signature below for a tool that can show metrics for any of those systems,

Re: PHP client for a web application

2012-10-02 Thread Esteban Cacavelos
Thanks for your response Damien. As you said, you can do some basic things more quickly than with solr-php-client. I think it is a good choice for basic applications, and if more specific things are needed, then go with solr-php-client as well. 2012/10/2 Damien Camilleri > Hi esteban. Im currently using both i

Re: Query among multiple cores

2012-10-02 Thread Otis Gospodnetic
Solr join does work across multiple cores, as long as they are in the same JVM. Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Tue, Oct 2, 2012 at 11:09 PM, Nicholas Ding wrote: > Join is cool, but does i
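
A minimal sketch of what such a cross-core join request could look like, built client-side in Python. The core name (children), the field names (parent_id, id) and the inner query are made up for illustration; the fromIndex local parameter is the join option that names the other core, and it only applies when both cores live in the same JVM, as noted above.

```python
from urllib.parse import urlencode


def cross_core_join_params(from_field, to_field, from_core, inner_query):
    """Build request parameters for a Solr join that reads from another core.

    All argument values used below are hypothetical; adapt them to your schema.
    """
    join = "{!join from=%s to=%s fromIndex=%s}%s" % (
        from_field, to_field, from_core, inner_query)
    return {"q": join, "wt": "json"}


# Example: find documents in the queried core whose id matches the
# parent_id of red documents in a (hypothetical) core named "children".
params = cross_core_join_params("parent_id", "id", "children", "color:red")
query_string = urlencode(params)  # ready to append to .../select?
```

The resulting query string would be sent to the select handler of the core you want results from, not to the core named in fromIndex.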

Re: Follow links in xml doc

2012-10-02 Thread Otis Gospodnetic
Hi Billy, There is nothing in Solr that will do XML parsing and link extraction, so you'll need to do that part. Once you do that have a look at Solr join for parent-child querying. http://search-lucene.com/?q=solr+join Otis -- Search Analytics - http://sematext.com/search-analytics/index.html

Re: Query among multiple cores

2012-10-02 Thread Nicholas Ding
Join is cool, but does it work among multiple cores? On Solr's wiki, I saw it only applies to a single core. On Tue, Oct 2, 2012 at 11:06 PM, Otis Gospodnetic < otis.gospodne...@gmail.com> wrote: > Are the cores join-able? If so, you can use Solr's join feature to execute > just one query. > > Ot

Re: NoHttpResponseException using Solrj to index

2012-10-02 Thread Otis Gospodnetic
You need to add the jar with that missing class to the startup command line. Otis -- Performance Monitoring - http://sematext.com/spm On Oct 2, 2012 5:42 PM, "Rui Vaz" wrote: > Hey I am trying to make a simple application using solrj to index > documents. > I used the start.jar to start the Solr

Re: Query among multiple cores

2012-10-02 Thread Otis Gospodnetic
Are the cores join-able? If so, you can use Solr's join feature to execute just one query. Otis -- Performance Monitoring - http://sematext.com/spm On Oct 2, 2012 5:50 PM, "Nicholas Ding" wrote: > Hello, > > I'm working on a search project that involves searching against more than > one core.

Re: anyone has solrcloud performance numbers?

2012-10-02 Thread Otis Gospodnetic
I don't have the URL handy, but guys at LinkedIn have a benchmark tool for Solr, ElasticSearch, and Sensei. Check the list archives for URL and my signature below for a tool that can show metrics for any of those systems, which you'll probably want to observe during testing. Otis -- Performance Mo

Re: SolrJ - IOException

2012-10-02 Thread Rozdev29
Was it stalled due to gc pause? Sent from my iPhone On Oct 2, 2012, at 10:02 AM, "balaji.gandhi" wrote: > Hi Toke, > > We encountered this issue again. This time the SOLR servers were stalled. We > are at 30 TPS. > > Please let us know any updates in the HTTP issue. > > Thanks, > Balaji >

RE: Can SOLR Index UTF-16 Text

2012-10-02 Thread Fuad Efendi
Solr can index bytearrays too: unigram, bigram, trigram... even bitsets, tritsets, qatrisets ;- ) LOL I got strong cold... BTW, don't forget to configure UTF-8 as your default (Java) container encoding... -Fuad

Re: Problem with spellchecker

2012-10-02 Thread Jose Aguilar
Thank you for your help, the whole team overlooked this simple error. It was driving us crazy! :) Thanks!! Jose. On 10/2/12 1:23 AM, "Markus Jelsma" wrote: >The problem is your stray double quote: >text_general_fr" > >I'd think this would throw an exception somewhere. > > >-Original mes

ContentStreamUpdateRequest example in 4.0 Beta

2012-10-02 Thread Rui Vaz
Hi, Is there any complete implementation for Solr 4.0 Beta of a class which uses ContentStreamUpdateRequest to send data to the ExtractingRequestHandler , similar to this one for 3.1 version? http://wiki.apache.org/solr/ContentStreamUpdateRequ

Re: Can SOLR Index UTF-16 Text

2012-10-02 Thread Lance Norskog
If it is a simple text file, does that text file start with the UTF-16 "BOM" marker? http://unicode.org/faq/utf_bom.html Also, do UTF-8 files work? If not, then your setup has a basic encoding problem. And, when you post such a text file (for example, with curl), use the UTF-16 charset mime-type
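
For the BOM check suggested above, a small Python sketch may help. It only inspects the first bytes of the data and says nothing about how Solr itself handles the content; the function name is hypothetical.

```python
import codecs


def detect_utf16_bom(data):
    """Return 'utf-16-le' or 'utf-16-be' if data starts with a UTF-16 BOM, else None.

    Caveat: a UTF-32 little-endian BOM (FF FE 00 00) also begins with the
    UTF-16 little-endian BOM, so this check alone cannot tell them apart.
    """
    if data.startswith(codecs.BOM_UTF16_LE):
        return "utf-16-le"
    if data.startswith(codecs.BOM_UTF16_BE):
        return "utf-16-be"
    return None


# Build two small samples with explicit BOMs and one without.
sample_le = codecs.BOM_UTF16_LE + "h\u00e9llo".encode("utf-16-le")
sample_be = codecs.BOM_UTF16_BE + "h\u00e9llo".encode("utf-16-be")
sample_none = "hello".encode("utf-8")

# Python's generic "utf-16" codec consumes the BOM and picks the byte order.
decoded = sample_be.decode("utf-16")
```

If the check returns None for a file that is supposed to be UTF-16, the byte order has to be stated explicitly when the file is read or posted.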

NoHttpResponseException using Solrj to index

2012-10-02 Thread Rui Vaz
Hey I am trying to make a simple application using SolrJ to index documents. I used start.jar to start Solr. When I try to index a document into Solr I get the following exception: Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/http/NoHttpResponseException The except

RE: WordBreak spell correction makes split terms optional?

2012-10-02 Thread Dyer, James
The parentheses are being added by the spellchecker. I tried to envision a number of different scenarios when designing how this would work and at the time it seemed best to add parentheses around terms that originally were together but now are split up. From your example, I see this is a mist

Re: PHP client for a web application

2012-10-02 Thread Damien Camilleri
Hi Esteban. I'm currently using both in my application. Both are fine. Solarium is great because it models the concepts of Solr and can build queries using OOP. The other one is lower level, so you have to write queries manually, which can be good in some situations. Both are fast enough. Solariu

WordBreak spell correction makes split terms optional?

2012-10-02 Thread Carrie Coy
The user query "design your ownbinoculars" is corrected by the 'wordbreak' dictionary to: design your (own binoculars) Where are the parentheses coming from? Can I strip them with a post-processing filter? The parentheses make the terms optional, so, while the first match is excellent, t

Re: Can I rely on correct handling of interrupted status of threads?

2012-10-02 Thread Mikhail Khludnev
I remember a bug in EmbeddedSolrServer in 1.4.1 where an exception bypassed request closing, which led to a searcher leak and OOM. It was fixed about two years ago. On Tue, Oct 2, 2012 at 1:48 PM, Robert Krüger wrote: > Hi, > > I'm using Solr 3.6.1 in an application embedded directly, i.e. via > Embedd

Re: Hierarchical Data

2012-10-02 Thread Maurizio Cucchiara
Ciao Davide, Unfortunately changing the structure of the dbs is not an option for me (there are many legacy dbs involved), otherwise I would have chosen a /closure table/ instead of the /path enumeration/ you mentioned before. Furthermore, I'd need 2 PE fields: 1 for the values (ids) and 1 for th

Re: multivalued field question (FieldCache error)

2012-10-02 Thread Chris Hostetter
: I'm also using that field for a facet: Hmmm... that still doesn't make sense. Faceting can use FieldCache, but it will check if the field is multivalued to decide if/when/how to do this. There's nothing else in your requestHandler config that would suggest why you might get this error. can y

Re: Hierarchical Data

2012-10-02 Thread Davide Lorenzo Marino
Hi Maurizio, if you can manipulate your MySql db a simpler solution can be the following: 1 - Add a new field for your hierarchical data inside your table MY_HIERARCHICAL_FIELD 2 - Populate directly in MySql this new field with a simple procedure* 3 - Import the data in your Solr index *The MySql
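
The idea of precomputing a hierarchical (path enumeration) field from an adjacency list can be sketched in Python as follows. This is not the MySQL procedure referred to above, which is cut off in this listing; the function name, the sample ids and the "/" separator are assumptions made for illustration.

```python
def build_paths(parent_of, sep="/"):
    """Turn an adjacency list {id: parent_id or None} into {id: 'root/.../id'}.

    Assumes the data forms a forest: every parent id is itself a key
    and there are no cycles (a cycle would recurse forever).
    """
    paths = {}

    def path_for(node):
        if node not in paths:
            parent = parent_of[node]
            if parent is None:
                paths[node] = str(node)
            else:
                paths[node] = path_for(parent) + sep + str(node)
        return paths[node]

    for node in parent_of:
        path_for(node)
    return paths


# Hypothetical rows (id -> parent id), as they might come from the table.
rows = {1: None, 2: 1, 3: 2, 4: 1}
paths = build_paths(rows)
```

Each computed path could then be written to the new column (or straight into the Solr document) so that no recursive query is needed at import time.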

RE: SolrJ - IOException

2012-10-02 Thread balaji.gandhi
Hi Toke, We encountered this issue again. This time the SOLR servers were stalled. We are at 30 TPS. Please let us know any updates in the HTTP issue. Thanks, Balaji Balaji Gandhi, Senior Software Developer, Horizontal Platform Services Product Engineering │ Apollo Group, Inc. 1225 W. Washin

Hierarchical Data

2012-10-02 Thread Maurizio Cucchiara
Hi all, I'm trying to import some hierarchical data (stored in MySQL) into Solr, using DataImportHandler. Unfortunately, as most of you already know, MySQL has no support for recursive queries, so there is no way to get hierarchical data stored as an adjacency list. So I considered writing a DIH cus

Question about MoreLikeThis query with solrj

2012-10-02 Thread G.Long
Hi :) I'm using Solr 3.6.1 and I'm trying to use the similarity features of Lucene/Solr to compare texts. The content of my documents is in French so I defined a field like : indexed="true" stored="true"/> (it uses the default text_fr fieldType provided with the default schema.xml file)

Re: mapping values in fields

2012-10-02 Thread Erick Erickson
Ah, I get it (finally). OK, there's no good way to do what you want that I know of. The problem is that the stored="true" takes effect long before any transformations are applied, and is always the raw input. You effectively want to chain the fields together, i.e. apply the analysis chain _then_ ha
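
Since the stored value is always the raw input, one workaround is to apply the mapping before the document is sent to Solr, so that the raw input of mapped_field is already the mapped value. The Python sketch below assumes this client-side approach; it is not something proposed in the thread, and the helper name is hypothetical. The field names and values are the ones from the original question.

```python
# N:1 mapping taken from the example in the original question.
MAPPING = {
    "orig_value1": "mapped_value1",
    "orig_value2": "mapped_value1",
    "orig_value3": "mapped_value2",
}


def add_mapped_field(doc, mapping=MAPPING):
    """Return a copy of doc with mapped_field filled from original_field.

    Values without a mapping entry leave mapped_field unset.
    """
    result = dict(doc)
    value = doc.get("original_field")
    if value in mapping:
        result["mapped_field"] = mapping[value]
    return result


doc = add_mapped_field({"id": "42", "original_field": "orig_value2"})
```

With the value mapped before indexing, both the indexed terms and the stored value of mapped_field would be mapped_value1, at the cost of maintaining the mapping outside the Solr schema.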

Re: At a high level how does faceting in SolrCloud work?

2012-10-02 Thread Jamie Johnson
So does mincount get considered in this as well? On Tue, Oct 2, 2012 at 10:19 AM, Jamie Johnson wrote: > Thanks for this guys, really excellent explanation! > > On Thu, Sep 27, 2012 at 12:15 AM, Yonik Seeley wrote: >> On Wed, Sep 26, 2012 at 6:21 PM, Chris Hostetter >> wrote: >>> 2) the coordin

Re: At a high level how does faceting in SolrCloud work?

2012-10-02 Thread Jamie Johnson
Thanks for this guys, really excellent explanation! On Thu, Sep 27, 2012 at 12:15 AM, Yonik Seeley wrote: > On Wed, Sep 26, 2012 at 6:21 PM, Chris Hostetter > wrote: >> 2) the coordinator node sums up the counts for any constraint returned by >> multiple nodes, and then picks the top (facet.limi
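
The coordinator step quoted above (sum the counts per constraint across nodes, then keep the top facet.limit) can be illustrated with a simplified Python sketch. It deliberately ignores everything else the real distributed faceting does, such as follow-up requests to shards, and the shard data is invented.

```python
from collections import Counter


def merge_shard_facets(shard_counts, limit):
    """Sum per-shard facet counts and return the top `limit` (term, count) pairs.

    Ties are broken alphabetically here; that ordering is an assumption
    of this sketch, not a statement about Solr's behaviour.
    """
    total = Counter()
    for counts in shard_counts:
        total.update(counts)  # adds counts for terms seen on several shards
    ranked = sorted(total.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:limit]


# Three hypothetical shards, each returning its own local counts.
shards = [
    {"a": 3, "b": 1},
    {"a": 2, "c": 4},
    {"b": 5},
]
top = merge_shard_facets(shards, limit=2)
```

Note how "b" only wins after merging: no single shard ranks it first by a wide margin, which is why per-shard counts have to be combined before the limit is applied.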

Re: mapping values in fields

2012-10-02 Thread tech.vronk
The query is: mapped_field:mapped_value1 and seems to correctly return the documents. The mapped_field has attribute stored=true and also appears in the result (even without requesting it explicitly with fl), just with the orig_value1 instead of mapped_value1 matej Am 02.10.2012 15:46, s

Re: mapping values in fields

2012-10-02 Thread Erick Erickson
What's the query you send? I'm guessing a bit here since you haven't included it, but try ensuring two things: 1> your mapped_field has 'stored="true"' 2> you specify (either in your request handler or on the URL) fl=mapped_field Best Erick On Tue, Oct 2, 2012 at 9:04 AM, tech.vronk wrote:

mapping values in fields

2012-10-02 Thread tech.vronk
Hi, I am trying to map values from one field into other values in another field. For example: original_field: orig_value1 mapped_field: mapped_value1 with the help of an explicitly defined (N:1) mapping: orig_value1 => mapped_value1 orig_value2 => mapped_value1 orig_value3 => mapped_value2 I have tr

Tuning DirectUpdateHandler2.addDoc

2012-10-02 Thread Trym R. Møller
Hi I have been profiling SolrCloud when indexing into a sharded non-replica collection because indexing slows down when the index files (*.fdt) grow to a couple of GB (the largest is about 3.5GB). When profiling for a couple of minutes I see that most time is spent in the DirectUpdateHandle

Re: successfully move to glassfish but got error accessing Velocity sample code

2012-10-02 Thread Iwan Hanjoyo
Hello list, I finally solved the problem. I was missing the configuration of the Solr jar files in the solrconfig.xml file. Thank you. Kind regards, Hanjoyo On Mon, Oct 1, 2012 at 8:58 PM, Iwan Hanjoyo wrote: > Hello all, > > First, after extracting apache-solr-3.6.1.zip file, I can run and access > ht

Re: move solr.war to Glassfish and got error running http://host:port/ProjectName/browse

2012-10-02 Thread Iwan Hanjoyo
Hello list, I finally solved the problem. I was missing the configuration of the Solr jar file in the solrconfig.xml file. Thank you. Kind regards, Hanjoyo On Tue, Oct 2, 2012 at 5:57 PM, Iwan Hanjoyo wrote: > Hello list, > > > > On Sun, Sep 30, 2012 at 6:43 PM, Iwan Hanjoyo wrote: > >> Hello all, >>

Re: move solr.war to Glassfish and got error running http://host:port/ProjectName/browse

2012-10-02 Thread Iwan Hanjoyo
Hello list, On Sun, Sep 30, 2012 at 6:43 PM, Iwan Hanjoyo wrote: > Hello all, > > I used older Solr 3.6.1 version. > I created a new web project (called SolrRedo) on Netbeans 7.1.1 running on > Glassfish Web Server > Then I moved sources from the solr.war sample code (that resided inside > apa

Can I rely on correct handling of interrupted status of threads?

2012-10-02 Thread Robert Krüger
Hi, I'm using Solr 3.6.1 in an application embedded directly, i.e. via EmbeddedSolrServer, not over an HTTP connection, which works perfectly. Our application uses Thread.interrupt() for canceling long-running tasks (e.g. through Future.cancel). A while (and a few Solr versions) back a colleague o

Re: Synonyms Phrase not working

2012-10-02 Thread Mikhail Khludnev
Gustav, AFAIK, multi-word synonyms are one of the weak points of Lucene/Solr. I'm going to propose a solution approach at the forthcoming Eurocon http://www.apachecon.eu/schedule/presentation/18/ . You are welcome! -- Sincerely yours Mikhail Khludnev Tech Lead Grid Dynamics

Re: Deploying and securing Solr war in JBoss AS

2012-10-02 Thread Lee Carroll
Hi Billy, see http://wiki.apache.org/solr/SolrSecurity . One approach is to keep the master internal and expose read-only slaves with just select handlers defined in the Solr config for public-facing requests. See your app container security docs for other approaches. On 1 October 2012 16:32, Billy Newman wrote: >

RE: Problem with spellchecker

2012-10-02 Thread Markus Jelsma
The problem is your stray double quote: text_general_fr" I'd think this would throw an exception somewhere. -Original message- > From:Jose Aguilar > Sent: Tue 02-Oct-2012 01:40 > To: solr-user@lucene.apache.org > Subject: Problem with spellchecker > > We have configured 2 spellchecke

Re: httpSolrServer and external load balancer

2012-10-02 Thread Lee Carroll
Cheers, saved the day Lee C On 28 September 2012 23:27, Chris Hostetter wrote: > > : The issue we face is the f5 balancer is returning a cookie which the > client > : is hanging onto. resulting in the same slave being hit for all requests. > ... > : My question is can I configure the sol