Re: Incomplete documents with parent child DB relationship

2012-03-12 Thread Mikhail Khludnev
Hello, IIRC, DIH UI has some kind of verbose or debug option, which provides expanded trace of what's going on. Can't you try to check it? On Tue, Mar 13, 2012 at 4:35 AM, Tim Hurring wrote: > I'm new to SOLR and have managed to get some basic indexing and querying > working. However I haven't b

Re: Using multiple DirectSolrSpellcheckers for a query

2012-03-12 Thread Nalini Kartha
Hi James/Robert, Thanks for the responses. Robert: What is it about the current APIs that makes this hard? How much/what kind of refactoring would open this up? James: I didn't quite understand the usage you suggested. I thought that the spellcheck.q param shouldn't include field names, etc and

RE: How to index doc file in solr?

2012-03-12 Thread Rohan
Thanks Erick ,really appreciated. From: Erick Erickson [via Lucene] [mailto:ml-node+s472066n3819585...@n3.nabble.com] Sent: Monday, March 12, 2012 9:05 PM To: Rohan Ashok Kumbhar Subject: Re: How to index doc file in solr? Consider using SolrJ, possibly combined with Tika (which is what underlie

RE: List of recommendation engines with solr

2012-03-12 Thread Rohan
Hi Gora, Thanks a lot for your valuable comments, really appreciated. Yes, you understood me correctly: I am looking for exactly "Mahout", as I am using Java for my business layer with Apache Solr. Thanks, Rohan From: Gora Mohanty-3 [via Lucene] [mailto:ml-node+s472066n3819480...@n3.nabble.com] Sent

Highlighting a font without bold or italic modes

2012-03-12 Thread Lance Norskog
How do you highlight terms in languages without boldface or italic modes? Maybe raise the text size a couple of sizes just for that word? -- Lance Norskog goks...@gmail.com

Can solr-langid(Solr3.5.0) detect multiple languages in one text?

2012-03-12 Thread bing
Hi, all, I am using solr-langid(Solr3.5.0) to do language detection, and I hope multiple languages in one text can be detected. The example text is: 咖哩起源於印度。印度民間傳說咖哩是佛祖釋迦牟尼所創,由於咖哩的辛辣與香味可以幫助遮掩羊肉的腥騷,此舉即為用以幫助不吃豬肉與牛肉的印度人。在泰米爾語中,「kari」是「醬」的意思。在馬來西亞,kari也稱dal(當在mamak檔)。早期印度被蒙古人所建立的莫臥兒帝國(Mughal Empir

Incomplete documents with parent child DB relationship

2012-03-12 Thread Tim Hurring
I'm new to SOLR and have managed to get some basic indexing and querying working. However I haven't been able to successfully implement the indexing of a parent child database relationship. My db-data-config.xml is:

Additional Query with MLT

2012-03-12 Thread Jamie Johnson
Is there a way to provide an additional query constraint to the MLT component? My particular use case is I want to get similar documents, but limit them to the documents a user can actually see based on some authorization query. Is this currently possible?

Re: Trouble indexing word documents

2012-03-12 Thread Tomás Fernández Löbbe
it should be in "lucidworks-solr-3.2.0_01/dist/lucidworks-solr-cell-3.2.0_01.jar", don't you have that one? On Mon, Mar 12, 2012 at 5:44 PM, rdancy wrote: > I see the line - /> > but I don't see any solr cell jars, only Tika jars. I moved all the jars > over to my classpath directory. I'm using

Re: SolrCore error

2012-03-12 Thread Erick Erickson
Your attachment didn't come through; the mail server often strips this stuff. Please either inline it or put it up in some publicly accessible place. Best, Erick On Sun, Mar 11, 2012 at 10:51 PM, Nikhila Pala wrote: > Hi, > > I’m getting some exceptions while shutting the hybris s

Re: Display of highlighted search result should start with the beginning of the sentence that contains the search string.

2012-03-12 Thread Koorosh Vakhshoori
Hi Koji, I am Shyam's coworker. After some looking into this issue, I believe the problem of chopped word has to do with org.apache.lucene.search.vectorhighlight.SimpleFragListBuilder class' 'margin' field. It is set to 6 by default. My understanding is having margin value of greater than zero re

Re: MISSING LICENSE

2012-03-12 Thread Per Steffensen
Thank you both for your kind help. Regards, Steff Erick Erickson wrote: Per: You've been working with SolrCloud, haven't you? Yonik's right on, removing exampleB is what I had to do with the exact same problem. Erick On Mon, Mar 12, 2012 at 2:33 PM, Yonik Seeley wrote: Over-aggressive l

Re: Trouble indexing word documents

2012-03-12 Thread rdancy
I see the line - but I don't see any solr cell jars, only Tika jars. I moved all the jars over to my classpath directory. I'm using version lucidworks-solr-3.2.0_01. -- View this message in context: http://lucene.472066.n3.nabble.com/Trouble-indexing-word-documents-tp3819949p3820472.html Sent fr

Re: Performance (responsetime) on request

2012-03-12 Thread Dmitry Kan
This page should help you: http://wiki.apache.org/solr/SolrCaching -- Dmitry On Mon, Mar 12, 2012 at 5:37 PM, Ramo Karahasan < ramo.karaha...@googlemail.com> wrote: > Hi, > > thanks for you advice. Do you have any documentation on that? I'm not > sure, how and where to configure this stuff and

Re: query to some field in solr for multiple values

2012-03-12 Thread Ahmet Arslan
> I want to query on id field for multiple values like.. > > "q=id:['1', '5', '17']..." > > in mysql we perform the same query like.. > > "select * from table where id in(1,5,17) " > > how can we perform the same query in solr on id field? &q=1 5 17&q.op=OR&df=id

Re: Relational data

2012-03-12 Thread André Maldonado
Thanks, Ahmet and Tomás. It worked like a charm. -- "And you shall know the truth, and the truth shall set you free." (John 8:32) andre.maldonado@gmail.com (11) 9112-4227

query to some field in solr for multiple values

2012-03-12 Thread preetesh dubey
How can we perform query to single string type field for multiple values? e.g. I have the schema field like I want to query on id field for multiple values like.. "q=id:['1', '5', '17']..." in mysql we perform the same query like.. "select * from table where id in(1,5,17) " how can we perfor
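The reply to this question in the same thread suggests the parameters `q=1 5 17&q.op=OR&df=id`. As a rough sketch of how a client might assemble that query string, assuming Python's standard `urllib` on the client side (the helper name `build_id_query` is hypothetical and not part of Solr or SolrJ):

```python
from urllib.parse import urlencode


def build_id_query(ids, field="id"):
    """Build the query string for an SQL-style "IN" lookup on one field.

    Hypothetical helper: it only assembles the q, q.op and df parameters
    suggested in the thread; it does not talk to Solr.
    """
    params = {
        "q": " ".join(str(i) for i in ids),  # terms separated by spaces
        "q.op": "OR",                        # match any of the terms
        "df": field,                         # default field to search
    }
    return urlencode(params)


if __name__ == "__main__":
    # Spaces are encoded as '+', which Solr decodes back to spaces.
    print(build_id_query([1, 5, 17]))
```

The resulting string would be appended to the select handler URL of whatever Solr instance is in use; that URL is deployment-specific and is not shown here.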

Re: MISSING LICENSE

2012-03-12 Thread Erick Erickson
Per: You've been working with SolrCloud, haven't you? Yonik's right on, removing exampleB is what I had to do with the exact same problem. Erick On Mon, Mar 12, 2012 at 2:33 PM, Yonik Seeley wrote: > Over-aggressive license checking code doesn't like jars in extraneous > directories (like the w

RE: Including an attribute value from a higher level entity when using DIH to index an XML file

2012-03-12 Thread Mike O'Leary
I found an answer to my question, but it comes with a cost. With an XML file like this (this is simplified to remove extraneous elements and attributes): [message text] ... ... I can index the user id as a field in documents that represent each of the user's messages with this

Sort by bayesian function for 5 star rating

2012-03-12 Thread Zac Smith
Does anyone have an example formula that can be used to sort by a 5 star rating in SOLR? I am looking at an example on IMDB's top 250 movie list: The formula for calculating the Top Rated 250 Titles gives a true Bayesian estimate: weighted rating (WR) = (v ÷ (v+m)) × R + (m ÷ (v+m)) × C where
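For reference, the quoted formula can be sketched outside Solr in a few lines of Python. The snippet above is cut off before the symbols are defined, so the meanings used here are an assumption based on how this formula is usually described: R is the item's own mean rating, v its number of votes, m a minimum-votes threshold, and C the mean rating across all items. This is not a Solr function query, only the arithmetic.

```python
def weighted_rating(R, v, m, C):
    """Bayesian weighted rating: (v / (v + m)) * R + (m / (v + m)) * C.

    Assumed meanings (not confirmed by the truncated post):
      R: mean rating of this item
      v: number of votes for this item
      m: minimum votes threshold (acts as the weight of the prior)
      C: mean rating across all items
    """
    total = v + m
    if total <= 0:
        raise ValueError("v + m must be positive")
    return (v / total) * R + (m / total) * C


if __name__ == "__main__":
    # With few votes the result stays near C; with many votes it approaches R.
    print(weighted_rating(R=5.0, v=2, m=100, C=3.0))
    print(weighted_rating(R=5.0, v=10000, m=100, C=3.0))
```

The design point is that m controls how many votes an item needs before its own average outweighs the global mean.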

Solr Monitoring / Stats

2012-03-12 Thread Alex Leonhardt
Hi All, I was wondering if anyone knows of a free tool to use to monitor multiple Solr hosts under one roof ? I found some non functioning cacti & munin trial implementation but would really like more direct statistics of the JVM itself + all Solr cores (i.e. requests /s , etc.) ? Does anyon

Re: Trouble indexing word documents

2012-03-12 Thread Tomás Fernández Löbbe
Make sure the Solr cell jar is in the classpath. You probably have a line like this in your solrconfig.xml: Make sure that points to the right file. On Mon, Mar 12, 2012 at 2:59 PM, rdancy wrote: > Hello, I running Solr inside Tomcat and I'm trying to index a word.doc > using > curl and I g

Re: Relational data

2012-03-12 Thread Tomás Fernández Löbbe
You could use the grouping feature, depending on your needs: http://wiki.apache.org/solr/FieldCollapsing 2012/3/12 André Maldonado > Hi. > > I need to setup an index that have relational data. This index will be for > houses to rent, where the user will search for date, price, holydays (by > nam

Trouble indexing word documents

2012-03-12 Thread rdancy
Hello, I'm running Solr inside Tomcat and I'm trying to index a Word doc using curl and I get the following error: bash-3.2# curl "http://localhost:8585/solr/update/extract?literal.id=1&commit=true" -F "myfile=@troubleshooting_performance.doc" Apache Tomcat/6.0.14 - Error report HTTP Status 500 -

Re: Relational data

2012-03-12 Thread Ahmet Arslan
> The problem is that the same house can have different prices > for different > dates. > > If I denormalyze this data, I will show the same house > multiple times in > the resultset, and I don't want this. > > So, for example: > > House  Holyday       Price per > day > 1          Xmas      >  

Relational data

2012-03-12 Thread André Maldonado
Hi. I need to set up an index that has relational data. This index will be for houses to rent, where the user will search for date, price, holidays (by name), etc. The problem is that the same house can have different prices for different dates. If I denormalize this data, I will show the same h

Re: Knowing which fields matched a search

2012-03-12 Thread Russell Black
Paul, I would think debugQuery would make it slower too, wouldn't it? Where is the thread you are referring to? Is there a lucene jira ticket for this? On Mar 11, 2012, at 9:38 AM, Paul Libbrecht wrote: > Russel, > > there's been a thread on that in the lucene world... it's not really perfec

Re: Zookeeper view not displaying on latest trunk

2012-03-12 Thread Jamie Johnson
I have not pulled the latest (I pulled a week or two ago) and it works on that version. On Mon, Mar 12, 2012 at 11:40 AM, Stefan Matheis wrote: > Jamie, would you mind to give the latest another try, if the Cloud-Tab is > working as it should? > > On Thursday, February 9, 2012 at 6:57 PM, Mark

RE: solr 3.5 and indexing performance

2012-03-12 Thread Agnieszka Kukałowicz
Hi guys, I have hit the same problem with Hunspell. Doing a few tests for 500 000 documents, I've got: Hunspell from http://code.google.com/p/lucene-hunspell/ with 3.4 version - 125 documents per second Build Hunspell from 4.0 trunk - 11 documents per second. All the tests were made on 8 core CP

Re: Zookeeper view not displaying on latest trunk

2012-03-12 Thread Stefan Matheis
Jamie, would you mind to give the latest another try, if the Cloud-Tab is working as it should? On Thursday, February 9, 2012 at 6:57 PM, Mark Miller wrote: > > On Feb 9, 2012, at 12:09 PM, Jamie Johnson wrote: > > > To get this to work I had to modify my solr.xml to add a > > defaultCoreName

AW: Performance (responsetime) on request

2012-03-12 Thread Ramo Karahasan
Hi, thanks for your advice. Do you have any documentation on that? I'm not sure how and where to configure this stuff and what impact it has. Thanks, Ramo -Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: Monday, 12 March 2012 16:21 To: solr-user@lucene.a

Re: I wanna subscribe this maillist

2012-03-12 Thread Erick Erickson
Please follow the instructions here: http://lucene.apache.org/solr/discussion.html Best Erick On Mon, Mar 12, 2012 at 2:35 AM, 刘翀 wrote: > I wanna subscribe this maillist

Re: How to index doc file in solr?

2012-03-12 Thread Erick Erickson
Consider using SolrJ, possibly combined with Tika (which is what underlies Solr Cell). http://www.lucidimagination.com/blog/2012/02/14/indexing-with-solrj/ Although ExtractingRequestHandler has the capability of indexing metadata as well if you map the fields. See: http://wiki.apache.org/solr/Extra

Re: SOLR Query Intersection

2012-03-12 Thread balaji
Hi, Thank you guys Erik and Mikhail , You saved my day Thanks Balaji -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Query-Intersection-tp3818756p3819571.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: 3 Way Solr Join . . ?

2012-03-12 Thread Erick Erickson
I know it goes against the grain here for a DB person, but... denormalize. Really. Solr does many things well, but whenever you start trying to make it do database-like stuff you need to back up and re-think things. Simplest thing: Try indexing one record for each customer/purchase/complaint t

Re: Strange behavior with search on empty string and NOT

2012-03-12 Thread Erick Erickson
Because Lucene query syntax is not a strict Boolean logic system. There's a good explanation here: http://www.lucidimagination.com/blog/2011/12/28/why-not-and-or-and-not/ Adding &debugQuery=on to your search is your friend. You'll see that your return (at least on 3.5 with going at /solr/select)

Re: Performance (responsetime) on request

2012-03-12 Thread Dmitry Kan
You can optimize the documentCache by setting maxSize to some decent value, like 2000. Also configure some meaningful warming queries in the solrconfig. When increasing the cache size, monitor the RAM usage, as that can start increasing as well. Do you / would you need to use filter queries? T

AW: Performance (responsetime) on request

2012-03-12 Thread Ramo Karahasan
Hi, these are the results from the Solr admin page for the cache: name: queryResultCache class: org.apache.solr.search.LRUCache version: 1.0 description: LRU Cache(maxSize=512, initialSize=512) stats: lookups: 376 hits: 246 hitratio: 0.65 inserts: 130 evictions: 0 s
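As a quick sanity check on the figures posted above, the hit ratio and the number of misses can be recomputed from the lookups and hits. This is only a sketch of the arithmetic; the function name `cache_summary` is hypothetical, and the reading that every miss leads to an insert is an assumption that happens to match the posted numbers.

```python
def cache_summary(lookups, hits):
    """Recompute misses and hit ratio from cache lookup and hit counts."""
    misses = lookups - hits
    # Guard against an empty cache that has never been queried.
    hit_ratio = hits / lookups if lookups else 0.0
    return {"misses": misses, "hit_ratio": round(hit_ratio, 2)}


if __name__ == "__main__":
    # Figures from the queryResultCache stats in the post above.
    print(cache_summary(lookups=376, hits=246))
```

With the posted values this gives 130 misses, which equals the reported inserts, and a ratio that rounds to the reported 0.65. Evictions are 0, so this particular cache is not yet being forced to drop entries.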

Re: Faster Solr Indexing

2012-03-12 Thread Erick Erickson
How have you determined that it's the Solr add? By timing the call on the SolrJ side or by looking at the machine where Solr is running? This is the very first thing you have to answer. You can get a rough idea with any simple profiler (say Activity Monitor on a Mac, Task Manager on a Windows box).

Re: List of recommendation engines with solr

2012-03-12 Thread Gora Mohanty
On 12 March 2012 16:30, Rohan wrote: > Hi All, > > I would require list of recs engine which can be integrated with solr and > also suggest best one out of this. > > any comments would be appriciated!! What exactly do you mean by that? Why is integration with Solr a requirement, and what do you e

Re: MISSING LICENSE

2012-03-12 Thread Yonik Seeley
Over-aggressive license checking code doesn't like jars in extraneous directories (like the work directory that the war is exploded into under exampleB). delete "exampleB" and the build should work. -Yonik lucenerevolution.com - Lucene/Solr Open Source Search Conference. Boston May 7-10 On Mon, M

Re: MISSING LICENSE

2012-03-12 Thread Per Steffensen
Shawn Heisey wrote: On 3/12/2012 1:24 AM, Per Steffensen wrote: $ ant -version Apache Ant(TM) version 1.8.2 compiled on October 14 2011 What might be wrong? If you check lucene/BUILD.txt in your source, it says to use ant 1.7.1 or later, but not 1.8.x. This is from a recent trunk checkout:

Re: Performance (responsetime) on request

2012-03-12 Thread Dmitry Kan
If you look at solr admin page / statistics of cache, you could check the evictions of different types of cache. If some of them are larger than zero, try minimizing them by increasing the corresponding cache params in the solrconfig.xml. On Mon, Mar 12, 2012 at 10:12 AM, Ramo Karahasan < ramo.kar

Re: MISSING LICENSE

2012-03-12 Thread Shawn Heisey
On 3/12/2012 1:24 AM, Per Steffensen wrote: $ ant -version Apache Ant(TM) version 1.8.2 compiled on October 14 2011 What might be wrong? If you check lucene/BUILD.txt in your source, it says to use ant 1.7.1 or later, but not 1.8.x. This is from a recent trunk checkout: Basic steps: 0) I

Re: SOLR Query Intersection

2012-03-12 Thread Erik Hatcher
I've done exactly this, rendering Venn diagrams using Google Charts from Solr. See my presentation here: See slides 26-29, even with full code in the slides, but the code is also available here:

Re: SOLR Query Intersection

2012-03-12 Thread balaji
Hi, I got your point. Are you suggesting that I run it using the *facet.query* param for the various combinations? Thanks, Balaji -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Query-Intersection-tp3818756p3819165.html Sent from the Solr - User mailing list archive at Nabb
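One way to read the "facet.query for the various combinations" idea is to send a single request with one `facet.query` per combination of the three queries. The sketch below only builds the parameter list; the three field queries are hypothetical placeholders, and whether this matches what the repliers had in mind is an assumption.

```python
from itertools import combinations
from urllib.parse import urlencode


def venn_facet_params(queries):
    """Build request parameters with one facet.query per AND-combination.

    For three queries this yields 7 facet.query entries: each query alone,
    each pair, and all three together.
    """
    params = [("q", "*:*"), ("rows", "0"), ("facet", "true")]
    for size in range(1, len(queries) + 1):
        for combo in combinations(queries, size):
            joined = " AND ".join(f"({q})" for q in combo)
            params.append(("facet.query", joined))
    return params


if __name__ == "__main__":
    # Hypothetical queries; substitute the real three.
    params = venn_facet_params(["color:red", "size:large", "brand:acme"])
    print(urlencode(params))
```

Each returned count is the size of an intersection, not of an exclusive Venn region; the exclusive regions would be derived on the client by subtracting the larger intersections from the smaller ones.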

Re: Does the lucene support the substring search?

2012-03-12 Thread Ahmet Arslan
> Return to the post, I would like to know about whether the > lucene support > the substring search or not. > As you can see, one field of my document is long string > filed without any > spaces. It means the token doesn't work here. Suppose I want > to search a > string "TARCSV" in my documents.

Re: SOLR Query Intersection

2012-03-12 Thread balaji
Hi Mikhail, Yes, I am trying to get the facet counts for all of these and populate the chart, but the comparison between the values is what I am wondering about. Will facets handle all 3 possible scenarios? Thanks, Balaji -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Qu

Re: Solr 4.0

2012-03-12 Thread Jan Høydahl
Hi Robert, See http://wiki.apache.org/solr/Solr4.0 The developer community is working towards a 4.0-Alpha release expected in a few months, however no dates are fixed. Many already use a snapshot version of TRUNK. You are free to do so, at your own risk. -- Jan Høydahl, search solution architec

SolrCore error

2012-03-12 Thread Nikhila Pala
Hi, I'm getting some exceptions while shutting down the hybris server, and the exception details are specified in the file attached to this mail. Please try to resolve it as soon as possible. Thanks & Regards, Nikhila Pala Systems engineer Infosys Technologies Limited CAUTION - D

Re: how to ignore indexing of duplicated documents?

2012-03-12 Thread Marc Sturlese
http://wiki.apache.org/solr/Deduplication -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-ignore-indexing-of-duplicated-documents-tp3814858p3818973.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to index doc file in solr?

2012-03-12 Thread Rohan
Hi Erick, Thanks for the valuable comments on this. I have a set of Word doc files, and I would like to index the metadata as well as the content of each page. Is there any way to accomplish this? I need your comments on this. Thanks, Rohan -- View this message in context: http://l

List of recommendation engines with solr

2012-03-12 Thread Rohan
Hi All, I need a list of recommendation engines that can be integrated with Solr; please also suggest the best one among them. Any comments would be appreciated! Thanks, Rohan -- View this message in context: http://lucene.472066.n3.nabble.com/List-of-recommendation-engines-with-solr-tp3818917p3818917.

Re: SOLR Query Intersection

2012-03-12 Thread Mikhail Khludnev
It sounds like facets http://wiki.apache.org/solr/SolrFacetingOverview . Doesn't it? On Mon, Mar 12, 2012 at 1:16 PM, balaji wrote: > Hi , > > I am trying to Compare three independent queries,intersection among them > and draw an Venn diagram using the Google CHART . By using OR I will be > a

SOLR Query Intersection

2012-03-12 Thread balaji
Hi, I am trying to compare three independent queries, find the intersections among them, and draw a Venn diagram using Google Charts. By using OR I will be able to get the union of the 3 fields, and using AND I will be able to get the intersection among the three. Is it possible to get the union an

Performance (responsetime) on request

2012-03-12 Thread Ramo Karahasan
Hi, I've got two virtual machines in the same subnet at the same hosting provider. On one machine my web application is running, on the second a Solr instance. In Solr I use the following If I search from my web application in

MISSING LICENSE

2012-03-12 Thread Per Steffensen
Hi Just tried to "ant clean test" on latest code from trunk. I get a lot of "MISSING LICENSE" messages - e.g. [licenses] MISSING LICENSE for the following file: [licenses] .../solr/exampleB/work/Jetty_0_0_0_0_8900_solr.war__solr__dsbrc0/webapp/WEB-INF/lib/zookeeper-3.3.3.jar [licenses] Ex