Re: result grouping?

2007-01-03 Thread Ricardo Borillo
Hi, I don't know if solr can manage grouping. But you can do it using an XSLT stylesheet: http://www.jenitennison.com/xslt/grouping/muenchian.html Hope it helps :) On 1/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: Is it possible to group the results from a solr query? I have indexed the c

Re: result grouping?

2007-01-03 Thread Ryan McKinley
thanks. Yes, the presentation layer could group results, but that is not practical if i want to show the first 20 results out of 200,000 matches. Nutch groups the results by site. Any idea how they do it? thanks ryan On 1/3/07, Ricardo Borillo <[EMAIL PROTECTED]> wrote: Hi, I don't know if

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Thorsten Scherler
On Wed, 2007-01-03 at 02:16 +, Edward Garrett wrote: > thorsten, > > see the following for discussion. your case is indeed an annoyance--the > thread below discusses motivations for it and ways of working around it. (i > too confess that i wish it were not so.) > > http://www.mail-archive.com

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Edward Garrett
for what it's worth, i wrote a recursive template in xsl that replaces the escaped characters with actual elements. here, the variable $val would be the tag, e.g. "em". this has been working okay for me so far.

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Thorsten Scherler
On Wed, 2007-01-03 at 12:06 +, Edward Garrett wrote: > for what it's worth, i wrote a recursive template in xsl that replaces the > escaped characters with actual elements. here, the variable $val would be > the tag, e.g. "em". this has been working okay for me so far. Yeah, many thanks for po

Solr integration with Compass?

2007-01-03 Thread Lukas Vlcek
Hi, Has anybody integrated Solr with Compass yet? Is there any Solr-Compass integration best practices document? I am new to Solr, so sorry if my question sounds odd (it may very well be the case). Thanks, Lukas

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Erik Hatcher
On Jan 3, 2007, at 7:39 AM, Thorsten Scherler wrote: However I still think the highlighter should return unescaped tags for highlighting. There is IMO no benefit for the current behavior. That really isn't practical. Suppose the prefix were ">>" and the suffix were "<<"? It would return i

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Yonik Seeley
On 1/3/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote: However I still think the highlighter should return unescaped tags for highlighting. There is IMO no benefit for the current behavior. The problems all stem from the simple highlighter formatter mixing highlighting info directly into the s

Re: Solr Flare

2007-01-03 Thread Erik Hatcher
Based on a chat with an early adopter of Flare, we decided to split out the Ruby-Solr DSL sooner rather than later. I've pulled out the library code into a separate directory under client/ruby/solrb ("rb" being the standard Ruby file extension). Info for this library is here:

Re: result grouping?

2007-01-03 Thread Yonik Seeley
On 1/3/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: thanks. Yes, the presentation layer could group results, but that is not practical if i want to show the first 20 results out of 200,000 matches. Nutch groups the results by site. Any idea how they do it? Good question. Off the top of my he

Problem with running Solr Example in Example directory

2007-01-03 Thread Jagadish Channagiri
I am trying to run the example in the Solr distribution and can't open http://localhost:8983/solr/admin. Below is the stack trace I get as soon as I open http://localhost:8983/solr/admin. I have not set the classpath variable. I have set JAVA_HOME to c:\Java\jdk1.6.0. I have noticed th

Re: Solr Flare

2007-01-03 Thread Zaheed Haque
Erik: Thank you for your answer. I think I will be focusing more on the Rails application part. Some random thoughts and questions for brainstorming .. Some of these questions are really way down .. but I am trying to understand your roadmap rather .. 1. Are you thinking of any user/password/my p

Re: Problem with running Solr Example in Example directory

2007-01-03 Thread Yonik Seeley
It looks like the jre is being picked up before the JDK. Put the bin directory of the JDK in your path first, or use the full path to the JDK: c:\Java\jdk1.6.0\bin\java.exe -jar start.jar -Yonik On 1/3/07, Jagadish Channagiri <[EMAIL PROTECTED]> wrote: I am trying to run the example in Solr D

Re: Custom scorer...

2007-01-03 Thread escher2k
Yonik, I only saw options to specify the default operator (AND|OR) and to specify similarity. There was nothing for scoring - in case I need to modify the scoring. I got the similarity working by creating my own class. Thanks. Yonik Seeley wrote: > > On 1/2/07, escher2k <[EMAIL PROTECTED]>

Re: Question about similarity manipulation...

2007-01-03 Thread escher2k
Chris Hostetter wrote: > > > : The DisjunctionMaxQuery seems to yield the maximum score only. From my > > NOTE: by setting the "tiebreaker" value of a DisjunctionMaxQuery to "1.0" > it generates the sum of the scores > > : understanding, I would > : need to do the following - > : (1) Create a
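
The note about the "tiebreaker" can be checked with a little arithmetic. What follows is only a sketch of the combination rule (best sub-score plus tiebreaker times the remaining sub-scores), not Lucene code; the function name `dismax_score` and the sample scores are made up for illustration.

```python
def dismax_score(subscores, tie_breaker=0.0):
    """Combine per-field scores: best score plus tie_breaker times the rest.

    Hypothetical helper for illustration only; it mirrors the rule described
    in the thread and is not taken from Lucene or Solr.
    """
    if not subscores:
        return 0.0
    best = max(subscores)
    # The non-maximum scores are scaled by the tiebreaker and added on top.
    return best + tie_breaker * (sum(subscores) - best)


field_scores = [0.5, 0.25, 0.125]          # made-up per-field scores
print(dismax_score(field_scores))          # tiebreaker 0.0: maximum only
print(dismax_score(field_scores, 1.0))     # tiebreaker 1.0: plain sum
```

With a tiebreaker of 0.0 only the best field counts; at 1.0 the result equals the sum of all field scores, which is the behaviour the note describes.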

Re: Custom scorer...

2007-01-03 Thread Yonik Seeley
On 1/3/07, escher2k <[EMAIL PROTECTED]> wrote: I only saw options to specify the default operator (AND|OR) and to specify similarity. There was nothing for scoring - in case I need to modify the scoring. I got the similarity working by creating my own class. If you created your own Similarit

Re: Solr Flare

2007-01-03 Thread Erik Hatcher
Zaheed, All great questions. My answers will be vague, since I'm still in the early stages of how this will work, but I'll provide what I can below On Jan 3, 2007, at 2:28 PM, Zaheed Haque wrote: 1. Are you thinking of any user/password/my page/settings what not... for folksonomy/tagg

Multi column search...

2007-01-03 Thread escher2k
What is the syntax to specify a search across multiple fields in a URL? This is in case someone needs to alter the "searchable" fields dynamically as opposed to reading them at server startup from "qf" in solrconfig.xml. Also, it is easier at debug time :) Thanks. -- View this m

Re: Problem with running Solr Example in Example directory

2007-01-03 Thread Jagadish Channagiri
Thanks Yonik. I will go ahead and try your suggestion. -Jagadish On 1/3/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: It looks like the jre is being picked up before the JDK. Put the bin directory of the JDK in your path first, or use the full path to the JDK: c:\Java\jdk1.6.0\bin\java.exe -ja

Re: Multi column search...

2007-01-03 Thread Mike Klaas
On 1/3/07, escher2k <[EMAIL PROTECTED]> wrote: What is the syntax to specify a search across multiple fields in a URL? This is in case someone needs to alter the "searchable" fields dynamically as opposed to reading them at server startup from "qf" in solrconfig.xml. Also, it is e

Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
I've written an unpolished custom fragmenter for highlighting which is more expensive than the BasicFragmenter that ships with lucene, but generates more natural candidate fragments (it will tend to produce beginning/ends of sentences). Would there be interest in the community in releasing it and

Re: Better highlighting fragmenter

2007-01-03 Thread Michael Imbeault
I for one would be interested in such a fragmenter, as the default one is lacking and doesn't produce acceptable results for most applications. Michael Mike Klaas wrote: I've written an unpolished custom fragmenter for highlighting which is more expensive than the BasicFragmenter that ships wit

Re: Better highlighting fragmenter

2007-01-03 Thread Erik Hatcher
On Jan 3, 2007, at 6:36 PM, Mike Klaas wrote: I've written an unpolished custom fragmenter for highlighting which is more expensive than the BasicFragmenter that ships with lucene, but generates more natural candidate fragments (it will tend to produce beginning/ends of sentences). Would there

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Chris Hostetter
: > However I still think the highlighter should return unescaped tags for : > highlighting. There is IMO no benefit for the current behavior. The advantage is that the XmlResponseWriter has a duty to ensure that it produces well-formed XML regardless of configuration, data, or input. : The probl
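
The well-formedness argument is easy to reproduce outside Solr. The sketch below uses only the Python standard library and the ">>" / "<<" markers mentioned earlier in the thread; the `<str>` wrapper element and the helper names are assumptions made for the example, not Solr's actual response format or code.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def wrap(snippet, do_escape):
    """Place a highlighted snippet inside an element, optionally escaping it."""
    body = escape(snippet) if do_escape else snippet
    return "<str>" + body + "</str>"


def is_well_formed(doc):
    """Return True if the string parses as XML."""
    try:
        ET.fromstring(doc)
        return True
    except ET.ParseError:
        return False


snippet = "a >>match<< b"                         # hypothetical highlighter output
print(is_well_formed(wrap(snippet, False)))       # raw "<<" breaks the document
print(is_well_formed(wrap(snippet, True)))        # escaped text parses fine
print(ET.fromstring(wrap(snippet, True)).text)    # the client still sees the markers
```

Escaping costs the client nothing here: after parsing, the text comes back with the original markers intact, while the unescaped variant cannot be parsed at all.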

Re: Custom scorer...

2007-01-03 Thread Chris Hostetter
: specify similarity. There was nothing for scoring - in case I need to modify : the : scoring. I got the similarity working by creating my own class. Custom Scorer classes are really only possible via custom Query and Weight classes -- which are typically way beyond what most people need to mo

Re: Better highlighting fragmenter

2007-01-03 Thread Chris Hostetter
: I've written an unpolished custom fragmenter for highlighting which is : more expensive than the BasicFragmenter that ships with lucene, but : generates more natural candidate fragments (it will tend to produce : beginning/ends of sentences). : : Would there be interest in the community in relea

Re: Custom scorer...

2007-01-03 Thread escher2k
I haven't written any unfortunately. Right now, I have only implemented custom Similarity - in here, I am trying to alter the scoring substantially depending on the field. However, I might need to alter the overall document score - I am in the process of investigating if I can get away without ha

Re: Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
On 1/3/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: : I've written an unpolished custom fragmenter for highlighting which is : more expensive than the BasicFragmenter that ships with lucene, but : generates more natural candidate fragments (it will tend to produce : beginning/ends of sentences

Re: Better highlighting fragmenter

2007-01-03 Thread Chris Hostetter
: implementing Solr's highlighting. What I'd like to do is improve the : end-user's experience with highlighting in Solr. If as a result a : high-quality component for lucene Highlighter is fleshed out, that can : always be contributed to Lucene later. : : Generally, we should strive for a high-

Re: Better highlighting fragmenter

2007-01-03 Thread Walter Underwood
On 1/3/07 5:13 PM, "Mike Klaas" <[EMAIL PROTECTED]> wrote: > Generally, we should strive for a high-quality out-of-the-box > highlighting in Solr. That might involve making things like better > fragmenters and a few other tricks(*) the default setup, and providing > a "quick & dirty" setting for

Custom function...

2007-01-03 Thread escher2k
Hi, I am trying to create a linear function to influence the similarity computation. For example, if tf = 4, f(tf) = 150 * 1 + 150 * 0.3 = 195. The first occurrence is multiplied by 150. The next three occurrences are multiplied by 150 and divided by 10 (3/10). Howev
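
The worked example (tf = 4 gives 195) can be written out as a small function. This is a sketch of the arithmetic only, assuming the pattern continues linearly for other tf values and that tf = 0 yields 0; the name `linear_tf` and its parameters are hypothetical and it is not a Lucene Similarity implementation.

```python
def linear_tf(tf, weight=150.0, extra_divisor=10.0):
    """Full weight for the first occurrence, weight / extra_divisor for each later one.

    Hypothetical helper; the tf <= 0 case is an assumption, the post does not cover it.
    """
    if tf <= 0:
        return 0.0
    # First occurrence counts in full, the remaining (tf - 1) at one tenth.
    return weight + weight * (tf - 1) / extra_divisor


print(linear_tf(4))   # 150 * 1 + 150 * 0.3, the case from the post
print(linear_tf(1))   # a single occurrence gets the full weight
```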

Re: Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: I've implemented this before, once in Python and once in C, so I'd be glad to take a look at it. I'm not sure I have time to do a lot of implementation, but I'd sure be glad to help. Cool. I'll post the current fragmenter as a JIRA issue

Re: Searching multiple indices (solr newbie)

2007-01-03 Thread Chris Hostetter
Mekin: Yonik has done some brainstorming on ways of supporting "Federated" searching across multiple instances of Solr - but the main motivation there is to deal with homogeneous indexes which are too big to fit on a single host efficiently... http://wiki.apache.org/solr/FederatedSearch

Re: Better highlighting fragmenter

2007-01-03 Thread Yonik Seeley
On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: We tried several APIs and decided that the best was an array of String with the odd elements containing the strings that needed highlighting. Good idea... the only thing I could think of was an array of start/end offsets into the string, wh
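
To make the array-of-strings idea concrete, here is a rough sketch of how a client might render such a fragment. It assumes "odd elements" means odd zero-based indexes and that the client wants HTML output; `render_fragment` and the sample data are invented for the example and are not an existing Solr or Lucene API.

```python
from html import escape


def render_fragment(parts, pre="<em>", post="</em>"):
    """Join alternating plain/highlighted strings into one HTML snippet.

    Assumption: elements at odd zero-based indexes are the highlighted ones.
    """
    out = []
    for i, text in enumerate(parts):
        safe = escape(text)                  # the client escapes the text itself
        out.append(pre + safe + post if i % 2 == 1 else safe)
    return "".join(out)


parts = ["the ", "quick", " fox & ", "hound", ""]   # made-up fragment
print(render_fragment(parts))
```

Because the markup is added by the client after escaping, the highlighter never has to mix tags into the stored text, which is the source of the escaping problem discussed in the other highlighter thread.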

Re: Better highlighting fragmenter

2007-01-03 Thread Yonik Seeley
On 1/3/07, Mike Klaas <[EMAIL PROTECTED]> wrote: That is _much_ better than the current system. It wouldn't be hard to add start/end offsets to the fragments too, as Chris suggested so long ago. Or leave room for other info such as weights, or what term matched, etc. Quite a few aspects of t

Re: Better highlighting fragmenter

2007-01-03 Thread Walter Underwood
On 1/3/07 9:33 PM, "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: >> We tried several APIs and decided that the best was an array of >> String with the odd elements containing the strings that needed >> highlighting. > > Good idea... the only th

Re: Better highlighting fragmenter

2007-01-03 Thread Mike Klaas
On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: On 1/3/07 9:33 PM, "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > On 1/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote: >> We tried several APIs and decided that the best was an array of >> String with the odd elements containing the strings t