Re: Snippets Solr/Nutch

2008-04-10 Thread khirb7
Hello everybody, just one other question: to analyse and modify Solr's snippets, I want to know whether org.apache.solr.util.HighlightingUtils is the class that generates the snippets and which method generates them. Could you also please explain to me how they are generated in that class and where exactly to

Re: Human Powered Search Module

2008-04-10 Thread Mathieu Lecarme
Sushan Rungta wrote: Hello everybody, I am a newbie to Lucene and I am from India, currently working on a search module for our classified website, clickindia.com. I have implemented the basic functionality of Solr/Lucene and am pretty happy with the results. Search in Ind

How to customize Solr sort?

2008-04-10 Thread shawnliu
I have derived a new class from org.apache.solr.schema.StrField and implemented a custom sort algorithm through the SortComparatorSource interface. I then exported the jar file to the Solr lib directory and configured the schema.xml file. But when I test the new feature, it doesn't work at all

HTMLStripReader and script tags

2008-04-10 Thread Walter Ferrara
I've noticed that passing HTML to a field that uses HTMLStripWhitespaceTokenizerFactory ends up keeping some JavaScript too. For example, using an analyzer like: with a text such as: title pre var time = new Date(); ordval= (time.getTime()); post

help on caching and index files of Solr

2008-04-10 Thread Sagar Khetkade
Hello, I have hands-on experience with both Lucene and Solr. The differences between these two search engines are explained to some extent, but I still have some questions. I need to know: 1. I want some information on the difference between the caching of Lucene and Solr index files. 2. As Solr is

Re: HTMLStripReader and script tags

2008-04-10 Thread Yonik Seeley
It was the intention to remove script. I developed HTMLStripReader by just looking at a bunch of real-world HTML. I hadn't run across script in uppercase, so I didn't do a case insensitive check. The code is currently: if (name.equals("script") || name.equals("style")) { Should be easy enough
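A minimal sketch of the case-insensitive comparison discussed in this message. The class and method names (`TagNames`, `isIgnoredTag`) are hypothetical and are not taken from HTMLStripReader; only the `script`/`style` comparison comes from the message itself.

```java
public class TagNames {
    // Hypothetical helper, not the actual HTMLStripReader code:
    // returns true when the named tag is one whose content should be dropped.
    public static boolean isIgnoredTag(String name) {
        // equalsIgnoreCase matches "script", "SCRIPT", "Script", and so on;
        // calling it on the literal also makes a null name return false.
        return "script".equalsIgnoreCase(name) || "style".equalsIgnoreCase(name);
    }

    public static void main(String[] args) {
        System.out.println(isIgnoredTag("SCRIPT"));
        System.out.println(isIgnoredTag("title"));
    }
}
```

With the original `name.equals("script")` check, an uppercase `SCRIPT` tag would not match, which is consistent with the behaviour reported in the thread.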

Re: HTMLStripReader and script tags

2008-04-10 Thread Yonik Seeley
I've just committed a change to ignore case when comparing tag names.

-Yonik

On Thu, Apr 10, 2008 at 9:03 AM, Yonik Seeley <[EMAIL PROTECTED]> wrote:
> It was the intention to remove script.
> I developed HTMLStripReader by just looking at a bunch of real-world HTML.
> I hadn't run across script

Re: Snippets Solr/Nutch (maxFragSize?)

2008-04-10 Thread khirb7
khirb7 wrote:
>
> Hello everybody,
>
> just one other question: to analyse and modify Solr's snippets, I want to
> know whether org.apache.solr.util.HighlightingUtils
> is the class that generates the snippets and which method generates them, and
> could you please explain to me how they are generated in

Re: Snippets Solr/Nutch

2008-04-10 Thread Mike Klaas
On 10-Apr-08, at 12:26 AM, khirb7 wrote: Hello everybody, just one other question: to analyse and modify Solr's snippets, I want to know whether org.apache.solr.util.HighlightingUtils is the class that generates the snippets and which method generates them, and could you please explain to me how are th

Re: Multicore Issue with nightly build

2008-04-10 Thread kirk beers
Hi Ryan, I still can't seem to get my Solr cores, core0 and core1, to accept new documents. I changed the appropriate code in the Perl client to accommodate the core as you mentioned in the previous email. I am able to delete docs. Is there anything I might be missing in the basic core schema

Re: Solr + Complex Legacy Schema -- Best Practices?

2008-04-10 Thread Tkach
I realize this is a really vague sort of question with a lot of what-ifs, so feel free to just say we'll just have to try implementing one version, test, and see if the results are acceptable. :) Well, our searches are really more along the lines of searching on product "details" (brand/key wor

Searching for popular phrases or words

2008-04-10 Thread Edwin Koome
Gentlemen, I am new to Solr, and this may have been answered before. How can I search for popular phrases or words, with an option to include only, for example, technical terms (e.g. "Oracle database") rather than common English phrases? Please point me in the right direction. Regards, Eric

Re: Return the result only field A or field B is non-zero?

2008-04-10 Thread Chris Hostetter
If every document will definitely have a value for both fields, you can do... q = query & fq = -(+fieldA:0 +fieldB:0) ...it's more complicated if some docs don't have any value for one or both fields: if the fields are integers (and not floats) then the easiest thi
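As a rough illustration of the first case (every document has a value for both fields), the request parameters could be assembled as below. This is only a sketch: the class name `NonZeroFilter`, the example query `ipod`, and the host and handler path in `main` are assumptions, while the `fq` expression and the field names `fieldA` and `fieldB` are the ones given in the message.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class NonZeroFilter {
    // Filter from the message: exclude documents where BOTH fields are 0.
    static final String FILTER = "-(+fieldA:0 +fieldB:0)";

    // Builds the URL-encoded q and fq parameters for a select request.
    public static String buildParams(String userQuery) {
        return "q=" + URLEncoder.encode(userQuery, StandardCharsets.UTF_8)
             + "&fq=" + URLEncoder.encode(FILTER, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Hypothetical host and handler path; adjust to the actual deployment.
        System.out.println("http://localhost:8983/solr/select?" + buildParams("ipod"));
    }
}
```

Encoding matters here because a literal `+` in a URL query string is decoded as a space, which would silently turn the two required clauses into optional ones.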

Re: help on caching and index files of Solr

2008-04-10 Thread Chris Hostetter
Solr is an application that uses the Lucene Java library: everything that exists in Lucene exists in Solr, and Solr just adds on top of it. The raw Lucene index is in the data directory; Lucene's (minimal) caching is still used, and additional Solr-specific caching is added on top (see the wiki for

caching and indexes in Solr

2008-04-10 Thread Sagar Khetkade
Hello, I have hands-on experience with both Lucene and Solr. The differences between these two search engines are explained to some extent, but I still have some questions. I need to know: 1. The difference between the caching of Lucene and Solr index files. 2. As Solr is built on Lucene, is the index file