Re: i wanna find one crawl that can crawl with defined urls and defined data

2007-04-30 Thread Graeme Merrall
i wanna crawl http://www.amazone.com/ and just wanna product title , product information, writer, publisher. and other data i wanna ignore. How about http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html or if you're prepared to wait or help out there's http://svn.apa

numFound for facet results

2007-04-30 Thread mirko
Hi, could you tell me what is the (simplest|elegant|fast) way of implementing the following: I use faceted browsing, but I limit the number of facet counts to 5 (i.e., facet.limit=5). 1. I would like to be able to show if there are more facet values (this can be achieved with the trick for askin

Re: numFound for facet results

2007-04-30 Thread Yonik Seeley
On 4/30/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: could you tell me what is the (simplest|elegant|fast) way of implementing the following: I use faceted browsing, but I limit the number of facet counts to 5 (i.e., facet.limit=5). 1. I would like to be able to show if there are more facet
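One common way to tell whether there are more facet values than the ones displayed is to request one value more than you intend to show. Whether this is the trick the truncated message refers to is an assumption. The sketch below also assumes the facet counts arrive as a flat list of alternating values and counts; the helper names are hypothetical.

```python
def facet_limit_param(display_limit):
    """Value to send as facet.limit: one more than we plan to display."""
    return display_limit + 1


def split_facets(flat_counts, display_limit):
    """Return (pairs_to_show, has_more) from a flat [value, count, ...] list.

    The flat layout is an assumption about the response format in use.
    """
    pairs = list(zip(flat_counts[0::2], flat_counts[1::2]))
    has_more = len(pairs) > display_limit
    return pairs[:display_limit], has_more


# Example: we display 5 values, so we asked for 6.
response_counts = ["a", 9, "b", 7, "c", 4, "d", 3, "e", 2, "f", 1]
shown, has_more = split_facets(response_counts, 5)
print(shown, has_more)
```

This only answers "is there at least one more value"; it does not give the total number of distinct facet values.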

Re: sorting by matched field, then title alpha

2007-04-30 Thread Simon Kahl
You can approximate it by doing something like: A:"phrase"^10 B:"phrase"^1 C:"phrase"^1000 D:"phrase"^100 E:"phrase"^30 Thanks for suggestion Mike. I tried boosting like this but all docs get slightly different scores (because of tf, idf etc) and so secondary sort on field X has no i

Re: resin faile to start with solr.

2007-04-30 Thread Ken Krugler
2007/4/29, Ken Krugler <[EMAIL PROTECTED]>: now i test the newest solr (nothing modified) i failed to start solr with resin 3.0 1. Which exact version of Resin? Still 3.0.23? 3.0.23 2. Just to confirm, you uncommented out the lines in web.xml mentioned previously? just newest solr's

RE: EmbeddedSolr class from Wiki

2007-04-30 Thread Chris Hostetter
: :you could even have the postCommit hook of your writer trigger a commit : :call on your readers so they reopen the newly updated index. : : Thanks, I need "separate JVMs" so "writer triggers a commit call on readers" : is slightly unclear... I want to use separate applications, webmodule with :

Re: resin faile to start with solr.

2007-04-30 Thread Chris Hostetter
: >>1. Which exact version of Resin? Still 3.0.23? : >2. Just to confirm, you uncommented out the lines in web.xml : >>mentioned previously? : Try uncommenting out the lines in the web.xml and see if that fixes : your problem. Ken: I'm not very familiar with the problem you are describing, would

Re: sorting by matched field, then title alpha

2007-04-30 Thread Chris Hostetter
: Think I will have to modify o.a.s.s.QueryParsing.parseSort to hook in custom : sort. Is there any better way? If you write a custom SortComparatorSource, then the easiest way to use it would probably be to write your own subclass of TextField and override the getSortField method to construct a

Re: resin faile to start with solr.

2007-04-30 Thread Ryan McKinley
Chris Hostetter wrote: : >>1. Which exact version of Resin? Still 3.0.23? : >2. Just to confirm, you uncommented out the lines in web.xml : >>mentioned previously? : Try uncommenting out the lines in the web.xml and see if that fixes : your problem. Ken: I'm not very familiar with the problem y

Re: numFound for facet results

2007-04-30 Thread Erik Hatcher
On Apr 30, 2007, at 11:16 AM, Yonik Seeley wrote: On 4/30/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: 2. I would like to be able to tell how many facet values are there total. (This would be a value like numFound for the results). Is there such a thing or a workaround like for 1. Numb

Delete from Solr index...

2007-04-30 Thread escher2k
I am trying to remove documents from my index using "delete by query". However when I did this, the deleted items seem to remain. This is the format of the XML file I am using - load_id:20070424150841 load_id:20070425145301 load_id:20070426145301 load_id:20070427145302 load_id:20070428145301 load

Faceted count syntax (exclude zeros)...

2007-04-30 Thread escher2k
I am trying to execute a faceted count on a field called "load_id" and want to exclude 0s. The URL below doesn't seem to be excluding zeros. http://localhost:12002/solr/select/?qt=dismax&q=Y&qf=show_all_flag&fl=load_id&facet=true&facet.limit=-1&facet.field=load_id&facet.mincount=1&rows=0 Result
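For reference, a minimal sketch of building the same kind of request programmatically, so the parameters are easy to inspect. The host, port, and field names are copied from the URL in the message; the function name is hypothetical. Whether facet.mincount is honored depends on the Solr version in use; older builds may expect facet.zeros=false instead, which is worth checking and is an assumption here.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def facet_count_url(select_url, facet_field, extra_params=None, mincount=1):
    """Build a select URL that returns facet counts only (rows=0)."""
    params = [
        ("rows", 0),                   # no documents, just facet counts
        ("facet", "true"),
        ("facet.field", facet_field),
        ("facet.limit", -1),           # no cap on the number of facet values
        ("facet.mincount", mincount),  # drop values whose count is below this
    ]
    params.extend((extra_params or {}).items())
    return select_url + "?" + urlencode(params)


url = facet_count_url(
    "http://localhost:12002/solr/select/",
    "load_id",
    {"qt": "dismax", "q": "Y", "qf": "show_all_flag", "fl": "load_id"},
)
print(url)
```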

Re: Delete from Solr index...

2007-04-30 Thread Ryan McKinley
escher2k wrote: I am trying to remove documents from my index using "delete by query". However when I did this, the deleted items seem to remain. This is the format of the XML file I am using - load_id:20070424150841 load_id:20070425145301 load_id:20070426145301 load_id:20070427145302 load_id:20

Re: Delete from Solr index...

2007-04-30 Thread escher2k
Thanks Ryan. I need to use query since I am deleting a range of documents. From your comment, I wasn't sure if one doesn't need to do an explicit commit when using delete by query. Does delete by query not need an explicit commit? Thanks. ryan mckinley wrote: > > escher2k wrote: >> I am tryin
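The XML in the original message appears to have lost its tags in the archive, so here is a minimal sketch of what delete-by-query update messages generally look like, followed by a commit. As far as I know, deletes only become visible to searchers after a commit. The update URL is an assumption based on the port used elsewhere in this thread list, and the helper names are hypothetical.

```python
import urllib.request
from xml.sax.saxutils import escape

UPDATE_URL = "http://localhost:12002/solr/update"  # assumed endpoint

COMMIT_XML = "<commit/>"


def delete_by_query_xml(query):
    """Wrap a query in a delete message, escaping XML special characters."""
    return "<delete><query>" + escape(query) + "</query></delete>"


def build_messages(queries):
    """One delete message per query, then a single commit at the end."""
    messages = [delete_by_query_xml(q) for q in queries]
    messages.append(COMMIT_XML)
    return messages


def post_xml(xml, url=UPDATE_URL):
    """POST one update message; not called here since it needs a live server."""
    request = urllib.request.Request(
        url,
        data=xml.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


messages = build_messages(["load_id:20070424150841", "load_id:20070425145301"])
for message in messages:
    print(message)
```

To apply the deletes, each message would be sent in order with post_xml(message), with the commit last.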

Specifying no-ops...

2007-04-30 Thread escher2k
I want to capture information about the user who is executing a particular search. Is there a way to specify in Solr that certain fields should just be treated as pass-through and not processed? This way I can use arbitrary params to do better logging. Thanks.