Re: what crawler do you use for Solr indexing?

2009-03-13 Thread ristretto.rb
Hello, I built my own crawler with Python, as I couldn't find (not complaining, probably didn't look hard enough) Nutch documentation. I use BeautifulSoup, because the site is mostly based on Python/Django, and we like Python. Writing one was good for us because we spent most of our time figurin

Re: unique result

2009-03-13 Thread ristretto.rb
FWIW... We run a hash of the content and other bits of our docs, and then remove duplicates according to specific algorithms. (Exactly the same page content can clearly be hosted on many different URLs and domains.) Then the chosen ones are indexed. Though we toss the synonyms in the inde
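A rough sketch of the hash-then-dedupe idea described above, in Python. The post does not say which hash, which "other bits" of the doc, or which selection rule are used, so everything here is an assumption for illustration: SHA-1 over whitespace-normalized, lowercased content, and "keep the first doc seen" as the rule. The names content_fingerprint and pick_unique are hypothetical.

```python
import hashlib


def content_fingerprint(text):
    """Hash the page content (assumed normalization: collapse whitespace, lowercase)."""
    normalized = " ".join(text.split()).lower()
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


def pick_unique(docs):
    """Keep the first document seen for each fingerprint (assumed selection rule)."""
    chosen = {}
    for doc in docs:
        key = content_fingerprint(doc["content"])
        # setdefault leaves an existing entry alone, so later duplicates are dropped.
        chosen.setdefault(key, doc)
    return list(chosen.values())


# Made-up documents: the first two carry the same content under different URLs.
docs = [
    {"url": "http://a.example/page", "content": "Hello   world"},
    {"url": "http://b.example/copy", "content": "hello world"},
    {"url": "http://a.example/other", "content": "Something else"},
]
unique_docs = pick_unique(docs)
```

Only the docs in unique_docs would then be sent on for indexing; a real selection rule would probably prefer one URL or domain over another rather than simply taking the first.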

Anyone use solr admin and Opera?

2009-03-25 Thread ristretto.rb
Hello, I'm a happy Solr user. Thanks for the excellent software!! Hopefully this is a good question; I have indeed looked around the FAQ and Google and such first. I have just switched from Firefox to Opera for web browsing. (Another story) When I use the solr/admin the home page and stats works

Advice on moving from 1.3 to 1.4-dev or trunk?

2009-04-16 Thread ristretto.rb
Hello, I'm using Solr 1.3 with solr.py. We have a basic schema.xml, nothing custom or out of the ordinary. I need the following feature from http://svn.apache.org/repos/asf/lucene/solr/trunk/CHANGES.txt SOLR-911: Add support for multi-select faceting by allowing filters to be tagged
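For anyone reading along, a minimal sketch of what a tagged-filter request looks like, assuming the {!tag=...} and {!ex=...} local-params syntax that SOLR-911 adds on trunk (it is not in 1.3). The helper name, the field "doctype", the value "pdf" and the tag "sel" are all made up for illustration, and this only builds a query string with the standard library rather than going through solr.py.

```python
from urllib.parse import urlencode


def multiselect_facet_params(query, field, selected_value, tag="sel"):
    """Build request params that filter on a field but still facet on it unfiltered."""
    return [
        ("q", query),
        # Tag the filter so a facet command can refer to it by name.
        ("fq", "{!tag=%s}%s:%s" % (tag, field, selected_value)),
        ("facet", "true"),
        # Exclude the tagged filter when counting this facet field.
        ("facet.field", "{!ex=%s}%s" % (tag, field)),
    ]


# Hypothetical field and value names.
params = multiselect_facet_params("*:*", "doctype", "pdf")
query_string = urlencode(params)
```

The resulting query_string can be appended to the select handler URL; the facet counts for the tagged field then ignore the user's current selection, which is the multi-select behaviour.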

Fwd: Advice on moving from 1.3 to 1.4-dev or trunk?

2009-04-16 Thread ristretto.rb
From: ristretto.rb Date: Fri, Apr 17, 2009 at 11:16 AM Subject: Advice on moving from 1.3 to 1.4-dev or trunk? To: solr-user@lucene.apache.org Hello, I'm using Solr 1.3 with solr.py. We have a basic schema.xml, nothing custom or out of the ordinary. I need the following feature from

Re: Seattle / PNW Hadoop + Lucene User Group?

2009-04-20 Thread ristretto.rb
Beer h, I'm in New Zealand, so probably can't make it, but it sounds tempting. cheers gene On Tue, Apr 21, 2009 at 11:28 AM, Bradford Stephens wrote: > Thanks for the responses, everyone. Where shall we host? My company > can offer space in our building in Factoria, but it's not exactly

facet results in order of rank

2009-04-23 Thread ristretto.rb
Hello, Is it possible to order the facet results on some ranking score? I've had a look at the facet.sort param, (http://wiki.apache.org/solr/SimpleFacetParameters#head-569f93fb24ec41b061e37c702203c99d8853d5f1) but that seems to order the facet either by count or by index value (in my case alphabe
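To make the two existing orderings concrete, here is a small sketch that builds a facet request with facet.sort. It assumes a Solr version that accepts the string values "count" and "index" for that parameter; the helper name and the field "category" are hypothetical. There is no value for ordering by ranking score, which is the gap the question is about.

```python
from urllib.parse import urlencode

# The two orderings described on the wiki page: by count or by index value.
ALLOWED_SORTS = ("count", "index")


def facet_query_string(query, field, sort="count"):
    """Build a facet request ordered by count or by index value."""
    if sort not in ALLOWED_SORTS:
        raise ValueError("facet.sort must be one of %s" % (ALLOWED_SORTS,))
    return urlencode([
        ("q", query),
        ("facet", "true"),
        ("facet.field", field),
        ("facet.sort", sort),
    ])


# Hypothetical query and field name.
by_count = facet_query_string("solr", "category")
by_index = facet_query_string("solr", "category", sort="index")
```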

Re: facet results in order of rank

2009-04-26 Thread ristretto.rb
Apr 24, 2009 at 6:55 PM, ristretto.rb wrote: > Hello, > > Is it possible to order the facet results on some ranking score? > I've had a look at the facet.sort param, > (http://wiki.apache.org/solr/SimpleFacetParameters#head-569f93fb24ec41b061e37c702203c99d8853d5f1) > but t

Re: facet results in order of rank

2009-04-30 Thread ristretto.rb
ear up my post at all? > > Perhaps this wouldn't be too hard for me to implement. I have lots of > Java experience, but no experience with Lucene or Solr code. > thoughts? > > thanks > gene > > > > > On Tue, Apr 28, 2009 at 10:56 AM, Shalin Shekhar Mangar

Re: facet results in order of rank

2009-04-30 Thread ristretto.rb
Thanks for the reply. Hopefully I'll get more, and turn this into a mini project I can commit back to the project, or at least make available to anyone who'd like the functionality. Of course, if I'm the only one who cares, it could be a long road. :) gene On Fri, May 1, 2009 at 9:41 AM, En