document support for file system crawling

2006-08-29 Thread Bruno
Hi there, browsing through the message threads, I tried to find a trail addressing file system crawls. I want to implement enterprise search over a networked filesystem, crawling all sorts of documents, such as HTML, DOC, PPT and PDF. Nutch provides plugins enabling it to read proprietary format

Add doc limit - Follow Up

2006-08-29 Thread sangraal aiken
Hey guys, you might remember a bunch of emails going back and forth between me and the very helpful Solr folks a few weeks back. I just wanted to let you know what I've learned about the problem in the last week or so. The problem was that I would run into a hard limit of how many documents I c

RE: acts_as_solr

2006-08-29 Thread Brian Lucas
Let's create it as a top-level directory, solely because it might give people a small head start in Solr evaluation and getting things off the ground (less navigation around the tree to get started). If there are any problems, we can always revert to /contrib/clients. B

Re: acts_as_solr

2006-08-29 Thread Mike Klaas
On 8/29/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: Maybe ... "contrib" in the "Java Lucene" project sense, however, is all Java code; I would imagine that if someone wrote a Perl utility to deal with index files, it would not make sense to put it in the Lucene "contrib" directory for that reason .

Re: acts_as_solr

2006-08-29 Thread Chris Hostetter
: > > perhaps a top level "clients" directory with this going in clients/ruby ?
: > Pardon me for chiming in, but this is a very good idea. I would also
: > suggest that Java clients should also go in here.
: Might this fit better under a contrib/ umbrella? This would more
: closely mod

Re: acts_as_solr

2006-08-29 Thread Mike Klaas
On 8/29/06, WHIRLYCOTT <[EMAIL PROTECTED]> wrote: On Aug 29, 2006, at 4:12 PM, Chris Hostetter wrote: > perhaps a top level "clients" directory with this going in clients/ruby ? Pardon me for chiming in, but this is a very good idea. I would also suggest that Java clients should also go in

Re: acts_as_solr

2006-08-29 Thread WHIRLYCOTT
On Aug 29, 2006, at 4:12 PM, Chris Hostetter wrote: > perhaps a top level "clients" directory with this going in clients/ruby ? Pardon me for chiming in, but this is a very good idea. I would also suggest that Java clients should also go in here. phil.

Re: acts_as_solr

2006-08-29 Thread Chris Hostetter
: I've spent a few hours tinkering with a Ruby ActiveRecord plugin to
: index, delete, and search models fronted by a database into Solr.
I don't know crap about Ruby, but that looks pretty cool.
: I'd like to commit this to the Solr repository. Any objections?
: commit, what directory should

Re: Simplest way to load a custom RequestHandler in Jetty?

2006-08-29 Thread Chris Hostetter
: I don't think Jetty supports hot reload (correct me if I'm wrong).
I wasn't even trying to hot reload -- I just wanted a way to specify in the configs (before I start Jetty) that it should include my extra jar in the webapp's classpath.
: > suggestions on how to do it cleanly? (ie: without deco

Re: Simplest way to load a custom RequestHandler in Jetty?

2006-08-29 Thread Andrew May
Erik Hatcher wrote:
> Why not deconstruct the WAR?
Alternatively, you can just update the WAR directly, which is what I'm doing to add in a custom TokenFilter I wrote:
Here's the Jetty FAQ entry on Hot-Deploy: http://jetty.mortbay.org/jetty5/faq/faq_s_200-General_t_HotDeploy.htm