Hi there,
Browsing through the message threads, I tried to find a trail addressing file
system crawls. I want to implement an enterprise search over a networked
filesystem, crawling all sorts of documents, such as HTML, DOC, PPT and PDF.
Nutch provides plugins enabling it to read proprietary format
Hey guys,
You might remember a bunch of emails going back and forth between me and the
very helpful Solr folks a few weeks back. I just wanted to let you know
about what I've learned about the problem in the last week or so.
The problem was that I would run into a hard limit on how many documents I
c
Let's create it as a top-level directory solely because it might give people
a small head-start in SOLR evaluation and getting things off the ground
(less navigation around the tree to get started). If there are any
problems, we can always move it back to /contrib/clients.
B
-----Original Message-----
On 8/29/06, Chris Hostetter <[EMAIL PROTECTED]> wrote:
Maybe ... "contrib" in the "Java Lucene" project sense however is all java
code, i would imagine that if someone wrote a perl utility to deal with
index files it would not make sense to put it in the Lucene "contrib"
directory for that reason.
: > > perhaps a top level "clients" directory with this going in clients/
: > > ruby ?
: > Pardon me for chiming in, but this is a very good idea. I would also
: > suggest that Java clients should also go in here.
: Might this fit better under a contrib/ umbrella? This would more
: closely mod
On 8/29/06, WHIRLYCOTT <[EMAIL PROTECTED]> wrote:
On Aug 29, 2006, at 4:12 PM, Chris Hostetter wrote:
> perhaps a top level "clients" directory with this going in clients/
> ruby ?
Pardon me for chiming in, but this is a very good idea. I would also
suggest that Java clients should also go in
On Aug 29, 2006, at 4:12 PM, Chris Hostetter wrote:
perhaps a top level "clients" directory with this going in clients/
ruby ?
Pardon me for chiming in, but this is a very good idea. I would also
suggest that Java clients should also go in here.
phil.
--
: I've spent a few hours tinkering with a Ruby ActiveRecord plugin to
: index, delete, and search models fronted by a database into Solr.
I don't know crap about Ruby, but that looks pretty cool.
: I'd like to commit this to the Solr repository. Any objections?
: commit, what directory should
: I don't think Jetty supports hot reload (correct me if I'm wrong).
I wasn't even trying to hot reload -- i just wanted a way to specify in
the configs (before i start Jetty) that it should include my extra jar in
the webapp's classpath.
: > suggestions on how to do it cleanly? (ie: without deco
Erik Hatcher wrote:
> Why not deconstruct the WAR?
Alternatively, you can just update the WAR directly, which is what I'm doing
to add in a custom TokenFilter I wrote:
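
For anyone following along, here is a minimal sketch of what updating a WAR in place could look like. It is not the command from the message above, which is not shown here. It only relies on a WAR being a zip archive and on jars under WEB-INF/lib being on the webapp's classpath. The function name add_jar_to_war and the file names in the usage note are hypothetical.

```python
import zipfile


def add_jar_to_war(war_path, jar_path, jar_name):
    """Append a jar to WEB-INF/lib inside an existing WAR file.

    Hypothetical helper for illustration only. A WAR is a zip archive,
    so the standard zipfile module can open it in append mode.
    """
    entry = "WEB-INF/lib/" + jar_name
    with zipfile.ZipFile(war_path, "a") as war:
        # Refuse to add a second entry with the same name.
        if entry in war.namelist():
            raise ValueError(f"{entry} already exists in {war_path}")
        # Store the jar under WEB-INF/lib so the container loads it
        # with the rest of the webapp's libraries.
        war.write(jar_path, arcname=entry)
    return entry
```

Usage would be something like add_jar_to_war("solr.war", "my-filter.jar", "my-filter.jar") before starting Jetty, assuming those two files exist in the current directory.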
Here's the Jetty FAQ entry on Hot-Deploy:
http://jetty.mortbay.org/jetty5/faq/faq_s_200-General_t_HotDeploy.htm