Hi,

I posted related question into to Nutch-user yesterday. Here is the
post: Crawling
MOSS 2007 content using Nutch via GSA
connector<http://www.nabble.com/Crawling-MOSS-2007-content-using-Nutch-via-GSA-connector-td16874191.html>

My specific situation if as folows:
We are deploying MOSS 2007 which includes its own search server. However, we
found that the search is lacking in some areas and solution requires
additional expenses on HW or SW. Thus we are evaluating alternatives. GSA is
one of them. But after I saw a presentation from technical guys on GSA I
thought myself that Nutch could do the same (or even better in terms of term
boosting for example :-).
GSA is able to use connectors for external datasources and for Share Point
there is sharepoint connector which is written in Java and is Apache
licenced. This connector can crawl document links out of MOSS 2007 and push
them into GSA which is then responsible for crawling. I wonder if I am able
to use sharepoint connector to get the list of URLs which I can then crawl
and index by Nutch. Is there any chance that using Solr make sanse in such
scenario? Is Solr more convenient for such job?

I have no experience with Solr. I think I just understand basic concept:
Solr is a search server which can accept document in XML via HTTP. So I
don't see a match with my use case because I would have to download all
those documents from MOSS on my own and convert them into XML prior to
sending to Solr. Am I correct?

Regards,
Lukas

On Fri, Apr 25, 2008 at 3:42 AM, Otis Gospodnetic <
[EMAIL PROTECTED]> wrote:

> Ask me in about a month.  I will likely be converting one *very* large and
> well-known organization from the expensive GSA to Solr.... if that's what
> you are asking about.
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
> ----- Original Message ----
> > From: Jon Baer <[EMAIL PROTECTED]>
> > To: solr-user@lucene.apache.org
> > Sent: Thursday, April 24, 2008 8:03:19 PM
> > Subject: GSA <-> Solr
> >
> > Hi,
> >
> > Going to try to persuade my employer to switch away some functions,
> > maybe all from the GSA black box to Solr and was trying to find some
> > (any?) case studies where this was done ...
> >
> > Also what is the similar function to a "KeyMatch" in Solr?  Is it
> > elevate.xml?
> >
> > BTW, have been testing the DataImportHandler w/ MultiCore and it works
> > very nicely.
> >
> > Thanks!
> >
> > - Jon
>
>


-- 
http://blog.lukas-vlcek.com/

Reply via email to