Hi, I posted related question into to Nutch-user yesterday. Here is the post: Crawling MOSS 2007 content using Nutch via GSA connector<http://www.nabble.com/Crawling-MOSS-2007-content-using-Nutch-via-GSA-connector-td16874191.html>
My specific situation if as folows: We are deploying MOSS 2007 which includes its own search server. However, we found that the search is lacking in some areas and solution requires additional expenses on HW or SW. Thus we are evaluating alternatives. GSA is one of them. But after I saw a presentation from technical guys on GSA I thought myself that Nutch could do the same (or even better in terms of term boosting for example :-). GSA is able to use connectors for external datasources and for Share Point there is sharepoint connector which is written in Java and is Apache licenced. This connector can crawl document links out of MOSS 2007 and push them into GSA which is then responsible for crawling. I wonder if I am able to use sharepoint connector to get the list of URLs which I can then crawl and index by Nutch. Is there any chance that using Solr make sanse in such scenario? Is Solr more convenient for such job? I have no experience with Solr. I think I just understand basic concept: Solr is a search server which can accept document in XML via HTTP. So I don't see a match with my use case because I would have to download all those documents from MOSS on my own and convert them into XML prior to sending to Solr. Am I correct? Regards, Lukas On Fri, Apr 25, 2008 at 3:42 AM, Otis Gospodnetic < [EMAIL PROTECTED]> wrote: > Ask me in about a month. I will likely be converting one *very* large and > well-known organization from the expensive GSA to Solr.... if that's what > you are asking about. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > ----- Original Message ---- > > From: Jon Baer <[EMAIL PROTECTED]> > > To: solr-user@lucene.apache.org > > Sent: Thursday, April 24, 2008 8:03:19 PM > > Subject: GSA <-> Solr > > > > Hi, > > > > Going to try to persuade my employer to switch away some functions, > > maybe all from the GSA black box to Solr and was trying to find some > > (any?) case studies where this was done ... > > > > Also what is the similar function to a "KeyMatch" in Solr? Is it > > elevate.xml? > > > > BTW, have been testing the DataImportHandler w/ MultiCore and it works > > very nicely. > > > > Thanks! > > > > - Jon > > -- http://blog.lukas-vlcek.com/