Lukas,

From your description, this looks like a Nutch job, not Solr (no crawling component), though one can also use Nutch with Solr now.
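
To make the "no crawling component" point concrete: Solr only indexes what is sent to it, so some other program has to fetch the documents and turn them into Solr's XML update message. Below is a minimal Python sketch of that last step, not a complete solution. The field names (id, title, tag), the helper names build_solr_add_xml and post_to_solr, and whatever update URL is passed in are assumptions for illustration; real field names have to match the Solr schema in use.

```python
import urllib.request
import xml.etree.ElementTree as ET


def build_solr_add_xml(docs):
    """Build a Solr XML <add> message from a list of {field: value} dicts.

    Field names are hypothetical here; they must exist in the Solr schema.
    """
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            # A list value becomes repeated <field> elements (multi-valued field).
            values = value if isinstance(value, list) else [value]
            for v in values:
                field = ET.SubElement(doc_el, "field", name=name)
                field.text = str(v)  # ElementTree escapes &, < and > on output
    return ET.tostring(add, encoding="unicode")


def post_to_solr(xml_body, update_url):
    """POST an XML update message to a Solr update handler.

    update_url is an assumption supplied by the caller; the documents only
    become searchable after a separate <commit/> message is sent.
    """
    request = urllib.request.Request(
        update_url,
        data=xml_body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

Everything upstream of build_solr_add_xml (getting the URLs out of MOSS, downloading the files, extracting text from them) is the part Nutch already does, which is why it looks like the better fit for this job.
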

I can't share the reasons, unfortunately. But from a personal standpoint, I've seen GSA and it's not all that impressive. It costs a pile of money, and the price seems to rise exponentially with the number of documents.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
> From: Lukas Vlcek <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Friday, April 25, 2008 12:31:13 AM
> Subject: Re: GSA <-> Solr
>
> BTW: Do you think you can share reasons why your clients are switching from
> GSA? I am very interested in their experience.
>
> On Fri, Apr 25, 2008 at 6:29 AM, Lukas Vlcek wrote:
>
> > Hi,
> >
> > I posted a related question to Nutch-user yesterday. Here is the post:
> > Crawling MOSS 2007 content using Nutch via GSA connector
> >
> > My specific situation is as follows:
> > We are deploying MOSS 2007, which includes its own search server. However,
> > we found that the search is lacking in some areas and the solution requires
> > additional expenses on HW or SW. Thus we are evaluating alternatives. GSA is
> > one of them. But after I saw a presentation from technical guys on GSA I
> > thought to myself that Nutch could do the same (or even better in terms of
> > term boosting, for example :-).
> > GSA is able to use connectors for external data sources, and for SharePoint
> > there is a SharePoint connector which is written in Java and is
> > Apache-licensed. This connector can crawl document links out of MOSS 2007
> > and push them into GSA, which is then responsible for crawling. I wonder if
> > I am able to use the SharePoint connector to get the list of URLs which I
> > can then crawl and index with Nutch. Is there any chance that using Solr
> > makes sense in such a scenario? Is Solr more convenient for such a job?
> >
> > I have no experience with Solr. I think I just understand the basic concept:
> > Solr is a search server which can accept documents in XML via HTTP. So I
> > don't see a match with my use case because I would have to download all
> > those documents from MOSS on my own and convert them into XML prior to
> > sending them to Solr. Am I correct?
> >
> > Regards,
> > Lukas
> >
> >
> > On Fri, Apr 25, 2008 at 3:42 AM, Otis Gospodnetic <
> > [EMAIL PROTECTED]> wrote:
> >
> >> Ask me in about a month. I will likely be converting one *very* large and
> >> well-known organization from the expensive GSA to Solr.... if that's what
> >> you are asking about.
> >>
> >> Otis
> >> --
> >> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> >>
> >>
> >> ----- Original Message ----
> >> > From: Jon Baer
> >> > To: solr-user@lucene.apache.org
> >> > Sent: Thursday, April 24, 2008 8:03:19 PM
> >> > Subject: GSA <-> Solr
> >> >
> >> > Hi,
> >> >
> >> > Going to try to persuade my employer to switch away some functions,
> >> > maybe all, from the GSA black box to Solr and was trying to find some
> >> > (any?) case studies where this was done ...
> >> >
> >> > Also what is the similar function to a "KeyMatch" in Solr? Is it
> >> > elevate.xml?
> >> >
> >> > BTW, have been testing the DataImportHandler w/ MultiCore and it works
> >> > very nicely.
> >> >
> >> > Thanks!
> >> >
> >> > - Jon
> >>
> >>
> >
> >
> > --
> > http://blog.lukas-vlcek.com/
>
>
> --
> http://blog.lukas-vlcek.com/