Just to comment on that last part: "There are FAST deployments out there which run on dozens, in some cases hundreds of nodes serving multiple terabyte size indexes and achieving hundreds of queries per seconds."
There are also a lot of Lucene or Solr deployments with similar setups - I've worked on and with some decent-sized search clusters with 50-100 search servers, fault tolerance, high input rates, and high query rates. It's all doable with Lucene and Solr, it's just that not everything comes out of the box, so you have to either find somebody to help out or do it in house. You pay developers to build exactly what you need as opposed to paying FAST a pile of $$$ based on, say, the query rate. Or you don't pay $250,000 for a Google Appliance that can index only 10MM docs (this is a real number). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- From: Nuno Leitao <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Tuesday, December 11, 2007 8:50:37 PM Subject: Re: SOLR X FAST FAST uses two pipelines - an ingestion pipeline (for document feeding) and a query pipeline which are fully programmable (i.e., you can customize it fully). At ingestion time you typically prepare documents for indexing (tokenize, character normalize, lemmatize, clean up text, perform entity extraction for facets, perform static boosting for certain documents, etc.), while at query time you can expand synonyms, and do other general query side tasks (not unlike Solr). Horizontal scalability means the ability to cluster your search engine across a large number of servers, so you can scale up on the number of documents, queries, crawls, etc. There are FAST deployments out there which run on dozens, in some cases hundreds of nodes serving multiple terabyte size indexes and achieving hundreds of queries per seconds. Yet again, if your requirements are relatively simple then Lucene might do the job just fine. Hope this helps. --Nuno. On 12 Dec 2007, at 01:33, Ravish Bhagdev wrote: > Could you please elaborate on what you mean by ingestion pipeline and > horizontal scalability? I apologize if this is a stupid question > everyone else on the forum is familiar with. > > Thanks, > Ravi > > On Dec 12, 2007 1:09 AM, Nuno Leitao <[EMAIL PROTECTED]> wrote: >> Depends, if you are looking for a small sized index (gigabytes rather >> than dozens or hundreds of gigabytes or terabytes) with relatively >> simple requirements (a few facets, simple tokenization, English only >> linguistics, etc.) Solr is likely to be appropriate for most cases. >> >> FAST however gives you great horizontal scalability, out of the box >> linguistics for many languages (including CJK), contextual and scope >> searching, a web, file and database crawler, programmable ingestion >> pipeline, etc. >> >> Regards. >> >> --Nuno >> >> >> On 11 Dec 2007, at 22:09, William Silva wrote: >> >>> Hi, >>> How is the best way to compare SOLR and FAST Search ? >>> Thanks, >>> William. >> >>