subject:"Re\: Faceting on text fields"

Re: Faceting on text fields

2009-06-12 Thread Stanislaw Osinski

Hi, Sorry for being late to the party, let me try to clear some doubts about Carrot2. Do you know under what circumstances or application should we cluster the > whole corpus of documents vs just the search results? I think it depends on what you're trying to achieve. If you'd like to give the

Re: Faceting on text fields

2009-06-11 Thread Jeffrey Tiong

atext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message > > From: Jeffrey Tiong > > To: solr-user@lucene.apache.org > > Sent: Friday, June 12, 2009 12:44:55 AM > > Subject: Re: Faceting on text fields > > > > Hi all, &

Re: Faceting on text fields

2009-06-11 Thread Otis Gospodnetic

ematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Jeffrey Tiong > To: solr-user@lucene.apache.org > Sent: Friday, June 12, 2009 12:44:55 AM > Subject: Re: Faceting on text fields > > Hi all, > > We are thinking of using the carrot

Re: Faceting on text fields

2009-06-11 Thread Jeffrey Tiong

Hi all, We are thinking of using the carrot clustering too. But we saw that carrot maybe can only cluster up to 1000 search snippets. Does anyone know how can we cluster snippets that is much more than that ? (maybe in the million range?) And what is the difference between mahout and carrot? Tha

Re: Faceting on text fields

2009-06-11 Thread Michael Ludwig

Yao Ge schrieb: BTW, Carrot2 has a very impressive Clustering Workbench (based on eclipse) that has built-in integration with Solr. If you have a Solr service running, it is a just a matter of point the workbench to it. The clustering results and visualization are amazing. (http://project.carrot2

Re: Faceting on text fields

2009-06-11 Thread Yao Ge

exactly which algo is used under >> the hood. >> >> Otis >> -- >> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch >> >> >> >> - Original Message >>> From: Michael Ludwig >>> To: solr-user@lucene.apac

Re: Faceting on text fields

2009-06-11 Thread Yao Ge

/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: Michael Ludwig >> To: solr-user@lucene.apache.org >> Sent: Wednesday, June 10, 2009 9:41:54 AM >> Subject: Re: Faceting on text fields >> >> Otis Gospodnetic schrieb: >> &g

Re: Faceting on text fields

2009-06-10 Thread Otis Gospodnetic

o is used under the hood. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Michael Ludwig > To: solr-user@lucene.apache.org > Sent: Wednesday, June 10, 2009 9:41:54 AM > Subject: Re: Faceting on text fields > > Otis Gosp

Re: Faceting on text fields

2009-06-10 Thread Yao Ge

> Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: Yao Ge >> To: solr-user@lucene.apache.org >> Sent: Tuesday, June 9, 2009 3:46:13 PM >> Subject: Re: Faceting on text fields >>

Re: Faceting on text fields

2009-06-10 Thread Michael Ludwig

Otis Gospodnetic schrieb: Solr can already cluster top N hits using Carrot2: http://wiki.apache.org/solr/ClusteringComponent Would it be fair to say that clustering as detailed on the page you're referring to is a kind of dynamic faceting? The faceting not being done based on distinct values o

Re: Faceting on text fields

2009-06-10 Thread Michael Ludwig

Yonik Seeley schrieb: Yep, all that sounds right. An additional optimization counts terms for the documents *not* in the set when the base set is over half the size of the index. Cool :-) Thanks for confirming my assumptions! Michael Ludwig

Re: Faceting on text fields

2009-06-09 Thread Otis Gospodnetic

thing like http://www.sematext.com/product-key-phrase-extractor.html could also be used. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Yao Ge > To: solr-user@lucene.apache.org > Sent: Tuesday, June 9, 2009 3:46:13 PM > Subject:

Re: Faceting on text fields

2009-06-09 Thread Yao Ge

Michael, Thanks for the update! I definitely need to get a 1.4 build see if it makes a difference. BTW, maybe instead of using faceting for text mining/clustering/visualization purpose, we can build a separate feature in SOLR for this. Many of commercial search engines I have experiences with (G

Re: Faceting on text fields

2009-06-09 Thread Yonik Seeley

Yep, all that sounds right. An additional optimization counts terms for the documents *not* in the set when the base set is over half the size of the index. -Yonik http://www.lucidimagination.com On Tue, Jun 9, 2009 at 1:01 PM, Michael Ludwig wrote: > Yonik, > > from your initial comment for SO

Re: Faceting on text fields

2009-06-09 Thread Michael Ludwig

Yonik Seeley schrieb: Are you using Solr 1.3? You might want to try the latest 1.4 test build - faceting has changed a lot. I found two significant changes (but there may well be more): [#SOLR-911] multi-select facets - ASF JIRA https://issues.apache.org/jira/browse/SOLR-911 Yao, it sounds l

Re: Faceting on text fields

2009-06-09 Thread Michael Ludwig

Yao Ge schrieb: The facet query is considerably slower comparing to other facets from structured database fields (with highly repeated values). What I found interesting is that even after I constrained search results to just a few hunderd hits using other facets, these text facets are still very

Re: Faceting on text fields

2009-06-04 Thread Yao Ge

Yes. I am using 1.3. When is 1.4 due for release? Yonik Seeley-2 wrote: > > Are you using Solr 1.3? > You might want to try the latest 1.4 test build - faceting has changed a > lot. > > -Yonik > http://www.lucidimagination.com > > On Thu, Jun 4, 2009 at 12:01 PM, Yao Ge wrote: >> >> I am ind

Re: Faceting on text fields

2009-06-04 Thread Yonik Seeley

Are you using Solr 1.3? You might want to try the latest 1.4 test build - faceting has changed a lot. -Yonik http://www.lucidimagination.com On Thu, Jun 4, 2009 at 12:01 PM, Yao Ge wrote: > > I am index a database with over 1 millions rows. Two of fields contain > unstructured text but size of e

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

Re: Faceting on text fields

18 matches

Site Navigation

Mail list logo

Footer information