Hi,
Sorry for being late to the party, let me try to clear some doubts about
Carrot2.
Do you know under what circumstances or application should we cluster the
> whole corpus of documents vs just the search results?
I think it depends on what you're trying to achieve. If you'd like to give
the
atext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message
> > From: Jeffrey Tiong
> > To: solr-user@lucene.apache.org
> > Sent: Friday, June 12, 2009 12:44:55 AM
> > Subject: Re: Faceting on text fields
> >
> > Hi all,
&
ematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Jeffrey Tiong
> To: solr-user@lucene.apache.org
> Sent: Friday, June 12, 2009 12:44:55 AM
> Subject: Re: Faceting on text fields
>
> Hi all,
>
> We are thinking of using the carrot
Hi all,
We are thinking of using the carrot clustering too. But we saw that carrot
maybe can only cluster up to 1000 search snippets. Does anyone know how can
we cluster snippets that is much more than that ? (maybe in the million
range?)
And what is the difference between mahout and carrot?
Tha
Yao Ge schrieb:
BTW, Carrot2 has a very impressive Clustering Workbench (based on
eclipse) that has built-in integration with Solr. If you have a Solr
service running, it is a just a matter of point the workbench to it.
The clustering results and visualization are amazing.
(http://project.carrot2
exactly which algo is used under
>> the hood.
>>
>> Otis
>> --
>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>>
>>
>>
>> - Original Message
>>> From: Michael Ludwig
>>> To: solr-user@lucene.apac
/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message
>> From: Michael Ludwig
>> To: solr-user@lucene.apache.org
>> Sent: Wednesday, June 10, 2009 9:41:54 AM
>> Subject: Re: Faceting on text fields
>>
>> Otis Gospodnetic schrieb:
>> &g
o is used under the hood.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Michael Ludwig
> To: solr-user@lucene.apache.org
> Sent: Wednesday, June 10, 2009 9:41:54 AM
> Subject: Re: Faceting on text fields
>
> Otis Gosp
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message
>> From: Yao Ge
>> To: solr-user@lucene.apache.org
>> Sent: Tuesday, June 9, 2009 3:46:13 PM
>> Subject: Re: Faceting on text fields
>>
Otis Gospodnetic schrieb:
Solr can already cluster top N hits using Carrot2:
http://wiki.apache.org/solr/ClusteringComponent
Would it be fair to say that clustering as detailed on the page you're
referring to is a kind of dynamic faceting? The faceting not being done
based on distinct values o
Yonik Seeley schrieb:
Yep, all that sounds right.
An additional optimization counts terms for the documents *not* in the
set when the base set is over half the size of the index.
Cool :-) Thanks for confirming my assumptions!
Michael Ludwig
thing like http://www.sematext.com/product-key-phrase-extractor.html could
also be used.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Yao Ge
> To: solr-user@lucene.apache.org
> Sent: Tuesday, June 9, 2009 3:46:13 PM
> Subject:
Michael,
Thanks for the update! I definitely need to get a 1.4 build see if it makes
a difference.
BTW, maybe instead of using faceting for text
mining/clustering/visualization purpose, we can build a separate feature in
SOLR for this. Many of commercial search engines I have experiences with
(G
Yep, all that sounds right.
An additional optimization counts terms for the documents *not* in the
set when the base set is over half the size of the index.
-Yonik
http://www.lucidimagination.com
On Tue, Jun 9, 2009 at 1:01 PM, Michael Ludwig wrote:
> Yonik,
>
> from your initial comment for SO
Yonik Seeley schrieb:
Are you using Solr 1.3?
You might want to try the latest 1.4 test build -
faceting has changed a lot.
I found two significant changes (but there may well be more):
[#SOLR-911] multi-select facets - ASF JIRA
https://issues.apache.org/jira/browse/SOLR-911
Yao,
it sounds l
Yao Ge schrieb:
The facet query is considerably slower comparing to other facets from
structured database fields (with highly repeated values). What I found
interesting is that even after I constrained search results to just a
few hunderd hits using other facets, these text facets are still very
Yes. I am using 1.3. When is 1.4 due for release?
Yonik Seeley-2 wrote:
>
> Are you using Solr 1.3?
> You might want to try the latest 1.4 test build - faceting has changed a
> lot.
>
> -Yonik
> http://www.lucidimagination.com
>
> On Thu, Jun 4, 2009 at 12:01 PM, Yao Ge wrote:
>>
>> I am ind
Are you using Solr 1.3?
You might want to try the latest 1.4 test build - faceting has changed a lot.
-Yonik
http://www.lucidimagination.com
On Thu, Jun 4, 2009 at 12:01 PM, Yao Ge wrote:
>
> I am index a database with over 1 millions rows. Two of fields contain
> unstructured text but size of e
18 matches
Mail list logo