Hi,

You're right -- currently Carrot2 clustering ignores the Solr analysis
chain and uses its own pipeline. It is possible to integrate with Solr's
analysis components to some extent, see the discussion here:
https://issues.apache.org/jira/browse/SOLR-2917.

Staszek


> > Hi
> > Trying to use carrot2 for clustering search results. I have it setup
> except it seems to treat the field as regular text instead of applying some
> custom filters I have.
> >
> > So my schema says something like
> > <field name="title" type="ic_text" indexed="true" stored="true"
> omitNorms="true"/>
> > <field name="content" type="ic_text" indexed="true" stored="true"
> compressed="true"/>
> >
> > ic_text is our internal fieldtype with some custom analysers that strip
> out certain special characters from the text.
> >
> > My solrconfig has something like this setup in our default search
> handler.
> > <bool name="clustering">true</bool>
> > <str name="clustering.engine">default</str>
> > <bool name="clustering.results">true</bool>
> > <!-- The title field -->
> > <str name="carrot.title">title</str>
> > <!-- The field to cluster on -->
> > <str name="carrot.snippet">content</str>
> >
> > In my search results, I see clusters but the labels on these clusters
> have the special characters in them - which means that the clustering must
> be running on raw text and not on the "ic_text" field.
> > Can someone let me know if this is the default setup and if there is a
> way to fix this ?
> > Thanks !
> > Geetu
> >
>

Reply via email to