That's fine as far as it goes, but the input is multi-valued, so merely splitting on commas produces separate tokens within a value, not separate values.

Given:

{"worked_company_name":["Dell","Microsoft,Facebook"] }

The regex would produce the equivalent of:

{"worked_company_name":["Dell","Microsoft Facebook"] }

Or is the desired goal:

{"worked_company_name":["Dell","Microsoft","Facebook"] }

Or, something else?
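
If the goal is the third form (each company as its own value), one option is to split the values on the client before the document is sent to Solr. The following is only a minimal sketch under that assumption; the helper name split_values is hypothetical and is not part of Solr or of the original question.

```python
import json


def split_values(values, sep=","):
    """Flatten a multi-valued field, splitting every value on sep."""
    out = []
    for value in values:
        for part in value.split(sep):
            part = part.strip()
            if part:  # skip empty pieces, e.g. from "a,,b"
                out.append(part)
    return out


doc = json.loads('{"worked_company_name":["Dell","Microsoft,Facebook"]}')
doc["worked_company_name"] = split_values(doc["worked_company_name"])
print(json.dumps(doc))
# {"worked_company_name": ["Dell", "Microsoft", "Facebook"]}
```

With this approach both the stored and the indexed values are separate entries, whereas an analyzer-only solution changes the indexed tokens but leaves the stored value "Microsoft,Facebook" as it was.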

-- Jack Krupansky

-----Original Message-----
From: Keswani, Nitin - BLS CTR
Sent: Thursday, March 21, 2013 2:54 PM
To: solr-user@lucene.apache.org ; anurag.k...@gmail.com
Subject: RE: CommaSplit and query is free text search

You can use the type defined below to split on commas. Please note that I have not used any additional filters. Depending on your requirements, you might want to add more filters for further processing after tokenisation:

<!-- A text field that only splits on comma for exact matching of words -->
<fieldType name="text_split_on_comma" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.PatternTokenizerFactory" pattern="," />
  </analyzer>
</fieldType>
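
As a rough illustration of what this analyzer does to each value at index time, the Python sketch below imitates a split on the pattern ",". It is an approximation for reasoning about the output, not Solr code; the function name tokenize_on_comma is made up for the example, and dropping empty tokens is an assumption about the tokenizer's behaviour.

```python
import re


def tokenize_on_comma(value):
    # Rough stand-in for a pattern tokenizer with pattern=",".
    # Assumption: empty pieces (e.g. from "a,,b") are not emitted as tokens.
    return [tok for tok in re.split(",", value) if tok]


values = ["Dell", "Microsoft,Facebook"]
tokens_per_value = [tokenize_on_comma(v) for v in values]
print(tokens_per_value)
# [['Dell'], ['Microsoft', 'Facebook']]
```

Note that nothing here trims whitespace, so a value such as "Microsoft, Facebook" would yield the token " Facebook" with a leading space. That is one case where an extra filter after the tokenizer, for example a trim filter, may be worth adding.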

Thanks.

Regards,

Nitin Keswani


-----Original Message-----
From: anurag.jain [mailto:anurag.k...@gmail.com]
Sent: Thursday, March 21, 2013 10:10 AM
To: solr-user@lucene.apache.org
Subject: CommaSplit and query is free text search

I have a field named worked_company_name.

In the JSON input I am giving values like:

{
"worked_company_name":["Dell","Microsoft,Facebook"]
}

The data is very dirty, meaning a single value may itself contain commas etc.


<field name="worked_company_name" type="comaSplitwithsearch" indexed="true" stored="true"/>


So can you please tell me how the type comaSplitwithsearch should be defined?


thanks






--
View this message in context: http://lucene.472066.n3.nabble.com/CommaSplit-and-query-is-free-text-search-tp4049734.html
Sent from the Solr - User mailing list archive at Nabble.com.
