Are you suggesting something like this:
<fieldtype name="splitField" class="solr.TextField" sortMissingLast="true" omitNorms="true"> <multi-field> <tokenizer class="solr.RegexTokenizerFactory" pattern=";" /> <filter class="solr.TrimFilterFactory" /> </multi-field> <analyzer> ... </analyzer> </fieldtype> On 1/21/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 1/21/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: > Is there any easy way to split a string into a multi-field on the server: From an indexing perspective, yes... just assign a tokenizer that splits on ';' I don't think we currently have such as configurable Tokenizer though. The (hypothetical) tokenizer could even add a positionIncrement, emulating multiple fields exactly from the indexing perspective. Then you could follow it with the newly added TrimFilter to trim whitespace. From the stored field perspective, you get back what you put in. To be nice and general, perhaps it could be regex based like String.split() -Yonik > given: > <add> > <field name="subject">subject1; subject2; subject- 3</field> > </doc> > > I would like: > <add> > <field name="subject">subject1</field> > <field name="subject">subject2</field> > <field name="subject">subject- 3</field> > </doc> > > Thanks for any pointers > > ryan >