Re: Splitting fields

2011-05-31 Thread Markus Jelsma
I'd go for this option as well. The example update processor can't make it more easier and it's a very flexible approach. Judging from the patch in SOLR-2105 it should still work with the current 3.2 branch. https://issues.apache.org/jira/browse/SOLR-2105 > Hi, > > Write a custom UpdateProces

Re: Splitting fields

2011-05-31 Thread Jan Høydahl
Hi, Write a custom UpdateProcessor, which gives you full control of the SolrDocument prior to indexing. The best would be if you write a generic FieldSplitterProcessor which is configurable on what field to take as input, what delimiter or regex to split on and finally what fields to write the

Re: Splitting fields

2011-05-31 Thread Erick Erickson
Hmmm, I wonder if a custom Transformer would help here? It can be inserted into a chain of transformers in DIH. Essentially, you subclass Transformer and implement one method (transformRow) and do anything you want. The input is a map of that is a simple representation of the Solr document. You c