On Sep 6, 2007, at 2:56 PM, Matthew Runo wrote:
On a related note, it'd be great if we could set up a series of
transformations to be applied to data as it comes into Solr, before
it is indexed. I guess a custom tokenizer might be the best way to
do this, though?
i.e.:
- Client POSTs the data
- Data is cleaned up, properly escaped, etc.
- Data is then passed to whatever tokenizer we want to use.
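For the single-field case, something like the following could hang
that clean-up step onto the analyzer chain. This is only a rough
sketch against the Lucene 2.x TokenFilter API of the time;
CleanupFilter is a hypothetical name, and the trim/lower-case body
is a stand-in for whatever clean-up is actually wanted:

import java.io.IOException;

import org.apache.lucene.analysis.Token;
import org.apache.lucene.analysis.TokenFilter;
import org.apache.lucene.analysis.TokenStream;

// Hypothetical filter: "cleans up" each token before it is indexed.
public class CleanupFilter extends TokenFilter {
  public CleanupFilter(TokenStream input) {
    super(input);
  }

  // Pull the next token from the wrapped stream and return a
  // cleaned-up copy; null signals the end of the stream.
  public Token next() throws IOException {
    Token t = input.next();
    if (t == null) return null;
    String cleaned = t.termText().trim().toLowerCase();
    return new Token(cleaned, t.startOffset(), t.endOffset());
  }
}

Hooked up through a factory in schema.xml, a filter like this would
run on every value of the fields it is attached to, which is also
exactly why it can only operate one field at a time.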
Solr should do more work on the data-indexing side, to allow clients
to hand documents to it more easily and have them modified along the
way. XML isn't necessarily the prettiest format, and we already see
other formats being supported with the CSV and rich-document indexing.
A custom tokenizer or token filter makes great sense for single-field
data transformation, but parsing request data into multiple fields
must be done at a higher level.
Erik
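As a minimal illustration of that higher-level parsing (purely a
hypothetical client-side sketch, not anything Solr provides), one raw
value could be split into several fields before the document is ever
handed to Solr:

import java.util.HashMap;
import java.util.Map;

// Hypothetical pre-processing step: split a raw "Last, First" name
// into separate fields before building the Solr document.
public class FieldSplitter {
  public static Map<String, String> parse(String rawName) {
    Map<String, String> fields = new HashMap<String, String>();
    int comma = rawName.indexOf(',');
    if (comma >= 0) {
      fields.put("last_name", rawName.substring(0, comma).trim());
      fields.put("first_name", rawName.substring(comma + 1).trim());
    } else {
      fields.put("name", rawName.trim());
    }
    return fields;
  }
}

Each entry in the returned map would become its own <field/> in the
update XML, which is the kind of one-value-to-many-fields
transformation a per-field tokenizer can't express.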