2010/12/6 Ahmet Arslan :
>
> If you are already using DIH,
> http://wiki.apache.org/solr/DataImportHandler#HTMLStripTransformer can do
> what you want.
Indeed it can. Many thanks.
> - I have zero control over what is stored in the database
> - using the Solr XML update protocol i could probably
> transform the
> data before sending it
> - ... but I'd much rather continue using DataImportHandler
> to access
> the database
If you are already using DIH,
http://wiki.apache.or
Hi,
You can create a custom update request processor [1] to strip unwanted input
as it is about to enter the index.
[1]: http://wiki.apache.org/solr/UpdateRequestProcessor
Cheers,
On Monday 06 December 2010 17:36:09 Emmanuel Bégué wrote:
> Hello,
>
> Is it possible to manipulate the value of
Hello,
Is it possible to manipulate the value of a field before it is stored?
I'm indexing a database where some field contain raw HTML, including
named character entities.
Using solr.HTMLStripCharFilterFactory on the index analyzer, results
in this HTML being correctly stripped, and named chara