(11/02/16 8:03), Tanner Postert wrote:
I am using the data import handler and using the HTMLStripTransformer
doesn't seem to be working either.
I've changed webtitle and webdescription to not by copied from title and
description in the schema.xml file then set them both to just but duplicates
of title and description in the data importer query:
<document name="items">
<entity dataSource="db" name="item" transformer="HTMLStripTransformer"
query="select
title as title,
title as webtitle,
description as description,
description as webdescription
FROM ...>
<field column="webtitle" stripHTML="true" />
<field column="webdescription" stripHTML="true" />
</entity>
</document>
Just for input (I'm not sure that I could help you), I'm using
HTMLStripTransformer
with PlainTextEntityProcessor and it works fine with me:
<dataConfig>
<dataSource name="f" type="URLDataSource" encoding="UTF-8"
baseUrl="http://lucene.apache.org/"/>
<document>
<entity name="solr" processor="PlainTextEntityProcessor"
transformer="HTMLStripTransformer"
dataSource="f" url="solr/">
<field column="plainText" name="text" stripHTML="true"/>
</entity>
</document>
</dataConfig>
Koji
--
http://www.rondhuit.com/en/