I updated my data importer.

I used to have:

<field column="webtitle" stripHTML="true" />
<field column="webdescription" stripHTML="true" />

which wasn't working. But I changed that to

<field column="webtitle" name="webtitle" stripHTML="true" />
<field column="webdescription" name="webdescription" stripHTML="true" />

and it is working fine.

On Tue, Feb 15, 2011 at 5:50 PM, Koji Sekiguchi <k...@r.email.ne.jp> wrote:

> (11/02/16 8:03), Tanner Postert wrote:
>
>> I am using the data import handler and using the HTMLStripTransformer
>> doesn't seem to be working either.
>>
>> I've changed webtitle and webdescription to not by copied from title and
>> description in the schema.xml file then set them both to just but
>> duplicates
>> of title and description in the data importer query:
>>
>> <document name="items">
>>  <entity dataSource="db" name="item" transformer="HTMLStripTransformer"
>>     query="select
>>       title as title,
>>       title as webtitle,
>>       description as description,
>>       description as webdescription
>>       FROM ...>
>>       <field column="webtitle" stripHTML="true" />
>>       <field column="webdescription" stripHTML="true" />
>>  </entity>
>> </document>
>>
>>
> Just for input (I'm not sure that I could help you), I'm using
> HTMLStripTransformer
> with PlainTextEntityProcessor and it works fine with me:
>
> <dataConfig>
>  <dataSource name="f" type="URLDataSource" encoding="UTF-8"
>              baseUrl="http://lucene.apache.org/"/>
>  <document>
>    <entity name="solr" processor="PlainTextEntityProcessor"
> transformer="HTMLStripTransformer"
>            dataSource="f" url="solr/">
>      <field column="plainText" name="text" stripHTML="true"/>
>    </entity>
>  </document>
> </dataConfig>
>
> Koji
> --
> http://www.rondhuit.com/en/
>

Reply via email to