On 15 March 2013 20:16, Luis <reneonta...@gmail.com> wrote:
>
> Hi Gora, thank you for your reply.  I am not using any commands, I just go
> on
> the Solr dashboard, db > Dataimport and execute a full-import.

In that case, you are using the DataImportHandler, not the
ExtractingRequestHandler, even though you have both handlers
defined.

>
> *My schema.xml looks like this:*
[...]

This cannot be the complete schema.xml, but in any case,
the issue probably does not lie there.

> *My db-data-config.xml looks like this:*
>
> <dataConfig>
>         <dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver"
>                      url="jdbc:mysql://localhost:3306/opspedia"
>                      user="username" batchSize="-1" name="mysql" />
>         <dataSource type="BinURLDataSource" name="bin"/>
>
>         <document>
>
>                 <entity onError="skip" name="fileSourcePaths"
> rootEntity="true"
> dataSource="mysql" query="select ID, urlpath from myposts"
>                 deltaImportQuery="SELECT * FROM myposts WHERE id =
> '${dataimporter.delta.id}'"
>                   deltaQuery="SELECT id FROM myposts WHERE last_modified >
> '${dataimporter.last_index_time}'">
>
>                         <entity name="tika-test"
> processor="TikaEntityProcessor" fileName=".*"
> recursive="true" url="${fileSourcePaths.guid}" format="text"
> dataSource="bin" >

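Your query on the root entity, fileSourcePaths, only selects ID
and urlpath, but the url attribute in the nested TikaEntityProcessor
entity refers to ${fileSourcePaths.guid}, a column that the query
never selects. That variable will presumably resolve to an empty
value, leaving Tika with no URL to fetch.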
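
As a rough sketch of one possible fix, assuming that the urlpath
column is what actually holds the location of each file (I cannot
tell that from here), the nested entity could point at the column
that is selected. The delta attributes and field mappings are left
out for brevity and would stay as in your existing config:

```xml
<entity name="fileSourcePaths" dataSource="mysql" rootEntity="true"
        onError="skip" query="select ID, urlpath from myposts">
  <!-- Assumption: urlpath holds the file location. The url attribute
       now refers to a column that the query above actually selects. -->
  <entity name="tika-test" processor="TikaEntityProcessor"
          url="${fileSourcePaths.urlpath}" format="text" dataSource="bin">
    <!-- field mappings unchanged from your existing config -->
  </entity>
</entity>
```

Alternatively, if myposts really does have a guid column containing
the location, adding guid to the select list of the root query
should have the same effect.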

Regards,
Gora
