On 15 March 2013 20:16, Luis <reneonta...@gmail.com> wrote:
>
> Hi Gora, thank you for your reply. I am not using any commands, I just go on
> the Solr dashboard, db > Dataimport and execute a full-import.

In that case, you are not using the ExtractingRequestHandler, but using
the DataImportHandler, even though you have both handlers defined.

> *My schema.xml looks like this:*
[...]

This cannot be the complete schema.xml, but in any case, the issue
probably does not lie there.

> *My db-data-config.xml looks like this:*
>
> <dataConfig>
>     <dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver"
>         url="jdbc:mysql://localhost:3306/opspedia"
>         user="username" batchSize="-1" name="mysql" />
>     <dataSource type="BinURLDataSource" name="bin"/>
>
>     <document>
>
>         <entity onError="skip" name="fileSourcePaths"
>             rootEntity="true"
>             dataSource="mysql" query="select ID, urlpath from myposts"
>             deltaImportQuery="SELECT * FROM myposts WHERE id =
> '${dataimporter.delta.id}'"
>             deltaQuery="SELECT id FROM myposts WHERE last_modified >
> '${dataimporter.last_index_time}'">
>
>             <entity name="tika-test"
>                 processor="TikaEntityProcessor" fileName=".*"
>                 recursive="true" url="${fileSourcePaths.guid}" format="text"
>                 dataSource="bin"
>

Your query on the root entity, fileSourcePaths, only selects ID and
urlpath, but the url attribute in the nested TikaEntityProcessor
refers to ${fileSourcePaths.guid} which has never been selected.

Regards,
Gora
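
To illustrate the mismatch between the selected columns and the url attribute, here is a minimal sketch of one possible fix. It assumes that the urlpath column holds a full URL that the BinURLDataSource can fetch; that is a guess, not something confirmed in this thread. If the document location actually lives in a guid column, the alternative would be to add guid to the select and leave the url attribute alone. The delta queries and the fileName/recursive attributes are left out for brevity, and the target field name "text" is hypothetical and would have to exist in schema.xml.

```xml
<dataConfig>
  <dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver"
      url="jdbc:mysql://localhost:3306/opspedia"
      user="username" batchSize="-1" name="mysql" />
  <dataSource type="BinURLDataSource" name="bin" />

  <document>
    <!-- Root entity: any column referenced by a nested entity must be selected here -->
    <entity onError="skip" name="fileSourcePaths" rootEntity="true"
        dataSource="mysql" query="select ID, urlpath from myposts">

      <!-- Assumption: urlpath is a full URL readable by BinURLDataSource -->
      <entity name="tika-test" processor="TikaEntityProcessor"
          url="${fileSourcePaths.urlpath}" format="text"
          dataSource="bin">
        <!-- TikaEntityProcessor exposes the extracted body as the column "text";
             the Solr field name on the right is a placeholder -->
        <field column="text" name="text" />
      </entity>
    </entity>
  </document>
</dataConfig>
```

The point of the sketch is only that the variable in url="${fileSourcePaths...}" resolves against the columns returned by the parent entity's query, so the two have to agree.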