I can't seem to get delta-imports to work with a FileDataSource DIH
full-import works fine.
delta-import always imports nothing, no error. I can add a new file or
change an existing one, no joy.
my requesthandler declaration
<requestHandler name="/update/tei"
class="org.apache.solr.handler.dataimport.DataImportHandler">
<lst name="defaults">
<str name="config">tei-config.xml</str>
</lst>
</requestHandler>
my dataimporthandler config(truncated for brevity)
<dataConfig>
<dataSource name="myfilereader" type="FileDataSource"/>
<document name="mbepapers">
<entity name="jcurrent"
processor="FileListEntityProcessor"
fileName=".*xml"
rootEntity="true"
dataSource="null"
baseDir="/home/admin/mbepp/tei2html/build/search"
onError="continue"
pk="id"
deltaQuery="select id from item where last_modified >
'${dataimporter.last_index_time}'"
>
<field column="id" name="id" />
<entity name="meta"
dataSource="myfilereader"
processor="XPathEntityProcessor"
url="${jcurrent.fileAbsolutePath}"
stream="false"
forEach="/TEI/teiHeader/fileDesc"
xsl="xslt/meta.xsl"
>
<field column="title" xpath="/TEI/teiHeader//title" />
<field column="author" xpath="/TEI/teiHeader//author" />
<field column="publisher" xpath="/TEI/teiHeader//publisher" />
<field column="accession" xpath="/TEI/teiHeader//idno" />
<field column="origin" xpath="/TEI/teiHeader//origin" />
<field column="note" xpath="/TEI/teiHeader//note"
flatten="true" />
</entity>
</entity>
</document>
</dataConfig>
on a full-import I get all the documents and all the fields indexed and
searchable.
I'd really like to be able to do a delta import to reduce the server
load of re-indexing the entire data source when a doc changes or is added.
thanks,
Scott
--
Even if you're on the right track, you'll get run over if you just sit
there.
Will Rogers