I am constructing documents from a JDBC datasource and an HTTP datasource (see the data-config file below). My problem is that I cannot know at index time whether a particular HTTP URL is available, so I need DIH to continue processing even if the HTTP location returns a 404. Setting onError="continue" does not appear to help in this case. Should it?

<dataConfig>
  <dataSource type="JdbcDataSource" name="db"
              driver="oracle.jdbc.driver.OracleDriver"
              url="jdbc:oracle:thin:@?????" user="???" password="???"/>
  <dataSource type="HttpDataSource" name="http"/>
  <document name="resources">
    <entity name="metadata" dataSource="db" pk="RESOURCEID"
            query="select * from ????" onError="continue">
      <entity name="xmltext" dataSource="http"
              processor="XPathEntityProcessor"
              url="http://???.com/${metadata.RESOURCEID}.xml"
              forEach="/content" onError="continue">
        <field column="FULLTEXT" xpath="/content"/>
      </entity>
    </entity>
  </document>
</dataConfig>

Thanks,
Nathan