I am constructing documents from a JDBC datasource and an HTTP datasource
(see the data-config file below). My problem is that I cannot know whether a
particular HTTP URL will be available at index time, so I need DIH to
continue processing even if the HTTP location returns a 404.
onError="continue" does not appear to help in this case. Should it?

<dataConfig>

    <dataSource type="JdbcDataSource" name="db"
driver="oracle.jdbc.driver.OracleDriver" url="jdbc:oracle:thin:@?????"
user="???" password="???"/>

    <dataSource type="HttpDataSource" name="http"/>
 
    <document name="resources"> 

        <entity name="metadata" dataSource="db" pk="RESOURCEID"
query="select * from ????" onError="continue">

        <entity name="xmltext"
url="http://???.com/${metadata.RESOURCEID}.xml" forEach="/content"
dataSource="http" processor="XPathEntityProcessor" onError="continue">

            <field column="FULLTEXT" xpath="/content"/>

        </entity>

        </entity>

    </document> 

</dataConfig>

Thanks,
Nathan
