onError="continue" should help. Which version of DIH are you using? onError is a Solr 1.4 feature.
--Noble
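For reference, here is a minimal sketch of the inner entity from the quoted config with onError set. It assumes a DIH build from Solr 1.4 or later. As far as I know, the attribute accepts abort (the default), skip, and continue. The URL placeholder is left exactly as in the original message.

```xml
<!-- Sketch only: assumes Solr 1.4 DIH, where onError was introduced. -->
<!-- onError="abort"    : stop the import on the first error (default) -->
<!-- onError="skip"     : drop the current document and move on -->
<!-- onError="continue" : carry on as if the error had not happened -->
<entity name="xmltext"
        dataSource="http"
        processor="XPathEntityProcessor"
        url="http://???.com/${metadata.RESOURCEID}.xml"
        forEach="/content"
        onError="continue">
  <field column="FULLTEXT" xpath="/content"/>
</entity>
```

On an older DIH the attribute would presumably have no effect, which would match the behavior described in the quoted message.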
On Thu, Jan 29, 2009 at 5:04 AM, Nathan Adams <na...@umich.edu> wrote:
> I am constructing documents from a JDBC datasource and a HTTP datasource
> (see data-config file below.) My problem is that I cannot know if a
> particular HTTP URL is available at index time, so I need DIH to
> continue processing even if the HTTP location returns a 404.
> onError="continue" does not appear to help in this case. Should it?
>
> <dataConfig>
>
>   <dataSource type="JdbcDataSource" name="db"
>       driver="oracle.jdbc.driver.OracleDriver" url="jdbc:oracle:thin:@?????"
>       user="???" password="???"/>
>
>   <dataSource type="HttpDataSource" name="http"/>
>
>   <document name="resources">
>
>     <entity name="metadata" dataSource="db" pk="RESOURCEID"
>         query="select * from ????" onError="continue">
>
>       <entity name="xmltext"
>           url="http://???.com/${metadata.RESOURCEID}.xml" forEach="/content"
>           dataSource="http" processor="XPathEntityProcessor" onError="continue">
>
>         <field column="FULLTEXT" xpath="/content"/>
>
>       </entity>
>
>     </entity>
>
>   </document>
>
> </dataConfig>
>
> Thanks,
> Nathan
>

--
--Noble Paul