I'm new to Solr so I expect that I'm making some newbie error. I run my data-config.xml file through the DataImportHandler Development Console and I see all the results of the xpath queries scroll past in the debug pane. It processes all the content without reporting an error in the terminal window that runs Jetty, or in the Dev Console itself.

This is what appears at the end of the debug pane:

---<snip>---
<str name="status">idle</str>
<str name="importResponse">Configuration Re-loaded sucessfully</str>
<lst name="statusMessages">
  <str name="Total Requests made to DataSource">0</str>
  <str name="Total Rows Fetched">5322</str>
  <str name="Total Documents Skipped">0</str>
  <str name="Full Dump Started">2010-06-15 21:51:14</str>
  <str name="Total Documents Processed">0</str>
  <str name="Time taken ">0:0:0.71</str>
</lst>
---<snip>---

It fetches 5322 rows but doesn't process any documents and doesn't populate the index. Any suggestions would be appreciated.

/peter

Here's my data-config.xml file:

<dataConfig>
  <dataSource name="myFileReader" type="FileDataSource" />
  <document>
    <entity name="f"
            processor="FileListEntityProcessor"
            baseDir="/Users/pascal/tools/apache-solr-1.4.0/example/example-DIH/timetext"
            fileName=".*ttml"
            recursive="true"
            rootEntity="false"
            dataSource="null">
      <entity name="transcript"
              pk="tid"
              url="${f.fileAbsolutePath}"
              processor="XPathEntityProcessor"
              forEach="/tt/body/div/p"
              rootEntity="false"
              dataSource="myFileReader"
              onError="continue">
        <field column="begin" xpath="/tt/body/div/p/@begin" />
        <field column="dur" xpath="/tt/body/div/p/@dur" />
        <field column="end" xpath="/tt/body/div/p/@end" />
        <field column="phrase" xpath="/tt/body/div/p" />
        <field column="tid" xpath="/tt/body/div/p/@xml:id" />
      </entity>
    </entity>
  </document>
</dataConfig>