Replacing the contents of
solr-4.3.0\example\example-DIH\solr\rss\conf\rss-data-config.xml
with
<dataConfig>
  <dataSource type="URLDataSource" />
  <document>
    <entity name="beautybooks88 " pk="link"
            url="http://beautybooks88.blogspot.com/feeds/posts/default"
            processor="XPathEntityProcessor" forEach="/feed/entry"
            transformer="DateFormatTransformer">
      <field column="source" xpath="/feed/title"
             commonField="true" />
      <field column="source-link" xpath="/feed/link[@rel='self']/@href"
             commonField="true" />
      <field column="title" xpath="/feed/entry/title" />
      <field column="link"
             xpath="/feed/entry/link[@rel='self']/@href" />
      <field column="description" xpath="/feed/entry/content"
             stripHTML="true"/>
      <field column="creator" xpath="/feed/entry/author" />
      <field column="item-subject"
             xpath="/feed/entry/category/@term"/>
      <field column="date" xpath="/feed/entry/updated"
             dateTimeFormat="yyyy-MM-dd'T'HH:mm:ss" />
    </entity>
  </document>
</dataConfig>
and running the full dataimport from
http://localhost:8983/solr/#/rss/dataimport//dataimport results in an error.
1) How could I have found the cause faster than I did? Which log files
should I have looked into?
2) If I remove the first occurrence of /@href above (in the "source-link"
field), the import succeeds. Note that the same pattern works for the
"link" column. Why is that?
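To make 2) concrete, this is the one field definition after removing /@href. It is only a sketch of the variant that imports without error for me; all other lines of the config stay exactly as shown above.

```xml
<!-- "source-link" with the trailing /@href removed from the xpath;
     with this single change the full import succeeds -->
<field column="source-link" xpath="/feed/link[@rel='self']"
       commonField="true" />
```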
Best regards and thanks in advance
Hans-Peter