Hi all,
I’m having a frustrating time getting Solr 6.4.0 to recognize my DIH
(DataImportHandler) config. I’m using the Oracle Java 8 JDK on Ubuntu 14.04.
The DIH .jar file appears to be loading correctly, and there are no errors in
the Solr logs. The Solr admin UI just says “Sorry, no dataimport-handler
defined”.
My config files are listed below. Can anyone spot any mistakes here?
Many thanks,
Chris
##### solrconfig.xml ######
<lib dir="${solr.install.dir:../../../..}/dist/" regex=".*dataimporthandler-.*\.jar" />
…
<requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler">
  <lst name="defaults">
    <str name="config">DIH-data-config.xml</str>
  </lst>
</requestHandler>
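One way to check whether the handler is registered at all, independently of the admin UI, would be to request its status over HTTP. Below is a minimal Python sketch of that check. The function names are made up for illustration, and it assumes the handler is reachable under the core at the path given by its `name` attribute.

```python
import json
import urllib.error
import urllib.parse
import urllib.request


def dih_status_url(base_url, core, handler="/dataimport"):
    """Build the status URL for a DataImportHandler registered under `handler`."""
    query = urllib.parse.urlencode({"command": "status", "wt": "json"})
    return f"{base_url.rstrip('/')}/{core}{handler}?{query}"


def check_dih(base_url, core):
    """Return the parsed status response, or None if the handler does not answer."""
    url = dih_status_url(base_url, core)
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return json.load(resp)
    except urllib.error.URLError as exc:
        # HTTPError (for example a 404 when nothing is registered at this
        # path) is a subclass of URLError, so it is caught here as well.
        print(f"handler not reachable at {url}: {exc}")
        return None
```

It could be called as `check_dih("http://localhost:8983/solr", "mycore")`, where the host, port, and core name `mycore` are placeholders rather than values from my setup.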
##### DIH-data-config.xml (in the same dir as solrconfig.xml) ######
<dataConfig>
  <dataSource name='file-reader' type="FileDataSource"/>
  <document>
    <!-- this outer processor generates a list of files satisfying the
         conditions specified in the attributes -->
    <entity name="f" processor="FileListEntityProcessor"
            fileName=".*xml"
            newerThan="'NOW-5YEARS'"
            recursive="true"
            rootEntity="false"
            dataSource="null"
            baseDir="/home/bodl-tei-svc/sites/bodl-tei-svc/var/data/tolkein_tei">
      <!-- this processor extracts content using Xpath from each file found -->
      <entity name="tei" processor="XPathEntityProcessor"
              forEach="/TEI" url="${f.fileAbsolutePath}"
              transformer="RegexTransformer">
        <field column="manuscript_title" name="manuscript_title"
               xpath="/TEI/teiHeader/fileDesc/titleStmt/title"/>
        <field column="repository" name="repository"
               xpath="/TEI/teiHeader/fileDesc/publicationStmt/publisher"/>
        <field column="id" name="id"
               xpath="/TEI/teiHeader/fileDesc/sourceDesc/msDesc/msIdentifier/altIdentifier/idno"/>
      </entity>
    </entity>
  </document>
</dataConfig>
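To rule out a plain XML problem in the file above, a quick well-formedness check could look like the sketch below. It only confirms that the text parses and lists each entity with the data source it names; it says nothing about whether Solr accepts the config. The function name and the cut-down sample string are just for illustration.

```python
import xml.etree.ElementTree as ET


def summarize_dih_config(xml_text):
    """Parse a DIH config and return (data source names, entity summaries).

    Raises xml.etree.ElementTree.ParseError if the text is not well-formed XML.
    """
    root = ET.fromstring(xml_text)
    if root.tag != "dataConfig":
        raise ValueError(f"unexpected root element: {root.tag}")
    # Names declared on <dataSource> elements (None if the attribute is absent).
    sources = [ds.get("name") for ds in root.iter("dataSource")]
    # One (name, processor, dataSource) tuple per <entity>, in document order.
    entities = [
        (e.get("name"), e.get("processor"), e.get("dataSource"))
        for e in root.iter("entity")
    ]
    return sources, entities


# Cut-down stand-in for the real file; in practice, read
# DIH-data-config.xml from disk and pass its contents instead.
sample = """
<dataConfig>
  <dataSource name='file-reader' type="FileDataSource"/>
  <document>
    <entity name="f" processor="FileListEntityProcessor" dataSource="null">
      <entity name="tei" processor="XPathEntityProcessor"/>
    </entity>
  </document>
</dataConfig>
"""
print(summarize_dih_config(sample))
```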
--
Chris Rogers
Digital Projects Manager
Bodleian Digital Library Systems and Services