> I am using DIH to index the local file system, but the file
> path, size and lastmodified fields are not stored. In
> schema.xml I defined:
>
> <fields>
>   <field name="title" type="string" indexed="true" stored="true"/>
>   <field name="author" type="string" indexed="true" stored="true"/>
>   <!--<field name="text" type="text" indexed="true" stored="true" />
>   liang added-->
>   <field name="path" type="string" indexed="true" stored="true"/>
>   <field name="size" type="long" indexed="true" stored="true"/>
>   <field name="lastmodified" type="date" indexed="true" stored="true"/>
> </fields>
>
>
> And also defined tika-data-config.xml:
>
> <dataConfig>
>   <dataSource name="bin" type="BinFileDataSource" />
>   <document>
>     <entity name="f" dataSource="null" rootEntity="false"
>             processor="FileListEntityProcessor"
>             baseDir="E:/my_project/ecmkit/infotouch"
>             fileName=".*\.(DOC)|(PDF)|(pdf)|(doc)|(docx)|(ppt)"
>             onError="skip"
>             recursive="true">
>       <entity name="tika-test" dataSource="bin"
>               processor="TikaEntityProcessor"
>               url="${f.fileAbsolutePath}" format="text"
>               onError="skip">
>         <field column="Author" name="author" meta="true"/>
>         <field column="title" name="title" meta="true"/>
>         <!-- <field column="text" name="text"/> -->
>         <field column="fileAbsolutePath" name="path" />
>         <field column="fileSize" name="size" />
>         <field column="fileLastModified" name="lastmodified" />
>       </entity>
>     </entity>
>   </document>
> </dataConfig>
>
>
> The Solr version is 3.5. Any idea?
The implicit fields fileDir, file, fileAbsolutePath, fileSize and
fileLastModified are generated by the FileListEntityProcessor, not by Tika.
Their <field> mappings therefore belong in the outer entity ("f"), above the
nested TikaEntityProcessor entity, rather than inside it.
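
Something along these lines should work. It is a sketch of your
tika-data-config.xml with only the three file-related <field> mappings moved up
into the "f" entity; everything else, including your fileName pattern, is copied
unchanged from your mail. I have not run it against 3.5, so treat it as a
starting point:

```xml
<dataConfig>
  <dataSource name="bin" type="BinFileDataSource" />
  <document>
    <entity name="f" dataSource="null" rootEntity="false"
            processor="FileListEntityProcessor"
            baseDir="E:/my_project/ecmkit/infotouch"
            fileName=".*\.(DOC)|(PDF)|(pdf)|(doc)|(docx)|(ppt)"
            onError="skip"
            recursive="true">
      <!-- Columns produced by FileListEntityProcessor are mapped here,
           in the entity that generates them -->
      <field column="fileAbsolutePath" name="path" />
      <field column="fileSize" name="size" />
      <field column="fileLastModified" name="lastmodified" />

      <entity name="tika-test" dataSource="bin"
              processor="TikaEntityProcessor"
              url="${f.fileAbsolutePath}" format="text"
              onError="skip">
        <!-- Only Tika metadata/content columns are mapped in this entity -->
        <field column="Author" name="author" meta="true"/>
        <field column="title" name="title" meta="true"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

The schema.xml fields you already defined (path, size, lastmodified) should not
need to change; after reloading the config, run a full-import again and check
whether the three fields now show up in the stored documents.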