Re: file index format

2015-06-14 Thread Frank Ralf
This issue has also already been discussed in the Tika issue queue: "Add method get file extension from MimeTypes" https://issues.apache.org/jira/browse/TIKA-538 And http://svn.apache.org/repos/asf/tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml does support DITA X

Re: file index format

2015-06-14 Thread Frank Ralf
Looks like this has been solved recently in the current dev branch: "SimplePostTool (and thus bin/post) cannot index files with unknown extensions" https://issues.apache.org/jira/browse/SOLR-7546 -- View this message in context: http://lucene.472066.n3.nabble.com/file-index-format-tp4199892p42

Re: file index format

2015-06-14 Thread Frank Ralf
Hi, I face the same problem when trying to index DITA XML files. These are XML files but have the file extension .dita which Solr ignores. According to java -jar post.jar -h only the following file extensions are supported: /-Dfiletypes=[,,...] (default=xml,json,csv,pdf,doc,docx,ppt,pptx,xls