This issue has also already been discussed in the Tika issue queue: "Add method get file extension from MimeTypes" https://issues.apache.org/jira/browse/TIKA-538
And http://svn.apache.org/repos/asf/tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml does support DITA XML file types. I will investigate further and report back. Frank -- View this message in context: http://lucene.472066.n3.nabble.com/file-index-format-tp4199892p4211738.html Sent from the Solr - User mailing list archive at Nabble.com.