Danny McKinney created TIKA-3113:
------------------------------------
Summary: Currently Tika is detecting a .aux file as text/html
Key: TIKA-3113
URL: https://issues.apache.org/jira/browse/TIKA-3113
Project: Tika
Issue Type: Bug
Components: detector
Affects Versions: 1.24
Reporter: Danny McKinney
Attachments: TES.PC.00010363.1.aux
While processing files from an Enron test data set a file with extension aux
was detected to be MediaType of text/html. The file contains elements <Header>
and <Data> but is a type of LaTex file I believe. I am attachingĀ sample
file.[^TES.PC.00010363.1.aux]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)