[ 
https://issues.apache.org/jira/browse/TIKA-3110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134381#comment-17134381
 ] 

Christoph Läubrich commented on TIKA-3110:
------------------------------------------

[~tallison] from an API point of view I think TikaInputstream behaves wrong. 
The 
[javadoc|https://docs.oracle.com/javase/7/docs/api/java/io/InputStream.html#skip(long)]
 states that {quote}The skip method may, for a variety of reasons, end up 
skipping over some smaller number of bytes, possibly 0. This may result from 
any of a number of conditions; reaching end of file before n bytes have been 
skipped is only one possibility{quote} so skipping over EOF is clearly allowed 
and should result in returning the number of bytes skipped (possibly zero), 
IOException is only to be expected if IO operation fails or the stream does not 
support skip at all...

> cannot extract metadata from 7z .tar archive
> --------------------------------------------
>
>                 Key: TIKA-3110
>                 URL: https://issues.apache.org/jira/browse/TIKA-3110
>             Project: Tika
>          Issue Type: Bug
>          Components: mime, parser
>    Affects Versions: 1.24.1
>            Reporter: Alex
>            Priority: Major
>         Attachments: 7ztar.tar
>
>
> When I extracted metadata from .tar archive wich was created by linux bash 
> it's works as I expect but if .tar archive was created by 7z I got an error:
>  TikaException: TIKA-198: Illegal IOException from 
> org.apache.tika.parser.pkg.PackageParser@4d0f2471 
> I created a project on GitHub for your convenience. It includes 2 files and 
> code for play around: [https://github.com/AlexOkayJ/apache-tika-tar-issue.git]
>   



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to