[
https://issues.apache.org/jira/browse/TIKA-3110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134381#comment-17134381
]
Christoph Läubrich commented on TIKA-3110:
------------------------------------------
[~tallison] from an API point of view I think TikaInputstream behaves wrong.
The
[javadoc|https://docs.oracle.com/javase/7/docs/api/java/io/InputStream.html#skip(long)]
states that {quote}The skip method may, for a variety of reasons, end up
skipping over some smaller number of bytes, possibly 0. This may result from
any of a number of conditions; reaching end of file before n bytes have been
skipped is only one possibility{quote} so skipping over EOF is clearly allowed
and should result in returning the number of bytes skipped (possibly zero),
IOException is only to be expected if IO operation fails or the stream does not
support skip at all...
> cannot extract metadata from 7z .tar archive
> --------------------------------------------
>
> Key: TIKA-3110
> URL: https://issues.apache.org/jira/browse/TIKA-3110
> Project: Tika
> Issue Type: Bug
> Components: mime, parser
> Affects Versions: 1.24.1
> Reporter: Alex
> Priority: Major
> Attachments: 7ztar.tar
>
>
> When I extracted metadata from .tar archive wich was created by linux bash
> it's works as I expect but if .tar archive was created by 7z I got an error:
> TikaException: TIKA-198: Illegal IOException from
> org.apache.tika.parser.pkg.PackageParser@4d0f2471
> I created a project on GitHub for your convenience. It includes 2 files and
> code for play around: [https://github.com/AlexOkayJ/apache-tika-tar-issue.git]
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)