RE: Zip Bomb Exception in HTML File

2017-01-04 Thread Allison, Timothy B.
Tim [1] http://git.net/ml/solr-user.lucene.apache.org/2016-09/msg00561.html [2] https://issues.apache.org/jira/browse/TIKA-2091 -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Wednesday, January 4, 2017 12:20 PM To: solr-user Subject: Re: Zip Bomb Ex

Re: Zip Bomb Exception in HTML File

2017-01-04 Thread Erick Erickson
You might get a more knowledgeable response from the Tika folks, that's really not something Solr controls. Best, Erick On Wed, Jan 4, 2017 at 8:50 AM, wrote: > i get an exception "org.apache.tika.exception.TikaException: > Zip bomb detected! > if i would like to parse a html file - and i thin

Zip Bomb Exception in HTML File

2017-01-04 Thread sn00py
i get an exception "org.apache.tika.exception.TikaException: Zip bomb detected! if i would like to parse a html file - and i think i know why. because there are many many in cascade over 200 divs and span are inside each. Is it correct that there is this limit for html files? ---