Re: processPages bug?

britt fitch Fri, 04 Dec 2015 12:40:27 -0800

Thanks very much for the quick replies!

I think setting startPage & endPage with make it so you correctly only extract 
the pages you want, but on every extraction it will iterate over all pages 
first.

For example, if you have a 100 page document and want to extract page 2 & page 
90, you will iterate over all 100 pages and process page 2, then iterate over 
all 100 pages and process page 90.

The 1.8 version allowed you to pass a single page to be processed. I’m curious 
if that functionality was removed because of an issue or if it was just a bug.

It looks like I can get around this a bit by overriding startPage(PDPage) and 
endPage(PDPage) though.

Thanks again, I really appreciate all your feedback.

Cheers,

Britt

Britt Fitch
Wired Informatics
265 Franklin St Ste 1702
Boston, MA 02110
http://wiredinformatics.com
[email protected]

> On Dec 4, 2015, at 3:07 PM, Tilman Hausherr <[email protected]> wrote:
> 
> Am 04.12.2015 um 20:56 schrieb britt fitch:
>> Awesome, thanks. That takes care of #1 & 2.
>> 
>> For #3, is the check on currentPageNo necessary?
>> Right now processPage must be called from processPages or nothing happens.
>> This has a negative effect for cases like mine where I want to override 
>> processTextPosition and handle different pages or even if you only want to 
>> extract data from particular pages.
> 
> You can set the start and endpage through the setters setStartPage() and 
> setEndPage(). That's the official way to do it.
> 
> Tilman
>

signature.asc
Description: Message signed with OpenPGP using GPGMail

Re: processPages bug?

Reply via email to