Richard Owlett <rowl...@access.net> wrote:
> I'm running Debian 12.8.
> I have a 100+ page PDF document.
> I wish to extract 2 of those pages, each to their own PDF file for
> later editing.
> 
> I'm focusing on poppler-utils as it appears to offer tools for
> current and future goals.
> 
> Doing "pdftotext -layout -f 116 -l 116 TFP2021.pdf jul24-a.txt" comes 
> very close to what I want.
> 
> Having been surrounded by TECO-buffs in the 70's, comparing the
> output of "pdftotext -f 116 -l 116 TFP2021.pdf jul24-b.txt" to the
> above suggests an approach to resolving.

I don't understand the paragraph above, and especially what the mention
of TECO infers?

> It involves being able to edit a *SINGLE* rather than all 100+
> companion pages.
> 
> I tried "pdfseparate -f 116 -l 116 TFP2021.pdf dianostic.pdf" and got
> > Syntax Error (3868069): Missing 'endstream' or incorrect stream
> > length Syntax Error (3557294): Missing 'endstream' or incorrect
> > stream length [multiple repetitions of those 2 lines
> > Syntax Error (3556857): Bad FCHECK in flate stream
> > Syntax Error (3868069): Missing 'endstream' or incorrect stream
> > length Syntax Error (3866517): Bad FCHECK in flate stream  
> 
> How/where do I find interpretation of those?

Good question that I'm not able to help answer, I'm afraid.

But looking at the messages suggests that you PDF file may not be
perfectly formed, so I suggest trying to validate it. There seems to be
no shortage of PDF validators online!

Another approach might be to try using one of the other tools that were
suggested rather than poppler. They may produce clearer error messages.

Reply via email to