On Thu 24 Jul 2025 at 09:13:02 (-0500), David Wright wrote:
> On Thu 24 Jul 2025 at 08:20:33 (-0500), Richard Owlett wrote:
> > I'm running Debian 12.8.
> > I have a 100+ page PDF document.
> > I wish to extract 2 of those pages, each to their own PDF file for
> > later editing.
> >
> > I'm focusing on poppler-utils as it appears to offer tools for current
> > and future goals.
> >
> > Doing "pdftotext -layout -f 116 -l 116 TFP2021.pdf jul24-a.txt" comes
> > very close to what I want.
> >
> > Having been surrounded by TECO-buffs in the 70's, comparing the output
> > of "pdftotext -f 116 -l 116 TFP2021.pdf jul24-b.txt" to the above
> > suggests an approach to resolving.
> >
> > It involves being able to edit a *SINGLE* rather than all 100+
> > companion pages.
> >
> > I tried "pdfseparate -f 116 -l 116 TFP2021.pdf dianostic.pdf" and got
> > > Syntax Error (3868069): Missing 'endstream' or incorrect stream length
> > > Syntax Error (3557294): Missing 'endstream' or incorrect stream length
> > > [multiple repetitions of those 2 lines
> > > Syntax Error (3556857): Bad FCHECK in flate stream
> > > Syntax Error (3868069): Missing 'endstream' or incorrect stream length
> > > Syntax Error (3866517): Bad FCHECK in flate stream
> >
> > How/where do I find interpretation of those?
>
> Why on earth are you trying to debug these errors from either the
> PDF or pdfseparate? What's wrong with its output, apart from its size?
> pdftotext can happily convert dianostic.pdf.
>
> I would point out that pdfseparate produces a 3917970-byte PDF,
> whereas pdftk's output is 39666 bytes. Their converted text files
> are identical.
>
> Cheers,
> David.
[2nd attempt]