On 4/15/25 11:01 AM, Kent West wrote:
On Tue, Apr 15, 2025 at 10:32 AM Nicolas George <geo...@nsup.org> wrote:
Richard Owlett (HE12025-04-15):
I don't know how to approach the problem.
What I would like to end up with is a CSV formatted file containing the
two
left columns of Table A4.14 (pages 106&107) of
[
https://fns-prod.azureedge.us/sites/default/files/resource-files/TFP2021.pdf
].
Suggestions?
Have you tried starting with pdftotext -layout and then adding the CSV
delimiters using a powerful editor. The rectangle selection of Vim might
be useful.
Regards,
--
Nicolas George
Riffing off of Nicolas' suggestion, here's what I would do:
$ pdftotext -f 106 -l 107 TFP2021.pdf TFP2021.txt
As I replied to Nicolas I'll try both that and also a run with the
"-layout" option.
Then open LibreCalc, and File/Open this file. When the import options
window appears, change the selection criteria to "Fixed width", and then in
the "ruler" bar above the text, click where you want a column divider (like
at Columns 39, 60, and 76; just eyeball it. Finish importing the document,
and now you have a spreadsheet with the info you want that should be pretty
easy to massage into the form you want.
Any particularly relavant tutorials?
Thanks.