Hi,
My name is Hanan and I am developing an in-house application that needs to
read a PDF file and extract the text of its tables into a local database.
Of course, the number of rows in the tables may change from time to time.
After reading a lot about PDF as well as PDFBox, I have managed to:
Load a PDF document
Iterate through its pages
My questions are:
1. Is there a way to identify a table in a PDF file?
2. What are the alternatives for extracting only the table data using PDFBox?
3. How can I step through a table?
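
To illustrate what I mean by stepping through a table in question 3, here is a
minimal sketch, not working PDFBox code. It assumes every text fragment on a
page can be obtained together with its x/y coordinates (PDFBox exposes such
coordinates through TextPosition). The TableRows class, the Fragment holder and
the yTolerance parameter are hypothetical names made up for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class TableRows {

    /** Hypothetical holder for one piece of text and its position on the page. */
    public static final class Fragment {
        final String text;
        final float x;
        final float y;

        public Fragment(String text, float x, float y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /**
     * Groups fragments into rows: fragments whose y coordinate lies within
     * yTolerance of the first fragment of a row are treated as the same row.
     * Cells inside each row are ordered left to right by x.
     */
    public static List<List<String>> groupIntoRows(List<Fragment> fragments, float yTolerance) {
        // Sort a copy top to bottom so the caller's list is left untouched.
        List<Fragment> sorted = new ArrayList<>(fragments);
        sorted.sort(Comparator.comparingDouble((Fragment f) -> f.y));

        List<List<Fragment>> rows = new ArrayList<>();
        float rowY = 0f;
        for (Fragment f : sorted) {
            // Start a new row when the fragment is too far below the current row.
            if (rows.isEmpty() || Math.abs(f.y - rowY) > yTolerance) {
                rows.add(new ArrayList<>());
                rowY = f.y;
            }
            rows.get(rows.size() - 1).add(f);
        }

        List<List<String>> result = new ArrayList<>();
        for (List<Fragment> row : rows) {
            // Order the cells of the row left to right.
            row.sort(Comparator.comparingDouble((Fragment f) -> f.x));
            List<String> cells = new ArrayList<>();
            for (Fragment f : row) {
                cells.add(f.text);
            }
            result.add(cells);
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for text positions read from a page.
        List<Fragment> page = Arrays.asList(
                new Fragment("Qty", 120f, 50.0f),
                new Fragment("Item", 20f, 50.4f),
                new Fragment("2", 120f, 70.0f),
                new Fragment("Pen", 20f, 69.8f));
        for (List<String> row : groupIntoRows(page, 2.0f)) {
            System.out.println(String.join(" | ", row));
        }
    }
}
```

With something like this, the number of rows would not need to be known in
advance, which matters because the tables grow and shrink. What the sketch does
not do is decide where a table starts and ends on the page, which is what
question 1 is about.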
Best Regards,
Hanan Harush