Hi,

My name is Hanan and I am developing an in-house application that needs to
read PDF files and extract the text of their tables into a local database.

The number of rows in a table may of course change from one file to the next.

After reading a lot about PDF and PDFBox, I have managed to:

    Load a PDF document
    Iterate through its pages

My questions are:

1. Is there a way to identify a table in a PDF file?

2. What are the options for extracting only the table data using PDFBox?

3. How can I step through a table?
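
To make question 3 more concrete, here is a rough sketch of the kind of row-by-row stepping I have in mind. It is not PDFBox code: Fragment, groupIntoRows and the tolerance value are hypothetical names of my own, and it assumes the text pieces and their x/y positions have already been obtained somehow, with y growing down the page. Pieces that sit at roughly the same height are treated as one row, and each row is read left to right.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class TableRowSketch {

    /** Hypothetical holder for one piece of text and its position on the page. */
    static final class Fragment {
        final String text;
        final float x;
        final float y;

        Fragment(String text, float x, float y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /**
     * Groups fragments into rows. A fragment joins the current row when its y
     * is within the tolerance of the first fragment of that row. Rows come out
     * in order of increasing y, cells in order of increasing x.
     */
    static List<List<String>> groupIntoRows(List<Fragment> fragments, float tolerance) {
        // Sort a copy by vertical position so that rows are visited in order.
        List<Fragment> sorted = new ArrayList<>(fragments);
        sorted.sort(Comparator.comparingDouble((Fragment f) -> f.y));

        List<List<Fragment>> rows = new ArrayList<>();
        float rowY = 0f;
        for (Fragment f : sorted) {
            // Start a new row when the fragment is too far from the current row.
            if (rows.isEmpty() || Math.abs(f.y - rowY) > tolerance) {
                rows.add(new ArrayList<>());
                rowY = f.y;
            }
            rows.get(rows.size() - 1).add(f);
        }

        List<List<String>> result = new ArrayList<>();
        for (List<Fragment> row : rows) {
            // Within a row, order the cells left to right.
            row.sort(Comparator.comparingDouble((Fragment f) -> f.x));
            List<String> cells = new ArrayList<>();
            for (Fragment f : row) {
                cells.add(f.text);
            }
            result.add(cells);
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample positions, only to show the grouping.
        List<Fragment> fragments = new ArrayList<>();
        fragments.add(new Fragment("Qty", 200f, 100f));
        fragments.add(new Fragment("Item", 50f, 100.4f));
        fragments.add(new Fragment("Widget", 50f, 115f));
        fragments.add(new Fragment("3", 200f, 115.2f));

        for (List<String> row : groupIntoRows(fragments, 2f)) {
            System.out.println(String.join(" | ", row));
        }
    }
}
```

This only covers stepping through rows once the positioned text is available. It does not tell me where a table starts or ends on the page, which is what questions 1 and 2 are about.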

 

Best regards,
Hanan Harush

 
