Re: [Tutor] How to Scrape Text from PDFs

2019-06-19 Thread Malcolm Herbert
This isn't a response that's python-related, sorry, I'm still learning python myself, but more questions around the nature of the PDF and where I might start looking to solve the problem, were it mine. The URLs that you are intending to match - are they themselves clickable when you open the P

Re: [Tutor] How to Scrape Text from PDFs

2019-06-17 Thread Alan Gauld via Tutor
On 17/06/2019 06:30, Cem Vardar wrote: > some PDF files that have links for some websites and I need to extract these > links There is a module that may help: PyPDF2 Here is a post showing how to extract the text from a PDF which should include the links. https://stackoverflow.com/questions/34

Re: [Tutor] How to Scrape Text from PDFs

2019-06-17 Thread William Ray Wing via Tutor
> On Jun 17, 2019, at 1:30 AM, Cem Vardar wrote: > > Hello, > > I have been working on assignment that was described to me as “fairly > trivial” for a couple of days now. I have some PDF files that have links for > some websites and I need to extract these links from these files by using >