Re: How to loop over a text file (to remove tags and normalize) using Python

2021-03-10 Thread S Monzur
I am now trying to remove the HTML tags from this text file, and running into issues as mentioned in the previous post.

On Wed, Mar 10, 2021 at 3:46 PM Peter Otten wrote:
> On 10/03/2021 04:35, S Monzur wrote:
> > Thanks! I ended up using beautiful soup to

Re: How to loop over a text file (to remove tags and normalize) using Python

2021-03-09 Thread S Monzur
rsing
> data without using regex? Just a hint ..
>
> On Wed, Mar 10, 2021 at 04:22:19AM +0600, S Monzur wrote:
> > Thank you and apologies! I did not realize how jumbled it was at the
> > receiver's end.
> > The code is now at this site: https://pastebi

Re: How to loop over a text file (to remove tags and normalize) using Python

2021-03-09 Thread S Monzur
usly since it's an unindented mess now and can not be read easily.
>
> On Wed, Mar 10, 2021 at 03:07:14AM +0600, S Monzur wrote:
> >Dear List,
> >
> >Newbie here. I am trying to loop over a text file to remove html tags,
> >punctuation marks, stopwords. I have alrea

How to loop over a text file (to remove tags and normalize) using Python

2021-03-09 Thread S Monzur
Dear List,

Newbie here. I am trying to loop over a text file to remove HTML tags, punctuation marks, and stopwords. I have already used Beautiful Soup (Python 3.8.3) to scrape the text (newspaper articles) from the site. It returns a list that I saved as a file. However, I am not sure how to use a l
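
For readers with the same question, here is a minimal sketch of the kind of loop the post asks about: read the file line by line, strip the HTML tags, drop punctuation, and filter out stopwords. It is not the code from the thread. It assumes the saved file holds one article (or fragment) per line, uses the standard library's html.parser in place of Beautiful Soup so that it runs without extra packages, and the names STOPWORDS, strip_tags, normalize, and normalize_file are made up for illustration. The stopword set is a tiny placeholder.

```python
import string
from html.parser import HTMLParser

# Hypothetical stopword list for illustration only; a real one would be
# much larger and would match the language of the articles.
STOPWORDS = {"a", "an", "the", "is", "of", "and", "to", "in"}

# Translation table that deletes ASCII punctuation characters.
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)


class _TextExtractor(HTMLParser):
    """Collects the text between tags and ignores the tags themselves."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(html_text):
    """Return html_text with all HTML tags removed."""
    parser = _TextExtractor()
    parser.feed(html_text)
    parser.close()  # flush any text still buffered by the parser
    return " ".join(parser.parts)


def normalize(line):
    """Strip tags and punctuation, lowercase, and drop stopwords."""
    text = strip_tags(line).translate(_PUNCT_TABLE)
    return [word for word in text.lower().split() if word not in STOPWORDS]


def normalize_file(path):
    """Apply normalize() to every non-blank line of the file at path."""
    with open(path, encoding="utf-8") as handle:
        return [normalize(line) for line in handle if line.strip()]
```

Calling normalize_file on the saved file would give one list of tokens per non-blank line. Note that string.punctuation only covers ASCII punctuation, so text in other scripts would need its own set of characters to remove. If Beautiful Soup is already installed, BeautifulSoup(line, "html.parser").get_text() could replace strip_tags.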