I am now trying to remove the HTML tags
from this text file, and running into issues as mentioned in the previous
post.
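
For anyone following along, here is a minimal sketch of the cleaning step
described in this thread: strip the HTML tags, drop punctuation, then filter
stopwords, one line of the file at a time. It is not the code from my paste.
It uses the standard library's html.parser rather than Beautiful Soup so it
runs without extra installs, and the names strip_tags, clean_text,
clean_lines and the tiny STOPWORDS set are all made up for illustration. A
real stopword list would be much longer.

```python
import string
from html.parser import HTMLParser

# Hypothetical, deliberately tiny stopword list for illustration only.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}


class _TextExtractor(HTMLParser):
    """Collects text nodes and ignores every tag."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(html):
    """Return the text of `html` with tags removed and whitespace collapsed."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    # Join with a space so text from adjacent tags does not run together.
    return " ".join(" ".join(parser.parts).split())


def clean_text(html, stopwords=STOPWORDS):
    """Strip tags, remove ASCII punctuation, lowercase, drop stopwords."""
    text = strip_tags(html)
    text = text.translate(str.maketrans("", "", string.punctuation))
    return [word for word in text.lower().split() if word not in stopwords]


def clean_lines(lines, stopwords=STOPWORDS):
    """Clean any iterable of strings, e.g. an open text file."""
    return [clean_text(line, stopwords) for line in lines]
```

To run it over a file, something like
`with open("articles.txt", encoding="utf-8") as f: rows = clean_lines(f)`
should work, where articles.txt is a placeholder name. Note that
string.punctuation only covers ASCII punctuation, so other scripts may need
their own punctuation set.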
On Wed, Mar 10, 2021 at 3:46 PM Peter Otten <[email protected]> wrote:
> On 10/03/2021 04:35, S Monzur wrote:
> > Thanks! I ended up using Beautiful Soup to
rsing
> data without using regex? Just a hint.
>
> On Wed, Mar 10, 2021 at 04:22:19AM +0600, S Monzur wrote:
> > Thank you and apologies! I did not realize how jumbled it was at the
> > receiver's end.
> > The code is now at this site: https://pastebi
usly since it's an unindented mess now and cannot be read easily.
>
> On Wed, Mar 10, 2021 at 03:07:14AM +0600, S Monzur wrote:
> >Dear List,
> >
> >Newbie here. I am trying to loop over a text file to remove HTML tags,
> >punctuation marks, and stopwords. I have alrea
Dear List,
Newbie here. I am trying to loop over a text file to remove HTML tags,
punctuation marks, and stopwords. I have already used Beautiful Soup (Python
3.8.3) to scrape the text (newspaper articles) from the site. It returns a
list that I saved as a file. However, I am not sure how to use a l