> I'm trying to do this with the re module - the two tags looks like:
>
> <foo>
>     ...
>     a bunch of text (~1500 lines)
>     ...
> </foo>
>
> I need to identify the first tag, and the second, and
unconditionally
> strip out everything in between those two tags, making it look like:
>
> <foo>
> </foo>

A very simplistic approach uses a flag:

atTag = 0
f = open(...)
while not atTag:
    line = f.readline()
    if line == '<foo>':
      atTag = True
      break
    outFile.write(line) # + \n, I can't remember...
while atTag:
    line = f.readline()
    if line == '</foo>':
       atTag = False
while f:
    outfile.write(f.readline())

This flag approach is sometimes called a sentinal...

I'm sure somebody can find better ways of doing this but I'm
too tired to bother right now! :-(
The sentinel approach will work...

Alan G.

_______________________________________________
Tutor maillist  -  [EMAIL PROTECTED]
http://mail.python.org/mailman/listinfo/tutor

Reply via email to