On 1/25/2011 1:52 PM Juan Jose Del Toro said...
Dear List;

I am looking for a way to extract parts of the text from Word documents (.doc, .docx).

I recently did a project extracting data from Word documents. I used antiword (http://www.winfield.demon.nl/) and called it like this:

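import commands  # Python 2 only; the module was removed in Python 3

def setContent(self):
    # Run antiword on the document, split its output into lines,
    # skip empty lines, and strip whitespace plus a stray character.
    self.content = [
        ii.strip().replace("Ëš", "")
        for ii in commands.getoutput('/usr/local/bin/antiword "%s"' % doc).split("\n")
        if ii
    ]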
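
On Python 3 the commands module no longer exists, so the same idea might look roughly like the sketch below. It assumes Python 3.7 or later and antiword installed at the path used above. The names read_doc and clean_lines are made up for illustration, and the extra character replacement is left out. Unlike the snippet above, it also drops lines that contain only whitespace.

```python
import subprocess

# Path taken from the snippet above; adjust it for your system.
ANTIWORD = "/usr/local/bin/antiword"


def clean_lines(text):
    """Strip each line and drop the ones that end up empty."""
    return [line.strip() for line in text.split("\n") if line.strip()]


def read_doc(path, antiword=ANTIWORD):
    """Run antiword on a .doc file and return its non-empty text lines."""
    result = subprocess.run(
        [antiword, path],      # argument list, so no shell quoting is needed
        capture_output=True,   # collect stdout and stderr
        text=True,             # decode the output to str
        check=True,            # raise CalledProcessError if antiword fails
    )
    return clean_lines(result.stdout)
```

Passing the arguments as a list avoids the quoting problems that the "%s" formatting can run into when a file name contains a double quote.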

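antiword reads the older binary .doc format, so .docx files need a different route. A .docx file is a zip archive whose main text lives in word/document.xml, and a minimal standard-library sketch could pull the paragraphs out as shown here. The name read_docx is hypothetical, and the sketch ignores headers, footers, footnotes and any special handling of tables.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def read_docx(path):
    """Return the non-empty paragraphs of a .docx file as a list of strings."""
    with zipfile.ZipFile(path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph's text is spread over one or more w:t elements.
        text = "".join(node.text or "" for node in para.iter(W_NS + "t")).strip()
        if text:
            paragraphs.append(text)
    return paragraphs
```

The result has the same shape as self.content above (a list of stripped, non-empty strings), so the rest of the extraction code would not need to care which format the file was in.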

Emile

