Re: [Tutor] Parsing Word Docs

2007-03-09 Thread Alan Gauld
"Stephen Nelson-Smith" <[EMAIL PROTECTED]> wrote > Now... I want to convert this to a cgi-script... how do I grab > $QUERY_STRING in python? Have you looked at the cgi module and the web topic guides? The examples are fairly complete. http://wiki.python.org/moin/CgiScripts -- Alan Gauld Aut

Re: [Tutor] Parsing Word Docs

2007-03-09 Thread Stephen Nelson-Smith
On 3/8/07, Tim Golden <[EMAIL PROTECTED]> wrote: > Simplest thing's probably antiword (http://www.winfield.demon.nl/) > and then whatever text-scanning approach you want. I've gone for: #!/usr/bin/env python import glob, os url = "/home/cherp/prddoc" searchstring = "dxpolbl.p" worddocs = [] f

Re: [Tutor] Parsing Word Docs

2007-03-08 Thread Tim Golden
Stephen Nelson-Smith wrote: > Hello all, > > I have a directory containing a load of word documents, say 100 or so. > which is updated every hour. > > I want a cgi script that effectively does a grep on the word docs, and > returns each doc that matches the search term. > > I've had a look at do

[Tutor] Parsing Word Docs

2007-03-08 Thread Stephen Nelson-Smith
Hello all, I have a directory containing a load of word documents, say 100 or so. which is updated every hour. I want a cgi script that effectively does a grep on the word docs, and returns each doc that matches the search term. I've had a look at doing this by looking at each binary file and re